Gene Franean1_3672 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3672 
Symbol 
ID5672038 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4347927 
End bp4348898 
Gene Length972 bp 
Protein Length323 aa 
Translation table11 
GC content77% 
IMG OID641242555 
Productalcohol dehydrogenase 
Protein accessionYP_001507975 
Protein GI158315467 
COG category[C] Energy production and conversion
[R] General function prediction only 
COG ID[COG0604] NADPH:quinone reductase and related Zn-dependent oxidoreductases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGGCAC TTGTGACGGA CGGGTCGGCG GCCGGCGGGC TGCGGCTTGG GGAGGTGCCC 
GATCCGGTAC CCGGGCCCGA CCAGGTGCTG ATCAGAACGG CGGCGATCTC GCTGGTCGAC
CGCGACACGG GCTATGCCGC GGCGATGCTC GGCGACGGCG GAGTCTGGGG GTTCGACGCG
GCGGGTGTGG TCGTCGAGGC CGCAGCTGAC GGGAGTGGCC CGCCGGTGGG GTCGACGGTG
CTCACCCTGC TGCCGGCGCC CGGGGCCTGG GCGGAGCTCG TCACCGCCAG CACCGGTGAC
GTTGCCATGC TGCCGCCGGG TGTGGATCCG GGCGTGCTGA CTGGGCTCGC GCTGCCGGCG
GTCTCCGCGG TGCAGGCCCT CGGTGAGGTC GAGGGGCTCG CCGGCAGCCG CGTCCTCGTC
ACCGGCGCCG GTGCCGGCGT GGGCTGGTTC GCCGTTCAGT TGGCCGCGCT GCGCGGGGCC
GAGGTCGTCG CCGTGGCGCG CGATCCCGCG GACGCCGACG ACCTGCGGGC GGCCGGCGCC
CACGAGGTCC GCACCGAGCT GCCGGCGACG GATCCGGGTG ACCCGGTCGG GGGCGATCCG
GCCACGGCTG AGCCAGCCGC GTCGTTGCGG CCGGTGGACG TGGTGGTCGA CGTGGTGGGC
GGGTCGACGA TGACCCGGGC GGTCGACCTG CTGGCGGAGG GCGGCACCGC CCTCGCGGTC
GGCGCGATCT CCGGGGAGCG GATGGTCTTC CCGCCGGCGG CCTTCGCGAG CCCGCTGCGC
CGACGTGTCC GCGGGTTCTG GGGCAGCTGG CCGGTCGGCG GCGACCTGGC CACGGTCGTC
GAGCTGGTCG CCGCCGGGCG GCTGTGCCCA CGGCCGGGCT GGCGCGGTGG CTGGGGTGAG
GTCACGGGTC TGCTCGAGAG CTTCGCCGCC GGCCGGACCC GGCGCCGCCG GGCCGTGCTC
GACGTCGTCT GA
 
Protein sequence
MRALVTDGSA AGGLRLGEVP DPVPGPDQVL IRTAAISLVD RDTGYAAAML GDGGVWGFDA 
AGVVVEAAAD GSGPPVGSTV LTLLPAPGAW AELVTASTGD VAMLPPGVDP GVLTGLALPA
VSAVQALGEV EGLAGSRVLV TGAGAGVGWF AVQLAALRGA EVVAVARDPA DADDLRAAGA
HEVRTELPAT DPGDPVGGDP ATAEPAASLR PVDVVVDVVG GSTMTRAVDL LAEGGTALAV
GAISGERMVF PPAAFASPLR RRVRGFWGSW PVGGDLATVV ELVAAGRLCP RPGWRGGWGE
VTGLLESFAA GRTRRRRAVL DVV