Gene Franean1_5581 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5581 
Symbol 
ID5673909 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6761562 
End bp6764714 
Gene Length3153 bp 
Protein Length1050 aa 
Translation table11 
GC content77% 
IMG OID641244435 
ProductD-lactate dehydrogenase (cytochrome) 
Protein accessionYP_001509839 
Protein GI158317331 
COG category[C] Energy production and conversion 
COG ID[COG0247] Fe-S oxidoreductase
[COG0277] FAD/FMN-containing dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.136848 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCGACG GTGATCTGCC CGGGGGCTCC CGGGCGGAGC GGACGACCCG CGTCCAGCCC 
GCCCGGGGCC GGAACCGGGG CCTGGGCCTG GGCCGCTCCC GGCCGGCCGG GAGCGGGAGC
GTAGCGGCAC GGGGCGAGGC TGAGGCCCGG GAGCTGGTCG CGCGGCTGCG CGCGGCCACC
GGATCCGCCG TGGTCCGCAC CGCGGGTGGC ACGGCACCGG TGGTCGATCA TTCGGCGCGG
CGGCGCGCCG AGTACTCCTC GGACGCCTCC AACTACCGGG TCACCCCCGA GGTCGTCGTC
CTGCCGCGCG ACGTCGACGA GGTCCTCGCT GTCGCCGCGG TCTGCCGGGA GACGGGCACC
GCGCTGACGG TGCGCGGCGC GGGGACGTCC ATCGCCGGCA ACGCGGTCGG CCCCGGGGTC
GTCATGGACC TCAGCCGGCA TCTCACCGGA ACTCCGGACA TCGACCCGGA TTCCAGAACT
GCCAAGGTCG CCCCCGGTCT GGTGCTGGAC GACCTGCAGG CCGCCGCGCG CCCGCACCGG
CTGCGGTTCG GCCCCGACCC GTCCACCCAC GGTCGCGCCA CCCTCGGCGG CATGATCGGG
AACAACGCCT GCGGGGCGCG CGCGGTGGCC TACGGGCGGA CCGCCGACAA CGTCGTCGCC
CTCGACGTCG TGGACGGCAC GGGCCGCCGT TTCGCCGCCA CCGCCGCGGC GACAGGCACG
CCACCGTCCG GCAACGGATC GCCTGCCGGG CCGGTGGTGC CGGGGCTCGA CACCTTCGTC
CACGCGAACC TCGGAGTGAT CCGGACGGAG CTGGGGCGGT TCGACCGGCA GATCTCGGGC
TACTCGCTGG AACATCTGCT GCCCGAGCGC GGTGCCCATC TCGGGCGCGC TCTCGTTGGT
ACCGAGGGCA CCTGCGCCGT CGTGCTCGGC GCGACGGTCC GGCTCGTCGA GATACCGACC
GCGACCGCGC TCGCGGTGCT CGGATACCCG GACATGGCGA CCGCCGCCGA CGCGGTCCCC
GGCCTGCTCG GCCACCGCCC GCTCGCCGTC GAGGGCATGG ACGCCCGCCT CGTCGACGGG
GTGCGCCGCC GCCGGGGGAG CGGCGCCGTG CCCGCGCTGC CCGACGGCGG CGGCTGGCTG
TTCGTGGAGG TCGGCGGGCC GACCGGGTCG GACGCCGCGG TCGCGGCGGC GGCGGTCGTG
GCCGACGCGG GAGCGGGGGC GTCCATGGTG CTGCCGGCCG GGCCTGTGGC CGCGGCGCTC
TGGCGCCTGC GGGAGGACGG GGCGGGCCTG GCGGGGCGCA CCCCGGACGG GGCCCCGGCC
TGGCCGGGCT GGGAGGACGC CGCCGTCCCG CCCGCCAACC TGGGCTCCTA CCTGCGTGAG
TTCGGCAAGC TGATGCGGGC CCACCAGTTG GACGGCGTCG CCTACGGTCA CTTCGGTGAC
GGTTGCGTGC ACGTGCGGAT CGACTTCCCG CTCGCCGAGC ACCCCGGCCG GCTGCGGACC
TTCCTCACCG AGGCGGCGCA CCTGGTCGTC GCGCACGGCG GCTCGCTGAC CGGCGAGCAC
GGGGACGGGC GGGCCCGCGG CGAGCTGCTT CCGATCATGT ACTCGCCGGA CGCTCTGGCG
GCCTTCGCCG CCTTCAAACA CATCTTCGAC CCGGACGACC TGATGAACCC CGGGGTCGTC
GTGCGCCCAC GGCCGCTTGA CGCCGACCTG CGTCGCCCGC GGGCCCGTCC ACTGCCGCGG
GTGGAAGGGA GCTTCGCGCT CACCGCGGAC GCAGGGGACC TCACTCAGGC CGTGCACCGC
TGCGTCGGAC TGGCGAAGTG CCGGGCGGAC ACGTCCACGT CCGGGGGCTT CATGTGCCCG
TCCTTCCTGG CCACCCGGGA CGAGAAGGAC TCGACCCGGG GCCGCGCGCG GGTCCTGCAG
GAGCTTGCCG CCGGCGGCCC CGGCGGCCCC GCCGGGCCGG GCGGCGCGCC GGCCATTCCC
GCGCCCGCGC GGCGGGAGTG GCGCGATCCG GCGGTGCGCG AGTCGCTCGA CCTCTGCCTG
TCGTGCCGTG CCTGCGCCCG GGACTGCCCG GCCGGTGTCG ACGTCGCCCG GTACAAGTCC
GAGGTGCTGC ACCGGGCCTA CCGACGTCGC CCGCGACCGG CGGCGCACTA CGCCCTCGGC
TGGCTGCCCC GGTGGACGCG TCTCGCCGGG CGCGTGCCCC GGGTGGTCAA CGCGGCGCTG
CGCACCGGCG CGATCCGCCG TCCACTGTTC CGGCTCGGCG GTCTGGACGC CCGCCGCGCC
GCGCCGGAGT TCGCCGCGGA GCCGTTCCAC CGCTGGTGGC GGCGAACCAG CCGGAACGCG
CCGGCCGCCA CCGGCGACCG GACGGACCCA CCGGTGCTGC TGTGGGTCGA CACCTTCACC
GACGTGTTCG CGCCCGGCGC CGCGCGCGCG GCGGTCGAGG TCCTCACCGC CGCCGGCCAC
CGGGTACTGA TCCCGAGCGG ACGGCCCGCC TGCTGCGGCC TCACCTGGAT CACCACCGGC
CAACTGGACG GGGCGCGGCG CCGCCTGCGC CGGACCCTTG ACCAGCTCGC CCCCTACGCG
CTCAGCGGGA TCCCGATCGT GGGGCTGGAG CCGTCGTGCA CGGCCGTTCT GCGGGACGAC
CTCGTCGAGC TGTTCCCCGG AGACCGGCGG GCCGAGGCGC TCGCGGCGGG CGTGCGGACG
CTCGCCGAGC TGCTGACCCA GGATGGCGGC CCCGAACAGC GCCCGGCCAG GAGCCCCGCC
GGCGGCGGAA ACCCGGCCGG TGGAAGGAAC CCCGCGGGCG GGTGGCGGCT TCCGCGGCTG
GACGGCGTGC GGGTGCTCGC GCAGCCGCAC TGCCACCAGC ACGCGGTCAT GGGCTTCGAG
ACCGACAGCG CGCTGCTGCG GGCCGCCGGC GCGGAGGTCG AGACCCTGGC GGGGTGCTGC
GGCCTGGCCG GTGACTTCGG GATGCGGCGT GGGCACCACG ACATCTCGGT CGCCGTCGCC
GAGCGGGCGC TGCTGCCCGC CCTGCGCGCG GCGGGGGAGG ACACTGTGCT GCTCGCGGAC
GGCTTCTCGT GCCGCACGCA GGCCGCCCAG CTCGGTGGCC GGACGGCCTA TCACCTCGCC
GAGCTACTTG CCAAAAAGTT GCGTCCAGGC TGA
 
Protein sequence
MRDGDLPGGS RAERTTRVQP ARGRNRGLGL GRSRPAGSGS VAARGEAEAR ELVARLRAAT 
GSAVVRTAGG TAPVVDHSAR RRAEYSSDAS NYRVTPEVVV LPRDVDEVLA VAAVCRETGT
ALTVRGAGTS IAGNAVGPGV VMDLSRHLTG TPDIDPDSRT AKVAPGLVLD DLQAAARPHR
LRFGPDPSTH GRATLGGMIG NNACGARAVA YGRTADNVVA LDVVDGTGRR FAATAAATGT
PPSGNGSPAG PVVPGLDTFV HANLGVIRTE LGRFDRQISG YSLEHLLPER GAHLGRALVG
TEGTCAVVLG ATVRLVEIPT ATALAVLGYP DMATAADAVP GLLGHRPLAV EGMDARLVDG
VRRRRGSGAV PALPDGGGWL FVEVGGPTGS DAAVAAAAVV ADAGAGASMV LPAGPVAAAL
WRLREDGAGL AGRTPDGAPA WPGWEDAAVP PANLGSYLRE FGKLMRAHQL DGVAYGHFGD
GCVHVRIDFP LAEHPGRLRT FLTEAAHLVV AHGGSLTGEH GDGRARGELL PIMYSPDALA
AFAAFKHIFD PDDLMNPGVV VRPRPLDADL RRPRARPLPR VEGSFALTAD AGDLTQAVHR
CVGLAKCRAD TSTSGGFMCP SFLATRDEKD STRGRARVLQ ELAAGGPGGP AGPGGAPAIP
APARREWRDP AVRESLDLCL SCRACARDCP AGVDVARYKS EVLHRAYRRR PRPAAHYALG
WLPRWTRLAG RVPRVVNAAL RTGAIRRPLF RLGGLDARRA APEFAAEPFH RWWRRTSRNA
PAATGDRTDP PVLLWVDTFT DVFAPGAARA AVEVLTAAGH RVLIPSGRPA CCGLTWITTG
QLDGARRRLR RTLDQLAPYA LSGIPIVGLE PSCTAVLRDD LVELFPGDRR AEALAAGVRT
LAELLTQDGG PEQRPARSPA GGGNPAGGRN PAGGWRLPRL DGVRVLAQPH CHQHAVMGFE
TDSALLRAAG AEVETLAGCC GLAGDFGMRR GHHDISVAVA ERALLPALRA AGEDTVLLAD
GFSCRTQAAQ LGGRTAYHLA ELLAKKLRPG