Gene Franean1_5475 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5475 
Symbol 
ID5673806 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6620149 
End bp6621549 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content68% 
IMG OID641244330 
Productdihydrolipoamide dehydrogenase 
Protein accessionYP_001509736 
Protein GI158317228 
COG category[C] Energy production and conversion 
COG ID[COG1249] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide dehydrogenase (E3) component, and related enzymes 
TIGRFAM ID[TIGR01350] dihydrolipoamide dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.67474 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.756284 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGCGC ACTTTGATCT CGTCGTCCTA GGCGGAGGTC CTGGCGGCTA CGTCGCGGCG 
ATCCGGGCGG CCCAGCTCGG GCTGTCGGTC GCGGTCGTCG AGGAGAAGTA CTGGGGCGGC
GTTTGCCTGA ACGTCGGGTG CATCCCCTCG AAGGCGTTGC TGCGCAACGC CGAGCTCGCG
CACCTGTTCG CCCACGAGGC GAAGACCTTC GGTATCTCCG GCGAGGTGAG CTTCGACTTC
GGCGCCGCCT TCGACCGCAG CCGCCAGGTC GCCGAGGGGC GCGTCAAGGG CGTGCACTTC
CTGATGAAGA AGAACAAGAT CACCGAGTTC ACCGGCCGCG GTACCTTCCG TGACCCGAAC
ACCCTGGACG TCGCGCTCTC CGCCGGCGGC ACCGACCAGG TGAGCTTCGA CCACGCGATC
ATCGCGACGG GTTCCCGGGT CCGGCTGCTG CCCGGCGTCG AGCTCTCCGA CAACATCGTC
ACCTACGAGA CGCAGATCCT CACCCGCGAG CTGCCGCGGT CGATGGCGAT CGTCGGCGCC
GGGGCGATCG GCATGGAGTT CGCCTACGTC CTGCGCAACT ACGGCGTGGA CGTCACGATC
ATCGAGTTCC TCGACCGCGC GCTGCCGAAC GAGGACGCCG ACGTCTCCAA GGAGATCGTC
CGCCAGTACA AGAAGCTCGG CGTGCCGATC CTGACCTCGA CCAAGGTCGA GACGGTGACG
GACAACGGCT CCTCGGTGAC CGTCGAGTAC ACCGGCAAGG ACGGCGCCCG GGGCTCGCTC
GAGGTGGACA AGGTCCTCAT GTCCATCGGG TTCGCGCCCA ACGTCGAGGG CTTCGGCCTG
GAGAACACCG GCGTGGCGCT CACCGACCGC GGCGCGATCG CGATCGACGA CCACATGCGC
ACCAACGTCG AGCACATCTA CGCCATCGGC GACGTGACGG CGAAGCTCAT GCTGGCGCAT
GTCGCCGAGG CTCAGGGCGT CGTCGCGTCC GAGACCATTG CCGGTGCGGA GACGGTGATG
CTCGGTGACT ACCGGATGAT GCCGCGGGCC ACCTTCTGTC AGCCCCAGGT CGCCAGCTTC
GGTCTCACCG AGGCACAGGC ACGGGAGGAG GGCCACGACA TCAAGGTGGC GAAGTTCCCG
TTCACCGCGA ACGGCAAGGC CCACGGCCTG GGCGACCCGA ACGGCTTCGT CAAGCTGATC
TCCGACACGA AGTACGGCGA GCTGCTCGGC GGCCACCTGA TCGGCCCGGA CGTCTCCGAG
CTGCTGCCCG AGCTGACGCT GGCCCAGAAA TGGGACCTCA CCGCGCTCGA GCTCGCCCGC
AACGTGCACA CCCACCCGAC GCTGAGCGAG GCGTTGCAGG AGGCGATCCA CGGCCTCGCC
GGCCACATGA TCAACCTCTG A
 
Protein sequence
MAAHFDLVVL GGGPGGYVAA IRAAQLGLSV AVVEEKYWGG VCLNVGCIPS KALLRNAELA 
HLFAHEAKTF GISGEVSFDF GAAFDRSRQV AEGRVKGVHF LMKKNKITEF TGRGTFRDPN
TLDVALSAGG TDQVSFDHAI IATGSRVRLL PGVELSDNIV TYETQILTRE LPRSMAIVGA
GAIGMEFAYV LRNYGVDVTI IEFLDRALPN EDADVSKEIV RQYKKLGVPI LTSTKVETVT
DNGSSVTVEY TGKDGARGSL EVDKVLMSIG FAPNVEGFGL ENTGVALTDR GAIAIDDHMR
TNVEHIYAIG DVTAKLMLAH VAEAQGVVAS ETIAGAETVM LGDYRMMPRA TFCQPQVASF
GLTEAQAREE GHDIKVAKFP FTANGKAHGL GDPNGFVKLI SDTKYGELLG GHLIGPDVSE
LLPELTLAQK WDLTALELAR NVHTHPTLSE ALQEAIHGLA GHMINL