Gene Franean1_1779 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1779 
Symbol 
ID5670181 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2137072 
End bp2138520 
Gene Length1449 bp 
Protein Length482 aa 
Translation table11 
GC content76% 
IMG OID641240700 
Product2-oxoglutarate dehydrogenase E2 component 
Protein accessionYP_001506123 
Protein GI158313615 
COG category[C] Energy production and conversion 
COG ID[COG0508] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide acyltransferase (E2) component, and related enzymes 
TIGRFAM ID[TIGR02927] 2-oxoglutarate dehydrogenase, E2 component, dihydrolipoamide succinyltransferase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00910904 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.202606 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAGTAT CCGTCACGAT GCCCCGCCTC GGGGAGAGCG TCTCCGAGGG GACGGTCACC 
CGCTGGCTGA AGAAGGAAGG CGAGCGGGTC GAGGCCGACG AGCCGCTCCT CGAGGTCAGC
ACCGACAAGG TCGACACCGA GATCCCCGCG CCGGCCTCCG GCGTGCTCGG CTCGATCAAG
GTCGCCGAGG ACGAGACCGT CGAGGTCGGC GTCGAGCTGG CTGTCATCGA GGACGGCGCC
GCGGGTGCCG GCGCGGCGGC CGCGCCCGCC GAGGCCCCCG CCGCGGCCCC GGAGCCTCCG
CCGGCGCCTG CCGCCGCCGC GCCGCCCCCG CCGCCTGCTC CCCCCGCTCC GTCGGCGCCG
GCGCCCGCTC CCGCGGCGGC ACCGCCTCCG CCGCCGGCTC CCGCCACGCC GGTGCCCGCG
CAGGCCCCGG TCGCCGCCCC GGACGCCGCC GCCGACGGTC TCGGCCGGTA CGTGACGCCG
CTGGTCCGCA AGATGGCCGC GGAGCTGGGC GTCGACCTCG GCTCGGTCAC CGGCAGCGGC
CCGGGTGGAC GCATCACCAA GCAGGACATC CAGGACGCGG CGAAGTCCCG CGGCTCCGCG
CCGGCCGCGG CTCCCAGCGC CCCCGCCGCT CCGGCTGTGC CGGCCGCTCC CGCCCCGGCG
CAGGCCCCCG CGGCGCCCGC GGCCGCCCGG CCGGCTCCCG CCGCCGCGCC GACCGCGTCC
ACCGCGCCCC GTGGCCGCAC CGAGAAGCTG ACCCGGCTGC GCTCGCTGGT CGCCCGTCGG
ATGGTCGAGT CGCTGCAGGT CAGCGCGCAG CTCACCACCG TCGTGGAGGC CGACGTCACG
CGGATCGCGA AGCTGCGCGA GCGGGCCAAG GCGAACTTCC AGGCCCGCGA GGGCGTGAAG
CTGTCGTTCC TGCCGTTCTT CGCGGTGGCC GCGTGCGAGG CACTGCGCGA GCACCCGGGG
ATCAACTCCA GCATCGACCT GGAGGCGGGC ACGGTCACCT ACCACGACTC GGAGAACCTC
GGCATCGCCG TCGACACCGA CCGTGGCCTG GTCGTCCCGG TGATCCACAA CGCCAGCGAC
CTGAACCTCA GCGGGATGGC CCGCAAGATC GACGAGCTGG CCGCGCGGAC CCGCGCCAAC
CAGGTGTCCC CGGACGACCT GGGCGGTGGC ACCTTCACGC TGACCAACAC CGGCAGCCGC
GGCGCCCTGT TCGACACCCC GATCATCAAC CAGCCGCAGG TGGCGATCCT GGGCACCGGG
TCGGTCGTGA AGCGCCCGGC CGTCGTCACC GATCCCGAGC TCGGCGAGGT GATCGCCATC
CGGTCGAAGG TCTACCTGGC CCTCACCTAC GACCACCGGA TCGTGGATGG CGCCGACGCG
GCCCGGTTCC TCACCGCGAT CGCCAGCCGT CTCGAGGAAG GCGCCTTCGA GGCCGAGCTC
GGGCTGTAG
 
Protein sequence
MSVSVTMPRL GESVSEGTVT RWLKKEGERV EADEPLLEVS TDKVDTEIPA PASGVLGSIK 
VAEDETVEVG VELAVIEDGA AGAGAAAAPA EAPAAAPEPP PAPAAAAPPP PPAPPAPSAP
APAPAAAPPP PPAPATPVPA QAPVAAPDAA ADGLGRYVTP LVRKMAAELG VDLGSVTGSG
PGGRITKQDI QDAAKSRGSA PAAAPSAPAA PAVPAAPAPA QAPAAPAAAR PAPAAAPTAS
TAPRGRTEKL TRLRSLVARR MVESLQVSAQ LTTVVEADVT RIAKLRERAK ANFQAREGVK
LSFLPFFAVA ACEALREHPG INSSIDLEAG TVTYHDSENL GIAVDTDRGL VVPVIHNASD
LNLSGMARKI DELAARTRAN QVSPDDLGGG TFTLTNTGSR GALFDTPIIN QPQVAILGTG
SVVKRPAVVT DPELGEVIAI RSKVYLALTY DHRIVDGADA ARFLTAIASR LEEGAFEAEL
GL