Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_1779 |
Symbol | |
ID | 5670181 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 2137072 |
End bp | 2138520 |
Gene Length | 1449 bp |
Protein Length | 482 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 641240700 |
Product | 2-oxoglutarate dehydrogenase E2 component |
Protein accession | YP_001506123 |
Protein GI | 158313615 |
COG category | [C] Energy production and conversion |
COG ID | [COG0508] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide acyltransferase (E2) component, and related enzymes |
TIGRFAM ID | [TIGR02927] 2-oxoglutarate dehydrogenase, E2 component, dihydrolipoamide succinyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00910904 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.202606 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCAGTAT CCGTCACGAT GCCCCGCCTC GGGGAGAGCG TCTCCGAGGG GACGGTCACC CGCTGGCTGA AGAAGGAAGG CGAGCGGGTC GAGGCCGACG AGCCGCTCCT CGAGGTCAGC ACCGACAAGG TCGACACCGA GATCCCCGCG CCGGCCTCCG GCGTGCTCGG CTCGATCAAG GTCGCCGAGG ACGAGACCGT CGAGGTCGGC GTCGAGCTGG CTGTCATCGA GGACGGCGCC GCGGGTGCCG GCGCGGCGGC CGCGCCCGCC GAGGCCCCCG CCGCGGCCCC GGAGCCTCCG CCGGCGCCTG CCGCCGCCGC GCCGCCCCCG CCGCCTGCTC CCCCCGCTCC GTCGGCGCCG GCGCCCGCTC CCGCGGCGGC ACCGCCTCCG CCGCCGGCTC CCGCCACGCC GGTGCCCGCG CAGGCCCCGG TCGCCGCCCC GGACGCCGCC GCCGACGGTC TCGGCCGGTA CGTGACGCCG CTGGTCCGCA AGATGGCCGC GGAGCTGGGC GTCGACCTCG GCTCGGTCAC CGGCAGCGGC CCGGGTGGAC GCATCACCAA GCAGGACATC CAGGACGCGG CGAAGTCCCG CGGCTCCGCG CCGGCCGCGG CTCCCAGCGC CCCCGCCGCT CCGGCTGTGC CGGCCGCTCC CGCCCCGGCG CAGGCCCCCG CGGCGCCCGC GGCCGCCCGG CCGGCTCCCG CCGCCGCGCC GACCGCGTCC ACCGCGCCCC GTGGCCGCAC CGAGAAGCTG ACCCGGCTGC GCTCGCTGGT CGCCCGTCGG ATGGTCGAGT CGCTGCAGGT CAGCGCGCAG CTCACCACCG TCGTGGAGGC CGACGTCACG CGGATCGCGA AGCTGCGCGA GCGGGCCAAG GCGAACTTCC AGGCCCGCGA GGGCGTGAAG CTGTCGTTCC TGCCGTTCTT CGCGGTGGCC GCGTGCGAGG CACTGCGCGA GCACCCGGGG ATCAACTCCA GCATCGACCT GGAGGCGGGC ACGGTCACCT ACCACGACTC GGAGAACCTC GGCATCGCCG TCGACACCGA CCGTGGCCTG GTCGTCCCGG TGATCCACAA CGCCAGCGAC CTGAACCTCA GCGGGATGGC CCGCAAGATC GACGAGCTGG CCGCGCGGAC CCGCGCCAAC CAGGTGTCCC CGGACGACCT GGGCGGTGGC ACCTTCACGC TGACCAACAC CGGCAGCCGC GGCGCCCTGT TCGACACCCC GATCATCAAC CAGCCGCAGG TGGCGATCCT GGGCACCGGG TCGGTCGTGA AGCGCCCGGC CGTCGTCACC GATCCCGAGC TCGGCGAGGT GATCGCCATC CGGTCGAAGG TCTACCTGGC CCTCACCTAC GACCACCGGA TCGTGGATGG CGCCGACGCG GCCCGGTTCC TCACCGCGAT CGCCAGCCGT CTCGAGGAAG GCGCCTTCGA GGCCGAGCTC GGGCTGTAG
|
Protein sequence | MSVSVTMPRL GESVSEGTVT RWLKKEGERV EADEPLLEVS TDKVDTEIPA PASGVLGSIK VAEDETVEVG VELAVIEDGA AGAGAAAAPA EAPAAAPEPP PAPAAAAPPP PPAPPAPSAP APAPAAAPPP PPAPATPVPA QAPVAAPDAA ADGLGRYVTP LVRKMAAELG VDLGSVTGSG PGGRITKQDI QDAAKSRGSA PAAAPSAPAA PAVPAAPAPA QAPAAPAAAR PAPAAAPTAS TAPRGRTEKL TRLRSLVARR MVESLQVSAQ LTTVVEADVT RIAKLRERAK ANFQAREGVK LSFLPFFAVA ACEALREHPG INSSIDLEAG TVTYHDSENL GIAVDTDRGL VVPVIHNASD LNLSGMARKI DELAARTRAN QVSPDDLGGG TFTLTNTGSR GALFDTPIIN QPQVAILGTG SVVKRPAVVT DPELGEVIAI RSKVYLALTY DHRIVDGADA ARFLTAIASR LEEGAFEAEL GL
|
| |