Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_2654 |
Symbol | |
ID | 8416980 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 3073588 |
End bp | 3074568 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 645025633 |
Product | ApbE family lipoprotein |
Protein accession | YP_003182994 |
Protein GI | 257792388 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG1477] Membrane-associated lipoprotein involved in thiamine biosynthesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.131589 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAATATC GCGACGCATA CGACCCCATC CCGCTGGAAG ACGTGCACGA AACGCACGGG CCCAACGACG CGGGCATGAT GACGCACCAG TTCTACGCGT TCAACACGAT CATCACCCTG CAAGCCTACG CCGATGCCGC GCAGTGCGCC CCCGCGTTCG ACGCGGCCCG CGGCGCGAGT CGCGCGTTCG AGCGGCGGCT TTCGCGCACG CTGCCGCACT CCGACATCTC GCGGCTGAAC GCGGCTGCAG GCAAGCGCGT GGCCGTCCAC GACGACACGG CCGAGCTGCT GCGCGCGGCC ATCGGGTACT GCGCCGACAG CGAGGGCCTG TTCGACGTCA CCGTGGGCTC GGCGGTGCGG CTGTGGAACT TCCACGAGGG CACGGTGCCC GAGCGCGCCG ACGTGGAGCG CGCGCTGACG CACGTGGATT GGCGCGCGCT GCGCGTAAGC GAGGCCGGAG AGCCCGGCGG GTCCTGGGCG CAGCTGGCCG ACCCGCAGGC GGCCGTGGAT GTAGGCGGCA TCGCGAAGGG ATGGATCGCC GACCGGCTTT CCGCGGTGCT AGCCGAGCAC GGGCTGGACT CGTTCGTGGT GAACCTGGGC GGCAACGTGA TGGCGCACGG GCAGAAGCCA GACGGCAGCC CATGGCGCGT AGGCTTGCAG GATCCGCGCG ACAAGGGCTC CATCGTGGGC GCCGTGACCG TGCGCGACGC CTCGGCCGTG ACCAGCGGCG TGTACGAGCG CTGCTTCGAG CGAGATGGCG TGTTCTACCA CCACATCCTC GACCCGAAGA CGGGCTTCCC CGTCGAGACG GATGCCGCGG GAGCCACCGT GGTGGCGCGC CGTTCGATCG ATGCGGAGGG CTACTCGACC ACCCTGCTGG CATTGGGGAT CGAACGCGGC CTGGCGTTCG CCCGCGAGCG CGATGCGATC CTGGGCGCGT ATTTCGTGGA CCGGGACGGC AAGGTGGCAG GGATCGCCTA G
|
Protein sequence | MEYRDAYDPI PLEDVHETHG PNDAGMMTHQ FYAFNTIITL QAYADAAQCA PAFDAARGAS RAFERRLSRT LPHSDISRLN AAAGKRVAVH DDTAELLRAA IGYCADSEGL FDVTVGSAVR LWNFHEGTVP ERADVERALT HVDWRALRVS EAGEPGGSWA QLADPQAAVD VGGIAKGWIA DRLSAVLAEH GLDSFVVNLG GNVMAHGQKP DGSPWRVGLQ DPRDKGSIVG AVTVRDASAV TSGVYERCFE RDGVFYHHIL DPKTGFPVET DAAGATVVAR RSIDAEGYST TLLALGIERG LAFARERDAI LGAYFVDRDG KVAGIA
|
| |