Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_1658 |
Symbol | |
ID | 8415957 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 1960838 |
End bp | 1962055 |
Gene Length | 1218 bp |
Protein Length | 405 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 645024627 |
Product | thiamine biosynthesis/tRNA modification protein ThiI |
Protein accession | YP_003182015 |
Protein GI | 257791409 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0301] Thiamine biosynthesis ATP pyrophosphatase |
TIGRFAM ID | [TIGR00342] thiazole biosynthesis/tRNA modification protein ThiI |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.151293 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.00140445 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACTGAAT TCCAACGCAT CTGCTTGGTC CATTACCACG AGATCGGGCT CAAGGGGCAC AATCGATCGA CGTTCGAGAT GAGGTTGCTC AAGAACCTCG AAGCGCTGCT GAAGCCTTTC CCCGTGGTCG TTATCCATCG CATCGCGGGC CGTTTGTGCG TGTTCTTGCG TGAGGGCACC GATTGGACCA CCGCGAAGGA GGCGGCCGAC GTCATCGGCA AGGTGCCGGG CGTAGCGCGC GTGTCGTGCG GTTTCAAGTG CGAGCGCGAC CTCGACGAGA TGACGGAGGC GGCGCTCGCG GCGATGGCCG AGGCGGGCGA GTTCGACACG TTCAAGGTGG CGGCGCGGCG CAACCACACC GATTTCGCCA CGGGTTCGAT GGACATGAAC CAGATCATCG GCTCGGCGCT GTGCGCCGCG CACCCCGAGA AGTCCGTTAA GATGAAGAAG CCCGACGTTA CGGTGGGCGT CGAGGTGGTG CAGAACGCGG CGTACGTGTA CGCGCGCTCG CTGCCGGGCG TGGGAGGGCT GCCGGTGGGC AGTTCGGGCC TGGTCGTGAG CCTGCTGTCG TCGGGCATCG ACTCGCCGGT GGCGACGTGG AAGCTCGCGC GGCGCGGCGC GGTGTGCATA GGCGTGCACT TCTCCGGACG ACCTCAAACA TCCGATGCCA GCGAGTACCT CGTGGACGAC ATCGCGCAGG TGCTGGAGCG CACGGGCTGC ATCGCTCGCG TGTACGCGGT GCCGTTCGGC GACTACCAGC GCGAGATCGC GCTGACCGTG CCGCCCGAGC TGCGCGTCAT CATGTACCGT CGTCTCATGT TCAAGGTGGC TGAGGAGATC GCACGCCGCG AGCGCGCGGG AGCGCTGGTG ACGGGAGAGA GCCTGGGTCA GGTTGCCTCG CAGACGCTCG ACAACATCCG CTGCACTGAC GCGGCGGTTG ACCTGCCCGT CTTTCGTCCG CTCATCGGCA CCGACAAGCT GGAGATCATC GCCGAAGCCG AGCGTTTGGG CTCGTTCGAG ATCTCGTCGC AGGATGCGCC CGACTGCTGC ACGCTGTTCA TGCCGCGCAG TCCGGAGACG CATGCGAAGC TGCCCGTGGT GTTGGAAGCC GAGGCCGCGC TGCCCATCGA GCGCTGGGTA CCCGAGATCG CCGACGCCGC AGAGGTTCGC GACTACGCCT GCCCCGCCTA CAAGCCGAAG AAGAAACGCG CTTCCTGA
|
Protein sequence | MTEFQRICLV HYHEIGLKGH NRSTFEMRLL KNLEALLKPF PVVVIHRIAG RLCVFLREGT DWTTAKEAAD VIGKVPGVAR VSCGFKCERD LDEMTEAALA AMAEAGEFDT FKVAARRNHT DFATGSMDMN QIIGSALCAA HPEKSVKMKK PDVTVGVEVV QNAAYVYARS LPGVGGLPVG SSGLVVSLLS SGIDSPVATW KLARRGAVCI GVHFSGRPQT SDASEYLVDD IAQVLERTGC IARVYAVPFG DYQREIALTV PPELRVIMYR RLMFKVAEEI ARRERAGALV TGESLGQVAS QTLDNIRCTD AAVDLPVFRP LIGTDKLEII AEAERLGSFE ISSQDAPDCC TLFMPRSPET HAKLPVVLEA EAALPIERWV PEIADAAEVR DYACPAYKPK KKRAS
|
| |