Gene Elen_1658 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_1658 
Symbol 
ID8415957 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp1960838 
End bp1962055 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content66% 
IMG OID645024627 
Productthiamine biosynthesis/tRNA modification protein ThiI 
Protein accessionYP_003182015 
Protein GI257791409 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0301] Thiamine biosynthesis ATP pyrophosphatase 
TIGRFAM ID[TIGR00342] thiazole biosynthesis/tRNA modification protein ThiI 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.151293 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.00140445 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACTGAAT TCCAACGCAT CTGCTTGGTC CATTACCACG AGATCGGGCT CAAGGGGCAC 
AATCGATCGA CGTTCGAGAT GAGGTTGCTC AAGAACCTCG AAGCGCTGCT GAAGCCTTTC
CCCGTGGTCG TTATCCATCG CATCGCGGGC CGTTTGTGCG TGTTCTTGCG TGAGGGCACC
GATTGGACCA CCGCGAAGGA GGCGGCCGAC GTCATCGGCA AGGTGCCGGG CGTAGCGCGC
GTGTCGTGCG GTTTCAAGTG CGAGCGCGAC CTCGACGAGA TGACGGAGGC GGCGCTCGCG
GCGATGGCCG AGGCGGGCGA GTTCGACACG TTCAAGGTGG CGGCGCGGCG CAACCACACC
GATTTCGCCA CGGGTTCGAT GGACATGAAC CAGATCATCG GCTCGGCGCT GTGCGCCGCG
CACCCCGAGA AGTCCGTTAA GATGAAGAAG CCCGACGTTA CGGTGGGCGT CGAGGTGGTG
CAGAACGCGG CGTACGTGTA CGCGCGCTCG CTGCCGGGCG TGGGAGGGCT GCCGGTGGGC
AGTTCGGGCC TGGTCGTGAG CCTGCTGTCG TCGGGCATCG ACTCGCCGGT GGCGACGTGG
AAGCTCGCGC GGCGCGGCGC GGTGTGCATA GGCGTGCACT TCTCCGGACG ACCTCAAACA
TCCGATGCCA GCGAGTACCT CGTGGACGAC ATCGCGCAGG TGCTGGAGCG CACGGGCTGC
ATCGCTCGCG TGTACGCGGT GCCGTTCGGC GACTACCAGC GCGAGATCGC GCTGACCGTG
CCGCCCGAGC TGCGCGTCAT CATGTACCGT CGTCTCATGT TCAAGGTGGC TGAGGAGATC
GCACGCCGCG AGCGCGCGGG AGCGCTGGTG ACGGGAGAGA GCCTGGGTCA GGTTGCCTCG
CAGACGCTCG ACAACATCCG CTGCACTGAC GCGGCGGTTG ACCTGCCCGT CTTTCGTCCG
CTCATCGGCA CCGACAAGCT GGAGATCATC GCCGAAGCCG AGCGTTTGGG CTCGTTCGAG
ATCTCGTCGC AGGATGCGCC CGACTGCTGC ACGCTGTTCA TGCCGCGCAG TCCGGAGACG
CATGCGAAGC TGCCCGTGGT GTTGGAAGCC GAGGCCGCGC TGCCCATCGA GCGCTGGGTA
CCCGAGATCG CCGACGCCGC AGAGGTTCGC GACTACGCCT GCCCCGCCTA CAAGCCGAAG
AAGAAACGCG CTTCCTGA
 
Protein sequence
MTEFQRICLV HYHEIGLKGH NRSTFEMRLL KNLEALLKPF PVVVIHRIAG RLCVFLREGT 
DWTTAKEAAD VIGKVPGVAR VSCGFKCERD LDEMTEAALA AMAEAGEFDT FKVAARRNHT
DFATGSMDMN QIIGSALCAA HPEKSVKMKK PDVTVGVEVV QNAAYVYARS LPGVGGLPVG
SSGLVVSLLS SGIDSPVATW KLARRGAVCI GVHFSGRPQT SDASEYLVDD IAQVLERTGC
IARVYAVPFG DYQREIALTV PPELRVIMYR RLMFKVAEEI ARRERAGALV TGESLGQVAS
QTLDNIRCTD AAVDLPVFRP LIGTDKLEII AEAERLGSFE ISSQDAPDCC TLFMPRSPET
HAKLPVVLEA EAALPIERWV PEIADAAEVR DYACPAYKPK KKRAS