Gene Elen_2064 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_2064 
Symbol 
ID8416380 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp2429696 
End bp2431021 
Gene Length1326 bp 
Protein Length441 aa 
Translation table11 
GC content68% 
IMG OID645025045 
Productthiamine biosynthesis protein ThiC 
Protein accessionYP_003182416 
Protein GI257791810 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0422] Thiamine biosynthesis protein ThiC 
TIGRFAM ID[TIGR00190] thiamine biosynthesis protein ThiC 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.27669 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.0343903 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTAATG CCAACGAAGG GAGTTCTATG ACGCAGATCG ACGCGGCACG CGCCGGCACC 
ATCACCCGCG AAATGGCCAT CGTGGCCGAG AAGGAGGGGC GCGACCCCGA GTTCATCCGC
GAGGGGGTTG CGGCCGGGCG CATCGCCATC CCCGCCAACA TCCACCATAC CAGCTTGTCG
CCCGAAGGCG TGGGCGGCGG CCTGCGCACG AAGGTGAACG TGAACCTGGG CATTTCGGGC
GACGTCGCCG ACGAGGCCGA GGAATGGAAG AAGGTGGACG TGGCGCTGGA GCTGGGCGCC
GAGGCCATCA TGGACTTGTC GAACACCGGC AAGACGCACG CGTTCCGCAG CGCGCTCATC
GAGAAGTCGC CGGCCATGAT CGGCACCGTG CCCATGTACG ACGCCATCGG CTATCTGGAG
AAGCCGCTGA TCACCATCAC GGTGGAGGAC TTCCTCGACG TGGTCCGCGC GCATGCAGAG
GACGGCGTGG ACTTCGTCAC CATCCACGCG GGCATGAATC GGCGCACCAT CGAGTCGTTT
CGCGAGACGG GCCGCCTCAC GAACATCGTG AGCCGCGGCG GGTCGCTCAT CTTCGCGTGG
ATGGAGGCCA CCGGCAACGA GAACCCCTTC TACGAGTTCT ACGACGAGGT GCTGGCCATC
CTGCACGAGC ACGACGTGAC CATCAGCCTG GGCGATGCCA TGCGGCCCGG CTCCTCGTAC
GACGCCACCG ACGCGGGGCA GATCGCCGAG CTCATCGAGA TCGGCAAGCT CACGAAGCGC
GCATGGGACG CGGGCGTGCA GGTGATGGTG GAAGGCCCCG GGCATATGGC GCTCGATGAG
ATCGCCGCCA ACATGAAGCT GGAAAAGCGG CTGTGCCACG ACGCGCCCTT CTACGTGCTG
GGGCCGCTGG TCACCGACAT CGCGCCGGGC TACGACCATA TCACGGCCGC CATCGGGGGC
GCGGTTGCCG CGGCTTCGGG CGCCGACTTC CTCTGCTACG TCACGCCGGC CGAGCATCTG
CGCCTGCCCG ACGCCGCCGA CGTGCGCGAG GGCCTCGTGG CCACGAAGAT CGCCGCGCAT
GCGGCCGACA TCGCGCGCGG GGTGCCCGGA GCGCGCGACC GCGACAACCG CATGAGCGAC
GCTCGGCGCC GCGTGGACTG GGAGGGCATG TTCGCCGAGG CGCTCGATCC GGTCAAGGCG
CGCCGCTACT TCGAGAGCGC TCCGCCCTCG ACCGACGGCA CCTGCACCAT GTGCGGCGAG
ATGTGCGCCA TGCGAACCGT GAACACCATC ATGGACGGCC TGACGGTCGA TCTCGGGAAG
GAGTAG
 
Protein sequence
MGNANEGSSM TQIDAARAGT ITREMAIVAE KEGRDPEFIR EGVAAGRIAI PANIHHTSLS 
PEGVGGGLRT KVNVNLGISG DVADEAEEWK KVDVALELGA EAIMDLSNTG KTHAFRSALI
EKSPAMIGTV PMYDAIGYLE KPLITITVED FLDVVRAHAE DGVDFVTIHA GMNRRTIESF
RETGRLTNIV SRGGSLIFAW MEATGNENPF YEFYDEVLAI LHEHDVTISL GDAMRPGSSY
DATDAGQIAE LIEIGKLTKR AWDAGVQVMV EGPGHMALDE IAANMKLEKR LCHDAPFYVL
GPLVTDIAPG YDHITAAIGG AVAAASGADF LCYVTPAEHL RLPDAADVRE GLVATKIAAH
AADIARGVPG ARDRDNRMSD ARRRVDWEGM FAEALDPVKA RRYFESAPPS TDGTCTMCGE
MCAMRTVNTI MDGLTVDLGK E