Gene TDE0047 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTDE0047 
SymbolhutI 
ID2741662 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTreponema denticola ATCC 35405 
KingdomBacteria 
Replicon accessionNC_002967 
Strand
Start bp56010 
End bp57251 
Gene Length1242 bp 
Protein Length413 aa 
Translation table11 
GC content45% 
IMG OID637158917 
Productimidazolonepropionase 
Protein accessionNP_970664 
Protein GI42525566 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1228] Imidazolonepropionase and related amidohydrolases 
TIGRFAM ID[TIGR01224] imidazolonepropionase 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTTTAT TTATAAGCGA CAGTATTTTT TCTTCTACCG AAAAAAAAGG GGTAGACTTT 
GATAAGGCTT TTGCCGGCTA CATCGTTGTA GAAAACGGCC TCATTCAAAA GGTGGGCAAG
GGAGATGCTC CCGAAAGTTT AAAAAGCCAA GCCGAAAAAA TAATAGATGC ACGGGGAAAG
ACTATTACGG CAGGGCTTGT CGATGCACAC ACACACTTGG TGCACGGCGG TTCACGCGAA
CATGAGCTTG CAATGAAACT TGCAGGAAAA ACTTATCTTG AAATCCATGC AAGCGGCGGC
GGTATCTTCA GCACTGTAAG AGCTACCAGA GCAGCTTCAA AAGAAGAGTT GACGCAAAAA
GCTTTGACTA GCCTTGACCG AATGCTTATT CACGGTACAA CCACCGCCGA ATCGAAAAGC
GGTTACGGCC TCGACATGGA AACCGAAATT AAGTGTCTCG AAATAAATTC TTATCTGAAT
AAAAACCACC CAATCGACAT TGTTTCAACC TATATGGGTG CTCATGCAAC CCCGCCCGAA
TTTAAGGACA ACAAGGAAGG CTATATCAAG TTTATGATAG AAGAGGTTAT GCCCGAGGTT
AAAAAACGCG GCTTGGCGGA ATTCTCCGAT GCCTTTTGTG AAGACAAGAT TTTTTCCGTA
GAAGAAACCG AAAGAATAAT GAAGGCCGCT GCCGACTTAG GTTTTAAGCT GAAACTTCAT
GCCGATGAGA TTATTCCTTT AAAGGGAGCG GAGCTTGCAG CAAAGATGAA TGCTCACTCA
GCCGAGCACT TGATGGCTAT ATCCGACGAG GGCATTACGG CTCTTGCAAA ATCGGGAACG
GTTGCCGTTC TTCTCCCTGC GACTTCTTTC TTTTTGATGT CGCCTATTTA TGCACCTGCA
AAAAAGATGA TTGAAGAAGG CGTAAGGGTC GCCCTTGCAA CCGATTACAA CCCCGGAAGC
AGCCCGACAG AAAACCTGCA AATGGCCATG TGGGCTGCCT GTTATAAGAT GAAACTTTTA
CCTGCACAGA TTTTACGCGG CGTTACAATA AATGCAGCTT ATGCAATAGA TCGTGAAAAA
ACTATAGGCA GCATCGAAGA AGGAAAGCAG GCTGACCTTG TTATCTTTGA TGCCCCGAAT
ATAGATTTCC TTGTTTATCA TTTCGGTGTA AATTCCGTCG ATCAGGTTTG GAAAAAGGGA
AAGCTTGTTG CCGAAAAGGG CCGGCTTGTT TATAAGAACT GA
 
Protein sequence
MTLFISDSIF SSTEKKGVDF DKAFAGYIVV ENGLIQKVGK GDAPESLKSQ AEKIIDARGK 
TITAGLVDAH THLVHGGSRE HELAMKLAGK TYLEIHASGG GIFSTVRATR AASKEELTQK
ALTSLDRMLI HGTTTAESKS GYGLDMETEI KCLEINSYLN KNHPIDIVST YMGAHATPPE
FKDNKEGYIK FMIEEVMPEV KKRGLAEFSD AFCEDKIFSV EETERIMKAA ADLGFKLKLH
ADEIIPLKGA ELAAKMNAHS AEHLMAISDE GITALAKSGT VAVLLPATSF FLMSPIYAPA
KKMIEEGVRV ALATDYNPGS SPTENLQMAM WAACYKMKLL PAQILRGVTI NAAYAIDREK
TIGSIEEGKQ ADLVIFDAPN IDFLVYHFGV NSVDQVWKKG KLVAEKGRLV YKN