Gene TDE1442 details

Gene Information

Locus tag: TDE1442
Symbol: hisS
ID: 2741405
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Treponema denticola ATCC 35405
Kingdom: Bacteria
Replicon accession: NC_002967
Strand:
Start bp: 1487556
End bp: 1488902
Gene Length: 1347 bp
Protein Length: 448 aa
Translation table: 11
GC content: 40%
IMG OID: 637160321
Product: histidyl-tRNA synthetase
Protein accession: NP_972048
Protein GI: 42526950
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0124] Histidyl-tRNA synthetase
TIGRFAM ID: [TIGR00442] histidyl-tRNA synthetase
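
The Start bp, End bp, Gene Length and Protein Length fields above are arithmetically linked, and the relationship can be checked with a minimal sketch. It assumes that the coordinates are 1-based and inclusive (consistent with the values in this record) and that the CDS ends in a single stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count of a complete CDS, assuming the last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record above.
length = gene_length(1487556, 1488902)
print(length, protein_length(length))  # prints "1347 448"
```

The result matches the Gene Length (1347 bp) and Protein Length (448 aa) fields of this record.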


Plasmid Coverage information

Num covering plasmid clones: 36
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTGATT TAATACAACC GAAAGTTTTA AAAGGATTTA GAGATTTCCT TCCGGCAGAT 
GAGATTGAAA GAGCTCTTCT TATGGAAAGA CTGGTAAAGG TTTTTAGGGA TTACGGCTTT
GTGCCTATAG ACACCCCTGC CTTAGAATAT TCCGAGATTC TATTGAGAAA GAGCGGAGGA
GAGACCGAAA AGCAGGTTTT CCGGTTTAGC GACAATGGCG GACGGGATGT TGCTATGCGT
TTTGACTTAA CCGTTCCCTT AGCTCGATTT GTTGCAGAAC ATAAGTCCGA GATATATTTC
CCGTTTAAGC GTTACCATTT GGGAAAGGTT TGGCGCGGTG AAAAGCCTCA GGCCGGACGG
TATCGGGAAT TTTTGCAATG TGATTTTGAT ACATTGGGTT CCGATTCGGC TGCCGTTGAT
TTTGAAATTT TGCGTCTTAT TAAAAAAGCT TTAAACGAAT TAGGTGTTTC CAATTTTAAA
ATACACGTTT CCCATAGGGG TATATTTAAC CGTTTTTTAA AGTCTTTAAA CCTATCGGAA
GACAGCGAAG AGGTTTTGCG TATTGTAGAT AAACTTGCTA AGATAGGGGA GGACGAGGTT
TTAAAGCTCC TTACCGATAT AAGCTCCGAA GAAAGTGCCA AAAAAATATT GGCTTATATT
TCCGGTGTAA GCAAGGAGTT AAAAAGCGAA GACTTTGAAA AGACTCTTTC CCACTTGGAA
AACCTCGCAG GCGGCCCCGA CGAAGATACA AAACGCATGA GAGATATCTA TGCCTTGGTA
AAGGCTGTAG GTATTGAAGA TTCAATTGTT TTTGATCCTT CAATTACACG AGGTCTGGAT
TATTACACCG GTGTTGTATT TGAGACCTTT TTAAACGATT TACCATCGAT AGGCTCTGTT
TGCTCAGGCG GAAGGTACGA TAACCTTACG GCTCTTTATA TGAAGGAGTG TATTACCGGA
GTGGGTGCTT CCATAGGGCT TGACAGGCTT TTAGCTGCCC TTGAACTCTT GGGCCATCAA
AAAACAAAGG CCAGCTTTAC CGACCTCCTT ATTTTTTCTT TACCTGAAGA TGATCTGGTT
CTTTCGTATA AGATAGTAAA TTTTTTTGAA GCCGAAAAAA TAAATGCTGA AGTATATCCT
GAACCTAAAA AGATGAATCA TCAGTACACC TATGCCGAAA AAAAAGACAT AAGGTGGGGG
CTTTTCTTAG ATAAAGATTC TTGTGTGGAA GAATTCGATA AGGCCCCTCA AAGGTTTAAG
ATAAAATTAA AAGATATGAC TAATAGAACA GAGGATGAAA CGCCTCTTAG CGAGGCCGTA
AAAAAGATAA GAGCTTCAAA AAATTAA
 
Protein sequence
MSDLIQPKVL KGFRDFLPAD EIERALLMER LVKVFRDYGF VPIDTPALEY SEILLRKSGG 
ETEKQVFRFS DNGGRDVAMR FDLTVPLARF VAEHKSEIYF PFKRYHLGKV WRGEKPQAGR
YREFLQCDFD TLGSDSAAVD FEILRLIKKA LNELGVSNFK IHVSHRGIFN RFLKSLNLSE
DSEEVLRIVD KLAKIGEDEV LKLLTDISSE ESAKKILAYI SGVSKELKSE DFEKTLSHLE
NLAGGPDEDT KRMRDIYALV KAVGIEDSIV FDPSITRGLD YYTGVVFETF LNDLPSIGSV
CSGGRYDNLT ALYMKECITG VGASIGLDRL LAALELLGHQ KTKASFTDLL IFSLPEDDLV
LSYKIVNFFE AEKINAEVYP EPKKMNHQYT YAEKKDIRWG LFLDKDSCVE EFDKAPQRFK
IKLKDMTNRT EDETPLSEAV KKIRASKN
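
The sequences above are laid out in blocks of 10 characters, six blocks per row. As a rough sketch of how such text might be processed, the following strips the spacing, computes a GC fraction, and translates nucleotides to protein. It relies on translation table 11 sharing its codon-to-amino-acid assignments with the standard genetic code (the two differ in permitted start codons, which this sketch does not handle). All function names are hypothetical.

```python
import itertools

# Codon assignments of the standard genetic code, listed in TCAG order.
# Assumption: table 11 uses the same assignments; alternative start codons
# are not treated specially here.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(itertools.product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and uppercase the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence from this record.
first_row = "ATGAGTGATT TAATACAACC GAAAGTTTTA AAAGGATTTA GAGATTTCCT TCCGGCAGAT"
print(translate(first_row))  # prints "MSDLIQPKVLKGFRDFLPAD"
```

The translated row agrees with the first 20 residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.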