Gene WD1076 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagWD1076 
SymbolhisS 
ID2737672 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameWolbachia endosymbiont of Drosophila melanogaster 
KingdomBacteria 
Replicon accessionNC_002978 
Strand
Start bp1033248 
End bp1034474 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content35% 
IMG OID637173229 
Producthistidyl-tRNA synthetase 
Protein accessionNP_966797 
Protein GI42520882 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0124] Histidyl-tRNA synthetase 
TIGRFAM ID[TIGR00442] histidyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTAATC AAACGGTTAG AGGAACAAAA GATCTCCTGT TTGATGAATG GTACAAGTTT 
AAATACATAG AGCAAACAGC AAACAGAATC TCAAGTTTAT ACGGCTTTTT GCCCGCCCAA
ACTCCAATAT TTGAGTGCAC GGAGGTTTTT ACAAAAACCT TAGGTGATAG CTCAGACATC
ATCACTAAAG AGATGTATAG CTTCAATGAT AAAGGGGGAA AGAGCATAAC TTTACGTCCT
GAGTTCACTG CAGCGATTGT CAGGCTACTC ATTGAAAAAA AACTGCAAAC ACCGATAAAA
TTATTCTCAA CAGGACCTGC ATTTCGCTAT GAAAGACCGC AAAAGGGAAG ACAAAGGCAA
TTTCATCAGA TAAATTTTGA GGTTTTTGGC ATAGAAGATC CAAAAGCTGA TATTGAATTG
ATATCACTCG CTCAGCACTT ACTAACTGAA TTTGGCATCA ATAAAAATGT AAAGCTGGAA
ATCAACTCTT TAGGTGATGG TGAAACAATA ACTAAATATA GAGAAGCTTT GATCTTATAC
TTTACAAAGT ATCAAAATGA TCTATCAGAA GATAGCAAAA ATAGGTTGAT CAAAAATCCA
CTCAGAATAC TGGATTCCAA GGATGAGAAA GATAAATCAA TAATTTCTGA TGCGCCTAAA
ATCAGCAATT ATTATACGAA AGAGTCCTCA GATTTCTTTG AACAAATACT AAATGGATTA
ACAATTCTTG ATATACCTTA TACCGTAAAC AATAAACTAG TTCGAGGTTT GGATTATTAT
TGCCACACAG TGTTTGAGTT TGTTACAGAG GATTTAGGTG CACAAGGGGC AGTTTTTGCC
GGTGGAAGAT ACGATAATTT AGTATCTTCA GTAGGTGGAA AACACACTCC AGCAATAGGA
TTTGCAGGGG GTATTGAGCG CATAATGGAG TTAATCAATT ATTCTCCGAA AGAAGAGCGA
CCTATTTACC TAATTCCAAT CGGTAGAGAG GCTGAAAAAC ATGCTTTAAC ACTTGCAAAT
GAATTGCGCA GAAATGGTTT ATATGTAATC TATGAATATA GCGGAACGCT CAGAACCCGA
ATGAAAAAAG CAAATCAAGC AAATGCTAAA GCTGCTTTAA TCTTTGGCGA TGAAGAATTG
AGTAGTAAAA CTTTAAAGAT TAAAAACATG GATACAGGTG AAGAAAAAAT AATTGCTCGC
GATAACACAA TAGAAAATAT CTATTAG
 
Protein sequence
MTNQTVRGTK DLLFDEWYKF KYIEQTANRI SSLYGFLPAQ TPIFECTEVF TKTLGDSSDI 
ITKEMYSFND KGGKSITLRP EFTAAIVRLL IEKKLQTPIK LFSTGPAFRY ERPQKGRQRQ
FHQINFEVFG IEDPKADIEL ISLAQHLLTE FGINKNVKLE INSLGDGETI TKYREALILY
FTKYQNDLSE DSKNRLIKNP LRILDSKDEK DKSIISDAPK ISNYYTKESS DFFEQILNGL
TILDIPYTVN NKLVRGLDYY CHTVFEFVTE DLGAQGAVFA GGRYDNLVSS VGGKHTPAIG
FAGGIERIME LINYSPKEER PIYLIPIGRE AEKHALTLAN ELRRNGLYVI YEYSGTLRTR
MKKANQANAK AALIFGDEEL SSKTLKIKNM DTGEEKIIAR DNTIENIY