Gene HMPREF0424_0784 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHMPREF0424_0784 
SymbolhisS 
ID8709561 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGardnerella vaginalis 409-05 
KingdomBacteria 
Replicon accessionNC_013721 
Strand
Start bp887503 
End bp888951 
Gene Length1449 bp 
Protein Length482 aa 
Translation table11 
GC content43% 
IMG OID646482885 
Producthistidine--tRNA ligase 
Protein accessionYP_003374002 
Protein GI283783248 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0124] Histidyl-tRNA synthetase 
TIGRFAM ID[TIGR00442] histidyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGTAA GCCATGATTT TTCCGCAACA CCACCTATTA TAGTGGACAT GGTACAAGGC 
TCATCAATAT CGGGTTTCCC TGAATGGCTT CCTAGTGAAC GCGCTGTTGA ACAGCAAGTA
ATTGACACAC TAAGAGAAGT GTTTGAACTC AACGGTTTTA TTGGAATTGA AACGCGTGCA
GTAGAACAGG GTTCAAGCTT ATTAAAAAAA GGCGAAACCA GTAAAGAAAT TTATTTATTA
TCGCGTTTAC AAGAAGTTGG TCACGAATCC GACACTCCTG TAGAAGAACG CATTGGCTTA
CATTTCGATT TAACAGTTCC ACTCAGCCGT TACGTCGTTG AGCATACTGG CGATTTGACG
TTCCCATTTA AGCGCTGGCA AATGCAAAAA GTTTGGCGAG GGGAGCGTCC ACAAGAAGGT
CGTTTCCGTG AGTTTGTTCA AGCAGATATC GATGTCGTAG GAAATGGCGA ATTACCTTCA
CACTACGAAG TTGAACTTCC ACTTGTTATG GTTGAAGCTC TTGAGCGTTT ACGTAAGTTT
GGTCTTCCTA AGGCAACTGT TCATGCTAAT AATCGAAAGC TTTCTGAAGG TTTTTATCGC
GGTATTGGAT TAAGCGATAT CGAGGGAGTT CTTCGAGAAA TCGATAAGCT AGACAAAATT
GGCGCTGAAG AAGTATCCAA GCTTTTGGTA AAAGAATGCG GCGCTACGGC TTCTCAAGCT
GATGCATGCT TAGAATTAGC TGAACTTACT GCCGAAACAG GAGATGAGTT GAAGTCTCGT
TTCAACGAAT TATGCGATAA GCATAATATT TCTCGAAGTG ATGATAACGA GTCTTACGTT
TTAGCTTCTC AAGGTGTTGA AACTCTTGCG ATGATTGTAG ACGAAGCAGC ACGTATTCGC
CCAGGCGCTG TTGTTGCTGA TTTGAAGATT GCTCGTGGAC TTGACTACTA TACTGGTTCT
GTTTACGAAA CTTTCCTTGA TAATGCAGCA TCTCTTGGTT CAATTTGCTC TGGTGGACGT
TACGATAATT TAGTTTCACA GGGCAAGAAG AAGTATCCAG GCGTAGGACT TTCAATTGGC
ATGTCTCGAC TGCTTTCTTA TATGCTTCAC ACTGCAGGAG CTACTGTTTC CCGAGTTTCT
CCTGCTGCAG TACTTGTTGC AGTGTGGAAT GAAGAAGATC GTCCTGCATG CAATGCTATT
GCTAAAACTT TACGAGATCG AGGCATTGCA GCAGATGTTG CTCCTAGCGC AGCAAAGCTT
GGTAAGCAAA TAAAATATGC TGACAAGCTC GGCATTCCTT ATGTGTGGTT CCCTGCTGAT
AGCACTGAAT CAGAGTCATC TCACGATGAA GTAAAAAATA TTATTACTGG CGAACAGGAA
ACCGCGGATG CCACGTCTTG GCAACCGGAT ACTGTTTATG CCCGGCAGAC AGTTTCTTGC
GCTAAATAA
 
Protein sequence
MTVSHDFSAT PPIIVDMVQG SSISGFPEWL PSERAVEQQV IDTLREVFEL NGFIGIETRA 
VEQGSSLLKK GETSKEIYLL SRLQEVGHES DTPVEERIGL HFDLTVPLSR YVVEHTGDLT
FPFKRWQMQK VWRGERPQEG RFREFVQADI DVVGNGELPS HYEVELPLVM VEALERLRKF
GLPKATVHAN NRKLSEGFYR GIGLSDIEGV LREIDKLDKI GAEEVSKLLV KECGATASQA
DACLELAELT AETGDELKSR FNELCDKHNI SRSDDNESYV LASQGVETLA MIVDEAARIR
PGAVVADLKI ARGLDYYTGS VYETFLDNAA SLGSICSGGR YDNLVSQGKK KYPGVGLSIG
MSRLLSYMLH TAGATVSRVS PAAVLVAVWN EEDRPACNAI AKTLRDRGIA ADVAPSAAKL
GKQIKYADKL GIPYVWFPAD STESESSHDE VKNIITGEQE TADATSWQPD TVYARQTVSC
AK