Gene DvMF_2037 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDvMF_2037 
SymbolhisS 
ID7173956 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio vulgaris str. 'Miyazaki F' 
KingdomBacteria 
Replicon accessionNC_011769 
Strand
Start bp2526655 
End bp2527935 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content68% 
IMG OID643540554 
Producthistidyl-tRNA synthetase 
Protein accessionYP_002436448 
Protein GI218887127 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0124] Histidyl-tRNA synthetase 
TIGRFAM ID[TIGR00442] histidyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones86 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCACCA ACAATTCCGA CAAGACCGCA CAAAAGGTTA CGGGCGACAA GGTGGGCACC 
ATCAAGGGGT TCGCCGACAT GTTCAGCCCC GACAGTGACG TGTTCACCTT CATGGAGAAC
ACCGCGCGCG AGGTGTTTGG CCGCTACGGC TACGCCGAAC TGCGCACCCC GCTGCTGGAA
CGCACCGAAC TGTTCTGCCG CTCCATCGGC ACCGAGACCG ACGTGGTGCA GAAGGAAATG
TACACCTTCC CCGACCGCAA GGGCCGTTCG CTGACCCTGC GCCCGGAAGC CACGGCGGGC
GTCATGCGCG CGTTCATCGA TGCGGGCCGC CACGCGCAGG AGCCGGTCTC CAAGCTGTTC
ACCACCGGCC CCATGTTCCG CTACGAGCGC CCGCAGAAGG GCCGCATGCG CCAGTTCCAC
CAGATCAACT GCGAATGCCT TGGCCCGCAG GAACCGCAGG CCGACGCCGA ACTGGTGCTG
ATGCTCATGA CCTTCCTGCG CGAACTGGGG CTGACCGACC TTTCGTTGCA GGTGAACTCG
CTGGGCTGCC GCGAATGCCG CCCGGTGTAC CGGGCCGCGC TGCGCGACTT TCTGGATTCC
ATCGACCGCG AATCGCTGTG CGAGGACTGC CGCCGCCGCA TCGACACCAA CCCGTTGCGG
GTGCTGGACT GCAAGGTGCC CACCTGCCGC GAGCTGACCG CCGAGGCCCC GCGCATCATC
GACCACAACT GCCCGGAATG CCGCAGCCAC TTCGACACGG TGCTGCGCGT GTTCGACGCC
GCGCAGTTGC CCTACGTGCT CACCCCGCGC CTGGTGCGCG GGCTGGACTA CTACAACCGC
ACCACCTTCG AGGTGGTGTC CGGCTCCATC GGCGCGCAGT CGTCGGTGGC GGGCGGCGGG
CGGTATGACG GCCTGGTGGC GCAACTGGGC GGCCCCGACG TGCCCGGCGT GGGCTTTGCC
TGCGGCATGG AACGCCTGGC CCTGATGATG CCCGCGCTGG AGAAGAAGCG GCCCGATTTC
TACATCGCCG TGCTGGACCC GGCTGCGGCG GACGCGGCCA TGCTGCTGGC GCAGGAACTG
CGCGCGGCGG GCAAGGCGGG CGAGGTGTCC TTTGCCGCGC GCGGCATCAA GGGCCAGATG
CGCCAGGCCG GACGCACCGG CGCGCGCTGC ACCCTGCTGC TGGGCGGCGA CGAGATGGCC
AACGGCACCG TTGTCATCAA GGACATGGAC AGCGGCGAGC AGCGCAGCGT GCCGCAGGGC
GAGGCCGCAA ACCACGTATA G
 
Protein sequence
MSTNNSDKTA QKVTGDKVGT IKGFADMFSP DSDVFTFMEN TAREVFGRYG YAELRTPLLE 
RTELFCRSIG TETDVVQKEM YTFPDRKGRS LTLRPEATAG VMRAFIDAGR HAQEPVSKLF
TTGPMFRYER PQKGRMRQFH QINCECLGPQ EPQADAELVL MLMTFLRELG LTDLSLQVNS
LGCRECRPVY RAALRDFLDS IDRESLCEDC RRRIDTNPLR VLDCKVPTCR ELTAEAPRII
DHNCPECRSH FDTVLRVFDA AQLPYVLTPR LVRGLDYYNR TTFEVVSGSI GAQSSVAGGG
RYDGLVAQLG GPDVPGVGFA CGMERLALMM PALEKKRPDF YIAVLDPAAA DAAMLLAQEL
RAAGKAGEVS FAARGIKGQM RQAGRTGARC TLLLGGDEMA NGTVVIKDMD SGEQRSVPQG
EAANHV