Gene VC0395_A0289 details


Gene Information

Locus tag: VC0395_A0289
Symbol: hisS
ID: 5135784
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Vibrio cholerae O395
Kingdom: Bacteria
Replicon accession: NC_009457
Strand:
Start bp: 302957
End bp: 304225
Gene Length: 1269 bp
Protein Length: 422 aa
Translation table: 11
GC content: 50%
IMG OID: 640531747
Product: histidyl-tRNA synthetase
Protein accession: YP_001216245
Protein GI: 147674318
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0124] Histidyl-tRNA synthetase
TIGRFAM ID: [TIGR00442] histidyl-tRNA synthetase
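
The Start bp, End bp, Gene Length, and Protein Length fields above are consistent with one another, which the short sketch below checks. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a stop codon that is not counted in the protein length; the function names `gene_length` and `protein_length` are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 302957
END_BP = 304225


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS: codon count minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1269
print(protein_length(length_bp))  # 422
```

Both results match the Gene Length (1269 bp) and Protein Length (422 aa) listed in the record.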


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGCAAAAA CTATTCAAGC AATCCGAGGC ATGAACGATT GTCTCCCAAC CCAGTCTCCA 
CTTTGGCAAA AAGTGGAAGG CGTGGTGAAA AATGTAATCA GCGCTTACGG TTACAGCGAA
GTTCGTATGC CAATCGTTGA GATGACTCAT CTATTTAGCC GCGCCATCGG TGAAGTGACC
GATGTGGTGG AAAAAGAGAT GTACACCTTT GAAGATCGCA ATGGTGATAG CTTAACGCTG
CGACCTGAAG GTACGGCGGG CTGTGTGCGC TCTGGTATCG AAAATGGTTT GCTGTACAAC
CAAGAGCAAC GTTTGTGGTA CATGGGACCG ATGTTCCGTC ACGAACGTCC GCAAAAAGGT
CGTTACCGTC AATTCCATCA GTGTGGTGTT GAAGTGTTTG GTTTAGATGG CCCCGATGTG
GACGCTGAAC TGATCATGAT GACGGCACGT CTGTGGCGCG AATTGGGTAT TGCACAACAT
GTGCGTTTAG AGCTCAACTC GATTGGCTCT CTAGAGGCTC GCGCTAATTA TCGCACCGCC
TTGATTGACT ATCTTGAGCA GTACCAAAAC GTACTGGATG AAGATTGCAA GCGCCGCATG
TACACCAACC CGCTGCGTGT GCTTGATTCG AAGAATCCTG ATGTACAAGC GATTTTAGGT
GATGCCCCTC AGCTCTCTGA TTATCTCGAT GCTGAATCAA AACAACATTT TGCGGGCTTG
TGTGAACTTC TGGATGCGGC GGGTATCGAA TACACGGTAA ATCAACGTTT AGTTCGCGGC
CTTGATTATT ACAACCGCAC GGTTTTTGAG TGGATCACCG AAAGTCTGGG ATCGCAAGGT
ACCGTTTGTG GCGGCGGCCG CTATGATGGC TTGGTTGAAC AACTGGGCGG TAAACCAACC
CCTGCGGTAG GTTTCGCTAT GGGCCTAGAG CGTTTAGTGC TGATGATGGA AACACTCGGT
AATACGGATG TCCGTCGCAG CGTAGATGTG TATATGGTTA CTGCAGGTGA AGGCACCATG
ATGGCGGGAA TGAAGCTTGC GGAACAGTTA CGTGAGCAAG TGCCCGGCCT ACGTGTGATG
ACTCACTTCG GTGGCGGCAA TTTTAAAAAG CAATTTAAAC GCGCGGATAA AGTGGGCGCA
GCGATTGCCT TGGTTTTGGG TGAAGATGAA GTTGCAGCCC AAACCGTTGT GGTAAAAGAT
TTGGCGGGAG GCGAGCAAAA TACTGTTGCC CAAGCTGAAG TAGCTAAACT ACTGGCACAT
TTAGCCTAA
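
The GC content field reported for this gene can be recomputed from the gene sequence. The sketch below uses the plain definition (G plus C bases divided by total bases) and ignores the spaces and line breaks used in the listing; how the source database rounds or computes its own figure is not stated in the record, so this is an assumption. The helper name `gc_fraction` is illustrative.

```python
def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases, ignoring whitespace and letter case."""
    cleaned = "".join(sequence.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return gc_count / len(cleaned)


# First ten bases of the gene sequence, used here only as a small example.
first_block = "GTGGCAAAAA"
print(gc_fraction(first_block))  # 0.4
```

Passing the full gene sequence as a single string gives the value to compare against the GC content field.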
 
Protein sequence
MAKTIQAIRG MNDCLPTQSP LWQKVEGVVK NVISAYGYSE VRMPIVEMTH LFSRAIGEVT 
DVVEKEMYTF EDRNGDSLTL RPEGTAGCVR SGIENGLLYN QEQRLWYMGP MFRHERPQKG
RYRQFHQCGV EVFGLDGPDV DAELIMMTAR LWRELGIAQH VRLELNSIGS LEARANYRTA
LIDYLEQYQN VLDEDCKRRM YTNPLRVLDS KNPDVQAILG DAPQLSDYLD AESKQHFAGL
CELLDAAGIE YTVNQRLVRG LDYYNRTVFE WITESLGSQG TVCGGGRYDG LVEQLGGKPT
PAVGFAMGLE RLVLMMETLG NTDVRRSVDV YMVTAGEGTM MAGMKLAEQL REQVPGLRVM
THFGGGNFKK QFKRADKVGA AIALVLGEDE VAAQTVVVKD LAGGEQNTVA QAEVAKLLAH
LA
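
The protein sequence follows from the gene sequence under translation table 11 (the bacterial, archaeal and plant plastid code). The sketch below is a minimal translator written for illustration, not the tool used to produce this record: it builds the codon table from the standard amino acid string, translates an initial start codon such as GTG as methionine, and stops at the first stop codon. The set of table 11 start codons and the name `translate_cds` are assumptions of this sketch; ambiguous bases such as N are not handled and raise a `KeyError`.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; table 11 shares these
# assignments with the standard code and differs only in its start codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Start codons assumed for table 11; each is read as Met at position 0.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            residue = "M"  # an initiator codon always encodes methionine
        else:
            residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "GTGGCAAAAA CTATTCAAGC AATCCGAGGC ATGAACGATT GTCTCCCAAC CCAGTCTCCA"
print(translate_cds(first_line))  # MAKTIQAIRGMNDCLPTQSP
```

The output matches the first twenty residues of the protein sequence listed above (MAKTIQAIRG MNDCLPTQSP), including the GTG start codon being read as M.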