Gene Spro_3608 details


Gene Information

Locus tag: Spro_3608
Symbol: hisS
ID: 5605858
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Serratia proteamaculans 568
Kingdom: Bacteria
Replicon accession: NC_009832
Strand:
Start bp: 3986929
End bp: 3988203
Gene Length: 1275 bp
Protein Length: 424 aa
Translation table: 11
GC content: 59%
IMG OID: 640939159
Product: histidyl-tRNA synthetase
Protein accession: YP_001479832
Protein GI: 157371843
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0124] Histidyl-tRNA synthetase
TIGRFAM ID: [TIGR00442] histidyl-tRNA synthetase
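
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption; the record does not state its convention) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative only.

```python
# Coordinates copied from the record above.
# Assumption: positions are 1-based and inclusive.
START_BP = 3986929
END_BP = 3988203


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

Under these assumptions the computed values agree with the Gene Length (1275 bp) and Protein Length (424 aa) fields listed in the record.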


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.602511
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCAAAGA ACATTCAAGC CATTCGCGGC ATGAACGACT ACCTGCCGGA AGAAACGGCA 
TTATGGCAGC GTATTGAAGG CATCCTCAAG CAGGTGCTTA GCGGCTACGG TTACAGCGAA
ATCCGCTTGC CGATTGTAGA GCAGACCCCG TTATTCAAAC GCGCGATCGG CGAAGTGACC
GACGTCGTGG AAAAAGAGAT GTATACCTTC GACGACCGCA ATGGCGAAAG CCTGACGCTG
CGTCCGGAAG GCACGGCTGG TTGCGTTCGC GCCGGCATCG AACATGGTCT GCTGTACAAT
CAGGAGCAGC GTCTGTGGTA CGTCGGTCCG ATGTTCCGCT ACGAGCGCCC GCAAAAAGGC
CGCTATCGCC AGTTCCATCA ACTGGGGGCC GAAGTGTTTG GCCTGCAGGG CCCGGATATT
GACGCAGAGC TGATCCTGCT GAGCGCCCGC TGGTGGAAAG CGCTGGGGAT TTCCGAGCAC
GTCAAGCTGG AGCTGAACTC CATCGGTTCA CTGGAAGCGC GCGCCAACTA CCGCGATGCT
CTGGTGGCCT TCCTGGAACA GCATCAAGAG AAGTTGGACG AAGATTGCAA ACGCCGTATG
TACAGCAACC CGCTGCGCGT ACTGGACTCG AAAAACCCGG AAGTGCAGGC GCTGTTGAAC
GATGCGCCAC GCCTGTCCGA GTATCTGGAT GCAGATTCCA AAGCCCACTT CGAAGGTCTG
TGTGAACTTT TGGCGCAGGC CGGCATCCCA TATACCATCA ATGAACGCCT GGTGCGCGGT
CTGGACTACT ACAACCGCAC GGTATTCGAA TGGGTAACCA CCAGTCTGGG CGCTCAGGGC
ACGGTATGTG CCGGTGGTCG TTATGACGGC CTGGTAGAGC AACTGGGCGG GCGTGCAACG
CCGGCGGTCG GTTTCGCCAT GGGCCTGGAG CGCCTGGTGT TGCTGGTTCA GGCGGTTAAC
CCAGAGTTCA AGGCCGCGGC GACCATCGAC GTGTATGTGA TCTCCTCCGG TGCCGGTACT
CAGAGTGCAG CAATGCAGCT GGCGGAGCGC GTGCGTGATG CGGCGCCACA GCTCAAACTG
ATGACCAACT ACGGCGGTGG TAACTTTAAG AAGCAGATCA CCCGTGCGGA TAAGTGGGGC
GCGCGCGTCG CCTTGATCCT GGGTGAGAAC GAAGTCGCAG CCCAGCAGGT GGTGGTTAAG
GACCTGCGCA GTGGTGAACA AGAAACGCTG GCGCAAAGCG AAGTCGCAGT GCGCCTGGCT
CTGATGTTAG GTTAA
 
Protein sequence
MAKNIQAIRG MNDYLPEETA LWQRIEGILK QVLSGYGYSE IRLPIVEQTP LFKRAIGEVT 
DVVEKEMYTF DDRNGESLTL RPEGTAGCVR AGIEHGLLYN QEQRLWYVGP MFRYERPQKG
RYRQFHQLGA EVFGLQGPDI DAELILLSAR WWKALGISEH VKLELNSIGS LEARANYRDA
LVAFLEQHQE KLDEDCKRRM YSNPLRVLDS KNPEVQALLN DAPRLSEYLD ADSKAHFEGL
CELLAQAGIP YTINERLVRG LDYYNRTVFE WVTTSLGAQG TVCAGGRYDG LVEQLGGRAT
PAVGFAMGLE RLVLLVQAVN PEFKAAATID VYVISSGAGT QSAAMQLAER VRDAAPQLKL
MTNYGGGNFK KQITRADKWG ARVALILGEN EVAAQQVVVK DLRSGEQETL AQSEVAVRLA
LMLG
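
The sequences above are printed in space-separated blocks of 10 characters, so they need to be joined before any downstream use. The following is a minimal sketch of that step, using only the first and last lines of the gene sequence as sample input; the helper names `join_blocks`, `codons`, and `gc_fraction` are hypothetical and not part of any database tool.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def codons(seq: str) -> list[str]:
    """Split a sequence into consecutive 3-letter codons, dropping any remainder."""
    usable = len(seq) - len(seq) % 3
    return [seq[i:i + 3] for i in range(0, usable, 3)]


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a non-empty nucleotide sequence."""
    return sum(base in "GC" for base in seq) / len(seq)


# First and last lines of the gene sequence, copied from the record above.
first_line = "GTGGCAAAGA ACATTCAAGC CATTCGCGGC ATGAACGACT ACCTGCCGGA AGAAACGGCA"
last_line = "CTGATGTTAG GTTAA"

first_codons = codons(join_blocks(first_line))
last_codons = codons(join_blocks(last_line))
print(first_codons[0], last_codons[-1])
```

In this record the first codon is GTG, an alternative start codon under translation table 11 that is read as methionine at initiation, and the final codon is TAA, a stop codon. Both observations are consistent with the protein sequence beginning with M and ending at LMLG.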