Gene GSU1659 details

Gene Information

Locus tag: GSU1659
Symbol: hisS
ID: 2685305
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sulfurreducens PCA
Kingdom: Bacteria
Replicon accession: NC_002939
Strand:
Start bp: 1821195
End bp: 1822436
Gene Length: 1242 bp
Protein Length: 413 aa
Translation table: 11
GC content: 58%
IMG OID: 637126340
Product: histidyl-tRNA synthetase
Protein accession: NP_952710
Protein GI: 39996759
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0124] Histidyl-tRNA synthetase
TIGRFAM ID: [TIGR00442] histidyl-tRNA synthetase
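
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon, and the function names are hypothetical rather than part of any database tool.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1821195
END_BP = 1822436
GENE_LENGTH_BP = 1242
PROTEIN_LENGTH_AA = 413


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in an unspliced CDS minus one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks print True for this record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```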


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0302823
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATTACAG GTATCAAGGG ATTCAACGAC ATTCTTCCCG GGGAAGTGGA GCGGTGGCAG 
CACATCGAAG CCACAGCACG CCGGGTTTTC AGTCTCTACG GATTTTCGGA GATCCGCATC
CCGATTCTTG AAAAAACCGA ATTGTTCCGG CGGTCCATCG GCGATACCAC TGATATCGTG
GAAAAAGAAA TGTACTCCTT TGTGGATAAG GGAGAGAACG CTGTAACCAT GCGTCCTGAA
GGGACAGCAA GCGTAATGAG GTCGTACATA GAGCACAAGC TCTATGCCCA GGATCCGGTG
GCAAAGCTCT ACTACATGGG CCCCATGTTT CGATACGAGC GGCCCCAGAA AGGGCGTTAC
CGCCAGTTCC ACCAGATCGG TGCCGAGGTG ACCGGAGTTA CCGACCCCAA GGTCGATGCC
CAGGTTCTCA CCATGCTCTG CCACTATTTC GCCGAGCTTG GACTGACTGA ACCGACACTT
CAGATCAACT CCCTCGGATG CCCCGAATGC CGTCCCGCAT ATCGTCAAGC CCTGATCGAT
TTCCTCCGGG AACGGCTCGA CAGCCTCTGC GAAGACTGCA AGCGCCGCTA TCAGACCAAT
CCCCTGCGTG CGCTTGACTG CAAATCGGCC CACTGCAAGG AGGCGACCGC CTCCGCTCCG
GCCATGCTTG ATAGCCTCTG TGCAGGCTGC GACGATCATT TCACCGCTAC CCGACGCCAT
TTGGAGCGGG CAGGCACCAC TTACAGCATC AATAACCGGA TGGTCCGCGG GCTTGATTAC
TATACCCGAA CCACCTTCGA ACTGGTGACC GGCCTGCTTG GTGCTCAGAG CGCGGTTGCT
GCTGGAGGCC GGTACGACGG TCTTATCTCT GACCTTGGCG GGCCCGCCAT TCCCGGTATT
GGATTTGCCA TGGGGGTGGA GCGGATAGCC CTCCTGCTGG GGGACCAACA CTTCGTCGGC
CGGCCGGATC TCTTCATAGC CGCCCTTGGC GAGGAGGCCC AGGACGAGGC GTTTCGCCTC
ATGTGCGGCC TGCAGCGAAA CGGGGTGGCC GTGGAAATGG ACTATGAAGG CAAGAGCCTC
AAGAGCCAGA TGCGCCGGTC GGACAAGTTC AACGCGCGTT TTACCCTGAT CATCGGTGGT
GATGAGTTGG CAATAGGGGC GGCTGTACTC AAGGCAATGG ACACCGGTGT CCAGGTGGAA
GTCCCGTTGA CGCCGGAAGA GGTCGCCGCA CGAATAACGT GA
 
Protein sequence
MITGIKGFND ILPGEVERWQ HIEATARRVF SLYGFSEIRI PILEKTELFR RSIGDTTDIV 
EKEMYSFVDK GENAVTMRPE GTASVMRSYI EHKLYAQDPV AKLYYMGPMF RYERPQKGRY
RQFHQIGAEV TGVTDPKVDA QVLTMLCHYF AELGLTEPTL QINSLGCPEC RPAYRQALID
FLRERLDSLC EDCKRRYQTN PLRALDCKSA HCKEATASAP AMLDSLCAGC DDHFTATRRH
LERAGTTYSI NNRMVRGLDY YTRTTFELVT GLLGAQSAVA AGGRYDGLIS DLGGPAIPGI
GFAMGVERIA LLLGDQHFVG RPDLFIAALG EEAQDEAFRL MCGLQRNGVA VEMDYEGKSL
KSQMRRSDKF NARFTLIIGG DELAIGAAVL KAMDTGVQVE VPLTPEEVAA RIT
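
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The following is a minimal sketch of that translation, not the database's own pipeline. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with table 1 (the two differ in permitted start codons). The helper names `clean` and `translate` are hypothetical, and only the first and last lines of the gene sequence are used as sample input.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons from position 0; a stop codon is written as '*'."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First line of the gene sequence (60 bases, 20 codons).
first_line = "ATGATTACAG GTATCAAGGG ATTCAACGAC ATTCTTCCCG GGGAAGTGGA GCGGTGGCAG"
print(translate(first_line))  # MITGIKGFNDILPGEVERWQ

# Last line of the gene sequence, which ends in the TGA stop codon.
last_line = "GTCCCGTTGA CGCCGGAAGA GGTCGCCGCA CGAATAACGT GA"
print(translate(last_line).rstrip("*"))  # VPLTPEEVAARIT
```

Passing the full 1242-base gene sequence to `translate` and stripping the trailing `*` would be expected to give a 413-residue string for comparison with the protein sequence above.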