Gene Gura_1887 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGura_1887 
SymbolhisS 
ID5165163 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter uraniireducens Rf4 
KingdomBacteria 
Replicon accessionNC_009483 
Strand
Start bp2189589 
End bp2190836 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content55% 
IMG OID640549378 
Producthistidyl-tRNA synthetase 
Protein accessionYP_001230650 
Protein GI148263944 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0124] Histidyl-tRNA synthetase 
TIGRFAM ID[TIGR00442] histidyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGCGATCA GCGGCATCAA GGGTTTTAAC GACATACTTC CCGGAGAAGT GGAAAAGTGG 
CAGCACATTG AGGCAACTGC CCGGCGGGTC TTTGAATTGT ACGGATTTGC GGAGATCAGG
GTGCCGATAC TGGAGAAAAC GGAATTGTTT TGCCGTTCTA TCGGCGACTC CACCGATATC
GTCGAAAAGG AGATGTATTC CTTCACCGAT AAGGGGGAAA ACAGCGTCAC CATGCGCCCG
GAAGGAACGG CCAGCGTTAT GCGTGCCTTC ATCGAGCATA AGCTTTATGT TGCCGATGCG
GTTGCCAAGC TTTATTACAT GGGCCCCATG TTCCGTTATG AGCGGCCGCA AAAAGGTCGT
TACCGTCAGT TCCACCAGAT AGGAGCCGAG GTTACCGGGG TGACTGATCC AAAGGTTGAT
GCCCAGGTTC TGACCATGCT CTGTCATTTT TTTGCCGAAC TTGGCCTTGA CGAGCCGTCA
TTGCAGATCA ACTCTCTTGG TTGCCCGGAC TGCAGGCCGC AGTATCGGCA GGTGCTGAAG
GATTTCCTCC GTTCCAGGCT CGAGTACCTC TGCGACGACT GCAAAAGAAG GTTTGAAACC
AACCCGCTCA GGGCGCTTGA CTGTAAATCG ACCGGTTGCA AGGAGGCCAC TGTTGGAGCG
CCATCGGTAC TGGACCACCT CTGCGCCAGC TGCAACGACC ATTTTACCAG AACCAGGCAG
CATCTTGAGG CGGTCGGCAC CACCTACAGT ATCAATGCGC GCATGGTCAG GGGGCTCGAT
TATTACACCC GCACCACCTT TGAACTTGTT ACCGGCCTTC TCGGCTCCCA GAGCGCAGTT
GCCGCCGGAG GTCGTTACGA CGGCCTGATA TCCGATCTTG GCGGACCGCA ACTTCCAGGC
ATCGGCTTTG CCATGGGGGT GGAGAGGGTG GCGCTTCTCC TTGCTGAAAA GGATTTTCGC
AAACGCCCCG ACCTGTTCAT TGCCGCCCTC GGTGAATCGG CTCAAAATAT GGCATTCAGG
CTGATGTGCG GGCTGCAGAA GGAAGGCGCT GCAGTGGAGA TCGATTATGA AGGTAAAAGC
CTGAAAAGCC AGATGCGCCG CGCTGACAAG TTCAACGCTC GTTTTACCCT GATTATCGGT
GATGATGAAC TGACAAAGGG GGCTGCGGGT CTGAAAAACA TGGACGAGGG GAGCCAGTCG
GACGTAGAAC TGAACGTCGG TGTCATTGCG CAGCGGATAA AGGGGTAG
 
Protein sequence
MAISGIKGFN DILPGEVEKW QHIEATARRV FELYGFAEIR VPILEKTELF CRSIGDSTDI 
VEKEMYSFTD KGENSVTMRP EGTASVMRAF IEHKLYVADA VAKLYYMGPM FRYERPQKGR
YRQFHQIGAE VTGVTDPKVD AQVLTMLCHF FAELGLDEPS LQINSLGCPD CRPQYRQVLK
DFLRSRLEYL CDDCKRRFET NPLRALDCKS TGCKEATVGA PSVLDHLCAS CNDHFTRTRQ
HLEAVGTTYS INARMVRGLD YYTRTTFELV TGLLGSQSAV AAGGRYDGLI SDLGGPQLPG
IGFAMGVERV ALLLAEKDFR KRPDLFIAAL GESAQNMAFR LMCGLQKEGA AVEIDYEGKS
LKSQMRRADK FNARFTLIIG DDELTKGAAG LKNMDEGSQS DVELNVGVIA QRIKG