Gene TM1040_0503 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_0503 
SymbolhisS 
ID4078249 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp530303 
End bp531805 
Gene Length1503 bp 
Protein Length500 aa 
Translation table11 
GC content61% 
IMG OID638005799 
Producthistidyl-tRNA synthetase 
Protein accessionYP_612498 
Protein GI99080344 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0124] Histidyl-tRNA synthetase 
TIGRFAM ID[TIGR00442] histidyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.315875 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCAAAG CCAAGAAACA GCCCCGTCCC AAGGCAATTA CGCCCAAAGG ATTCCGTGAC 
TATTTCGGCG AAGAGGTGAC CCAGCGCACG CATATGCTGG CCACCATTGC GGAGGTCTAT
CATCATTACG GGTTTGAGGC GCTTGAGTCT TCTGCCGTTG AAACGGTCGA GGCTCTGGGC
AAGTTCCTCC CCGATGTGGA TCGCCCGAAT GAGGGCGTCT TTGCCTGGCA AGAGGCCGAG
GACGATGGCA AAGGTGACTG GATGGCGCTG CGCTATGATC TGACAGCGCC TCTGGCGCGG
GTTTATGCCC AGCACCGTAA CGATCTGCCA AGCCCCTATC GCCGCTATGC GATGGGGCCT
GTGTGGCGCA ACGAGAAGCC CGGCCCCGGT CGGTTCCGTC AGTTCTATCA ATGCGATGCG
GATACCGTTG GCACCGCGTC GATGGCAGCA GATGCCGAGA TCTGCATGAT GCTCTCGGAT
ACGCTGGAAA AGGTCGGCAT CCCGCGTGGC GACTATCTGG TGCGCGTCAA CAACCGCAAG
GTTCTGAACG GTGTTCTCGA GGCGATGGGC CTGCTCGAGG ACGACCCCAA GCGCGACGAC
GTCCTGCGCA CCATCGACAA GTTCGACAAG GTCGGCGAAA GCGGCGTGCG TGAGCTTCTG
GGTAAAGGGC GGCTTGACGC CTCTGGAGCC TATATCGACG GCGTGGGCCT TGAAGATCAT
CAGGCCGAGC CGGTTTTGGC GTTCCTGACT TCGAAGGGCG ACACCGTTAC CGAGACCATG
ACCAACCTGC GGGCTGCCGT AGGGGACAGC AAGGTCGGCC AGGAGGGCAT CGCCGAGCTG
GAGCTGATGG GGTCGCTCTT TGCCGCTGCG GGCTATGGTG AAGACCGCAT CCTGATCGAC
CCGTCGATTG TGCGTGGCCT TGGCTACTAC ACCGGTCCGG TATTTGAGGC GGAGCTGACC
TTTGAAATCT TTGACGAAAA AGGCCGTAAG CGCCAGTTCG GCTCGGTTGC AGGCGGTGGG
CGCTATGACG GTTTGGTGAA ACGTTTCACC GGCCAAGAAG TCCCTGCCGT GGGGCTCTCG
ATTGGCGTGG ACCGCCTGTT GGCGGCGCTG CGCGAAAAGG GCCGTTTGGC CGCGACCCCG
ACGGGGCCGG TTGTGGTCAC GGTGATGGAT CGTGATCGCA TGGCGGACTA CCAGGCGATG
GTGGCTGAGC TGCGTCAGGC GGGCATTCGC GCCGAGGTCT ATCTCGGCAA TCCCAAGAAC
TTTGGCAATC AGTTGAAATA CGCCGACAAG CGCCATTCGC CGATTGCGGT CATCGAAGGG
GGTGACGAAA AGGATCGTGG CGTGGTCCAG ATCAAGGACC TGATCCTCGG CGCCAAGATC
GCAGAAAGCG CCACCTTGGA AGAGTGGAAG GAACGCCCCA GCCAGTTTGA AGTGCCCCGC
ACGGAACTGG TTGCCAAGGT GCGTGAGATC CTCGTCAGCC AATCCAGCGA TCGGGAGGGC
TGA
 
Protein sequence
MAKAKKQPRP KAITPKGFRD YFGEEVTQRT HMLATIAEVY HHYGFEALES SAVETVEALG 
KFLPDVDRPN EGVFAWQEAE DDGKGDWMAL RYDLTAPLAR VYAQHRNDLP SPYRRYAMGP
VWRNEKPGPG RFRQFYQCDA DTVGTASMAA DAEICMMLSD TLEKVGIPRG DYLVRVNNRK
VLNGVLEAMG LLEDDPKRDD VLRTIDKFDK VGESGVRELL GKGRLDASGA YIDGVGLEDH
QAEPVLAFLT SKGDTVTETM TNLRAAVGDS KVGQEGIAEL ELMGSLFAAA GYGEDRILID
PSIVRGLGYY TGPVFEAELT FEIFDEKGRK RQFGSVAGGG RYDGLVKRFT GQEVPAVGLS
IGVDRLLAAL REKGRLAATP TGPVVVTVMD RDRMADYQAM VAELRQAGIR AEVYLGNPKN
FGNQLKYADK RHSPIAVIEG GDEKDRGVVQ IKDLILGAKI AESATLEEWK ERPSQFEVPR
TELVAKVREI LVSQSSDREG