Gene Rpal_1369 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_1369 
SymbolhisS 
ID6409026 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp1440384 
End bp1441913 
Gene Length1530 bp 
Protein Length509 aa 
Translation table11 
GC content63% 
IMG OID642711268 
Producthistidyl-tRNA synthetase 
Protein accessionYP_001990384 
Protein GI192289779 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0124] Histidyl-tRNA synthetase 
TIGRFAM ID[TIGR00442] histidyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.0999423 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTGAGA AACCGAAAAA ACCGCAGAAA CTGCGCGCCC GGCTGCCGCG GGGCCTCGCC 
GATCGTGGCC CTGCCGAAAT CGCGGCGACG CGCGCGATGG TGGAGAAGAT CCGCGAGGTG
TACGAACGCT ACGGCTTCGA GCCGGTGGAG ACCCCGGCGT TCGAATACAC TGACGCGCTC
GGCAAGTTCC TGCCCGATCA GGACCGTCCC AACGAGGGCG TGTTCTCGCT GCAGGACGAC
GACGAGCAGT GGATCAGCCT GCGCTACGAT CTCACCGCAC CGCTGGCGCG CTACGTCGCC
GAGAACTTCG ATCAGCTCCC GAAGCCATAT CGCAGCTATC GGTTCGGCTG GGTATTCCGC
AACGAGAAGC CCGGCCCCGG CCGCTTCCGG CAGTTCATGC AGTTCGACGC CGACACCGTC
GGCTCCGGCT CGCCCGCCGC CGACGCCGAA ATGTGCATGA TGGCCGCCGA CACCATGGAA
GCGCTCGGCA TCCCGCGCGG CAGCTATCTC GTAAAGCTGA ATAATCGAAA GATTCTGGAC
GGCGTTCTCG AAGCGATTGG CATTGGTGGC GATGAACACA TTAAGCAGAG GCTAGTCGTC
CTCCGTGCGA TCGACAAATT GGATCGCTTG GGCCTGCAAG GTGTGGAACA GCTTCTTGGT
GAAGGGCGGA AGGACGAGAG CGGCGACTTC ACAAGAGGAG CCTCGTTAAA CGCTGGCCAG
ATAAGGGACG TCATCACCCT GCTAAACTTC GCTGGTTGGG GGGACATCGT AGACGGTTCG
AATACGCACA CTCTCGATGA ATGGGAGGGG TTGCGCTTTT CAGTGAACTC AACCTTCTCT
GCAGGCATTC AGGATCTACG CCAGATTACG AAGATCACAG AGGCCTCGGG CTACGACACT
GGCCGAATTC GAGTTGATAA CACTGTCGTC CGCGGCCTTG AGTACTACAC CGGGCCCGTA
TTCGAGGTCG AACTGCTGCT CGATACCAAG GATGAGAAGG GCCGCCCGGT GCGGTTCGGC
TCGGTCGGCG GCGGCGGGCG CTATGACGGC CTGGTGTCGC GCTTCCGCGG CGAGCCGGTG
CCGGCGACCG GGTTCTCGAT CGGCGTGTCG CGGCTGCAGG CGGCGCTGAC GCTGATCGGC
CAGCTCGGCA ACAAGCCGCA GGCCGGCCCC GTCGTCGTCA CCGTGTTCGG CGGCGAGATC
GCGGGCTACC AGAAGATGGT CGCCACCTTG CGCAAAGCCG GCATCCGGGC GGAATTGTAC
TTGGGCAATC CCAAGCACTC GCTCGGCCAG CAGATGAAAT ACGCCGACAA GCGCAACTCG
CCCTGCGCCA TCATCCAGGG CTCGGACGAG AAGCAGCAGG GCATCGTGCA GATCAAGGAC
CTGATCCTGG GCGCCGAACT GGCGTCGCTG GAGAAGGACC GCGACGAGTA TTTGAAGAAG
CAGGCCGAAG CGCAGTTCTC CTGCAAGGAA GATGAAATGG TCGCCAAGGT GCAGGAGCTG
CTGCAGCGCC GCGGGGTGGC GTGGGGATAG
 
Protein sequence
MAEKPKKPQK LRARLPRGLA DRGPAEIAAT RAMVEKIREV YERYGFEPVE TPAFEYTDAL 
GKFLPDQDRP NEGVFSLQDD DEQWISLRYD LTAPLARYVA ENFDQLPKPY RSYRFGWVFR
NEKPGPGRFR QFMQFDADTV GSGSPAADAE MCMMAADTME ALGIPRGSYL VKLNNRKILD
GVLEAIGIGG DEHIKQRLVV LRAIDKLDRL GLQGVEQLLG EGRKDESGDF TRGASLNAGQ
IRDVITLLNF AGWGDIVDGS NTHTLDEWEG LRFSVNSTFS AGIQDLRQIT KITEASGYDT
GRIRVDNTVV RGLEYYTGPV FEVELLLDTK DEKGRPVRFG SVGGGGRYDG LVSRFRGEPV
PATGFSIGVS RLQAALTLIG QLGNKPQAGP VVVTVFGGEI AGYQKMVATL RKAGIRAELY
LGNPKHSLGQ QMKYADKRNS PCAIIQGSDE KQQGIVQIKD LILGAELASL EKDRDEYLKK
QAEAQFSCKE DEMVAKVQEL LQRRGVAWG