Gene Rcas_0121 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_0121 
Symbol 
ID5537581 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp143241 
End bp144605 
Gene Length1365 bp 
Protein Length454 aa 
Translation table11 
GC content61% 
IMG OID640892286 
Producthistidyl-tRNA synthetase 
Protein accessionYP_001430275 
Protein GI156740146 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0124] Histidyl-tRNA synthetase 
TIGRFAM ID[TIGR00442] histidyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCAGCC GGGTTCAGAA CATTCGAGGC ATGCGCGATC ACCTGCCATC CGCCATGATC 
CTGCGGCAGC ACATTATTAA CACGCTGACT TCCGTTTTTG AGCGTTATGG TTTTGAGCCG
TTGCAGACGC CAATTGTCGA ATATGCCGAA ACGCTCGACG GCAAAATCGG CGATGATGAG
AAGTTGATCT ATCGTTTCGA GGATCACGGC GGGCGGAAGG TGGCGTTGCG CTACGATCAG
ACGGTGCCGC TGGCACGGGT TGTCGCGCAG TATCAGGGGC AACTCACGTT TCCATGGCGG
CGTTATGCCA TTGGTCAGAG TTATCGCGGT GAACGTCCTG GTCGTGGCCG CTACCGCGAG
TTGTGGCAGG CTGATATCGA TATCGTCGGA TCGGCGTCGC CGGTGGCGGA CGCCGAGATT
CTTGCAGTGT TGACCGATGC GCTGACCGCG CTTGGATTCA CCGGTTTTAC GACCCTCATC
AGCCATCGTC AGGTTCTTGG CGGCATCGCG CGCGTTTCTG GTCTTGATGA TGCATCCGCC
GGCAATGTCT ACCGCGCCAT CGACAAACTC GACAAGATTG GCATTGATGG CGTGCGCAAC
GAATTGTTGC AGAGCGGCGT GACGCCTGAC GCCGCTGAGC GCATTCTGGC GCTGATCGAT
CTGTATGGCA GCGCGGACGA TGTGCTGAAT GAACTGGCGC AGCGGTTGCA CGACGACGAG
CGGGCGCAAC AGGCAATCGA CAATCTGCGC GCGATCATCG GCTATGCGCG CGCTATGGGC
GTGCCTGAGG AGCGGATCGC GATCGCACCG CGCCTGGCGC GCGGGTTGTC GTACTACACC
GGCGCCGTCT TCGAATCGAT CATCCAGGAG CCGCCGATGG GGTCGCTGCT CGGCGGCGGG
CGCTACGATG AGTTGATCGG CATGTTCGCC GGGCGCTCGA TCCCTACGGT CGGGCTGGCG
TTTGGCATCG AACGGTTGCA CGATGTGATG GAAGCATTGG GAATGGGACC GGAGTCACGG
ACTATTGCGG TGGCGCTGGT GACGCTCTTC AACCCTGAGA TGGCGATGGA GAGTCTGGGT
TTGGCGCAGG AGTTGCGCCG GGCAGGGTTG ATGATCGAGA CGACGCTCGA CCCCTCCGAA
AAACTTGGGC GGCAACTCCA GTATGCGGAC CGACGCGGCA TTCCGTATGC GCTGGTGCTT
GGTCCCGATG AACTGGCGCG CGGAGAAGTC GTTGTGAAAC ATCTGCGCAG TGGTGAGCAA
CGGAGTGTGG CGCGCAGCGC CGTTGCCGGC ATGCTGCACG CGGCTGCGGA AGCGCAGCGC
ACCCCGCGAA TAGCGAATGA GCAGGGAGGC ATCCATGAGC GATAG
 
Protein sequence
MSSRVQNIRG MRDHLPSAMI LRQHIINTLT SVFERYGFEP LQTPIVEYAE TLDGKIGDDE 
KLIYRFEDHG GRKVALRYDQ TVPLARVVAQ YQGQLTFPWR RYAIGQSYRG ERPGRGRYRE
LWQADIDIVG SASPVADAEI LAVLTDALTA LGFTGFTTLI SHRQVLGGIA RVSGLDDASA
GNVYRAIDKL DKIGIDGVRN ELLQSGVTPD AAERILALID LYGSADDVLN ELAQRLHDDE
RAQQAIDNLR AIIGYARAMG VPEERIAIAP RLARGLSYYT GAVFESIIQE PPMGSLLGGG
RYDELIGMFA GRSIPTVGLA FGIERLHDVM EALGMGPESR TIAVALVTLF NPEMAMESLG
LAQELRRAGL MIETTLDPSE KLGRQLQYAD RRGIPYALVL GPDELARGEV VVKHLRSGEQ
RSVARSAVAG MLHAAAEAQR TPRIANEQGG IHER