Gene Syncc9902_0220 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSyncc9902_0220 
SymbolhisS 
ID3744114 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus sp. CC9902 
KingdomBacteria 
Replicon accessionNC_007513 
Strand
Start bp236360 
End bp237652 
Gene Length1293 bp 
Protein Length430 aa 
Translation table11 
GC content53% 
IMG OID637770389 
Producthistidyl-tRNA synthetase 
Protein accessionYP_376238 
Protein GI78183804 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0124] Histidyl-tRNA synthetase 
TIGRFAM ID[TIGR00442] histidyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGACGCAGC TGCAGAGCCT GAGGGGAATG GTGGATTTAT TGCCTCAAGC ATTACAGCGT 
TGGCAGGCTG TTGAATCAGT TGCGCGAACA CACTTTCAAC GGTCGGGTTT TGGTGAGATC
CGTACACCGG TCATGGAACC CACGGATTTG TTTTGTCGCG GCATCGGTGA AGCCACAGAT
GTTGTTGGCA AAGAGATGTA CACCTTCAAC GATCGTGGGG ATCGTTCATG CACCTTGCGT
CCAGAGGGCA CGGCATCTGT TGTCCGCGCT GCTTTGCAAC ATGGGTTGTT GAGCCAAGGG
CCTCAAAAGC TTTGGTATGC AGGCCCCATG TTTCGCTATG AGCGACCCCA GGCCGGACGC
CAACGTCAAT TCCATCAGAT CGGTGTGGAG TGGTTAGGGG CAGCAAGTGC ACGTGCCGAT
GTTGAAGTGA TTGCGTTGGC CTGGGATTTA TTGGCTTCCT TAGGCGTTGG TGGTTTGGAG
TTGGAATTGA ATAGTCTCGG TTCTACGGAT GATCGGTGCG CTTATCGCAC TGCTTTAGTG
GCCTGGTTAG AGCAACGCTC AAATCTTTTG GATGAAGACT CGCGGGCTCG CTTGAACACC
AACCCGCTAC GAATTTTGGA CTCTAAAAAC AAAGCAACTC AAGCTCTGCT TGATGGGGCT
CCAACCTTGG CAAATTCTCT CGCTCCTGAG AGTCGGGAGC GTTTTGAGGT TGTGCAGCAG
GGGCTTGCTT CACTTGGAAT TCCATTCCGC CTTAGCCCTC GATTAGTTCG CGGTTTGGAC
TACTACTGCC ATACCGCTTT TGAGATCACC AGTGACCAGC TTGGTGCCCA GGCCACGGTC
TGTGGCGGTG GTCGATACAA CGGTTTGATC GGTCAACTAG GAGGACCAGA TACCCCAGCT
GTGGGATGGG CGCTGGGCAT GGAGCGTTTG CTGTTGGTTG TTGAGGCTGC CGCCAACGCT
GATCCTGATG GGGACTCTGC TCGGCTTACA GCTGTGGTCC CACCCAATGC TTATTTGGTG
AATCGTGGCG AACAAGCTGA AAAAGCCGCC CTCATTCTTG CTCGGTCGCT GCGTTGTGCC
GGATTAGTGA TTGAACTCGA CAACTCTGGA GCATCTTTCA GCAAGCAATT CAAGCGTGCT
GATCGTTGCG GTGCACGTTG GGCTCTGGTG CTGGGTGATG AAGAAGTTGA AAAGGGAGAG
GTTCGGATTA AGCCTTTGAG TGATGAGAGT GACGATTTTT TTCTTGGACT CCATGATCTG
ACAGGGTTAC TTGCCAAGCT AACTGCCATG TAA
 
Protein sequence
MTQLQSLRGM VDLLPQALQR WQAVESVART HFQRSGFGEI RTPVMEPTDL FCRGIGEATD 
VVGKEMYTFN DRGDRSCTLR PEGTASVVRA ALQHGLLSQG PQKLWYAGPM FRYERPQAGR
QRQFHQIGVE WLGAASARAD VEVIALAWDL LASLGVGGLE LELNSLGSTD DRCAYRTALV
AWLEQRSNLL DEDSRARLNT NPLRILDSKN KATQALLDGA PTLANSLAPE SRERFEVVQQ
GLASLGIPFR LSPRLVRGLD YYCHTAFEIT SDQLGAQATV CGGGRYNGLI GQLGGPDTPA
VGWALGMERL LLVVEAAANA DPDGDSARLT AVVPPNAYLV NRGEQAEKAA LILARSLRCA
GLVIELDNSG ASFSKQFKRA DRCGARWALV LGDEEVEKGE VRIKPLSDES DDFFLGLHDL
TGLLAKLTAM