Gene Syncc9605_0188 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSyncc9605_0188 
SymbolhisS 
ID3736268 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus sp. CC9605 
KingdomBacteria 
Replicon accessionNC_007516 
Strand
Start bp196613 
End bp197950 
Gene Length1338 bp 
Protein Length445 aa 
Translation table11 
GC content62% 
IMG OID637774768 
Producthistidyl-tRNA synthetase 
Protein accessionYP_380519 
Protein GI78211740 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0124] Histidyl-tRNA synthetase 
TIGRFAM ID[TIGR00442] histidyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.207705 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00812888 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
GTGCTTCCAC CCGGCAGACT GAGCCGCTGT ACCTGCATCC TGCACGTGAG TCAGCTCCAG 
AGCCTCAGGG GCATGGTCGA TCTGTTGCCG GAGGTGCTTC AGCGTTGGCA GGCGGTTGAG
GCCAAAGCGC GCGAGCACTT CCAGCGTTCT GGGTTCGGTG AAATTCGGAC GCCGCTGTTG
GAGACAACGG ATCTGTTCTG CCGAGGCATC GGTGAGGGCA CCGATGTGGT GGGTAAGGAG
ATGTACAGCT TTCAGGATCG GGGCGATCGC TCCTGCACAT TGCGGCCCGA AGGAACTGCA
TCGGTGGTGC GTGCCGCCCT GCAGCACGGC TTGCTCAGCC AGGGCGCTCA GAAGCTCTGG
TATGCCGGGC CGATGTTTCG CTATGAGCGC CCCCAGGCTG GCCGACAGCG GCAGTTCCAT
CAGATCGGGG TGGAGTGGCT TGGAGCTGAG CGGGCCCGCA GTGATGTTGA GGTGATTGCC
CTGGCCTGGG ATCTGCTGGC TTCCCTCGGT GTGGGCGGTT TGCAGTTGGA GCTGAACAGC
CTTGGCACCG CTGAAGACCG TAAGGCCTAT CGCAATGCTC TGGTGGCCTG GCTTGAGCAG
CGATCGGAGG TCCTTGATCC TGATTCCCAG GCGCGGCTGA GCACCAATCC CCTGCGCATC
CTGGATTCCA AAAACAAGAA CACCCAGGCG CTGCTGGAAG ATGCACCCAC GCTGGTGGAT
GCTCTCTCTG ATGCCAGCCG TGAGCGTTTC GAAGAGGTAC AGCGTGGGCT GACTTCCCTT
GGGATTCACT TCCTGTTGAA TCCCCGATTG GTGCGTGGCT TGGATTACTA CAGCCACACA
GCTTTCGAGA TCACCAGCGA TCAACTTGGG GCCCAGGCCA CTGTTTGTGG GGGTGGTCGT
TACGACGGCC TGATCGGCCA GTTGGGAGGT CCTCAGACCC CTGCCATCGG ATGGGCCCTT
GGCATGGAAC GGTTGTTGCT GGTGTTGGAA GCGGCAGCGA AGGCGGATCC TCAGGGTGAC
GCTGCTCGGC TGACGGCAGC TGCAGCCCCT GACGTTTTTC TCGTGAACCG TGGTGACGAG
GCCGAGTGTG TCGCTCTTGC CTTGGCACGG GACCTCCGGG CCGCTGGGCT GCGGGTGGAG
TTGGATGGGT CGGGCTCGGC CTTCGGAAAG CAGTTCAAAC GGGCGGATCG AAGTGGTGCG
AGCTGGGCCA TGGTGCTCGG AGACGAGGAG GTCGAGCGCG GTGAGGTGCG CCTGAAGCGG
TTACAGCAGC AGGCTGAGGA ATCCACCGTT GCGCTTGCGC CAGTTGCCGC GATCGTGGAG
AAACTGCTCA CCCCCTGA
 
Protein sequence
MLPPGRLSRC TCILHVSQLQ SLRGMVDLLP EVLQRWQAVE AKAREHFQRS GFGEIRTPLL 
ETTDLFCRGI GEGTDVVGKE MYSFQDRGDR SCTLRPEGTA SVVRAALQHG LLSQGAQKLW
YAGPMFRYER PQAGRQRQFH QIGVEWLGAE RARSDVEVIA LAWDLLASLG VGGLQLELNS
LGTAEDRKAY RNALVAWLEQ RSEVLDPDSQ ARLSTNPLRI LDSKNKNTQA LLEDAPTLVD
ALSDASRERF EEVQRGLTSL GIHFLLNPRL VRGLDYYSHT AFEITSDQLG AQATVCGGGR
YDGLIGQLGG PQTPAIGWAL GMERLLLVLE AAAKADPQGD AARLTAAAAP DVFLVNRGDE
AECVALALAR DLRAAGLRVE LDGSGSAFGK QFKRADRSGA SWAMVLGDEE VERGEVRLKR
LQQQAEESTV ALAPVAAIVE KLLTP