Gene PCC8801_1184 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_1184 
SymbolhisS 
ID7104888 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp1232625 
End bp1233914 
Gene Length1290 bp 
Protein Length429 aa 
Translation table11 
GC content44% 
IMG OID643474270 
Producthistidyl-tRNA synthetase 
Protein accessionYP_002371408 
Protein GI218246037 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0124] Histidyl-tRNA synthetase 
TIGRFAM ID[TIGR00442] histidyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGTACAA TTCAAACATT ACCAGGAACA AGAGATATTT TGCCAGAAGA GATCGGATAC 
TGGCAATACG TCGAGACAAT TACTACCGAA ATACTCAGTC GCGCGATGTA TCAAGAAATT
CGTGCCCCTA TTTTTGAGCA AACCTCCTTG TTTGAACGGG GGATTGGAGA AGCCACTGAC
GTAGTGGGCA AAGAAATGTA TACTTTTAAA GATAGGGGCG ATCGCTCTGT TACTCTGCGT
CCCGAAGGAA CCGCAGGAGT GGTTCGTGCT TATCTGCAAA ATAACCTCTA TGCAACTGGA
GGAGTGCAAC GTCTTTGGTA TCAAGGTCCG ATGTTTCGTT ACGAACGTCC TCAAGCTGGT
CGTCAACGAC AATTTCATCA AATTGGTTTA GAATTATTAG GAAGTGGTGA TCCTCGCGCG
GATGTAGAAG TCATTGCCTT AGCAACCGAT ATCCTCAAAA AATTGGGGCT ACAAAGTTTA
AAATTAGACC TGAATTCTGT AGGCGATCGC AACGATAGAC AACGCTATCG AGAAGCTTTA
GTTAACTATT TTTTACCCTA TAAAAATGAG TTAGATCCTG ACTCCCAAGA TCGCTTAGAA
AGAAACCCCC TACGCATCCT TGATAGCAAA AATCAACGCA CAAAAGAAAT CAATCAAAAT
GCCCCCAGTA TTTTACAGTA TCTAGGGGAT CAATCTAAGA AACATTTTGA TCAAGTTCAG
CAATTATTAA CTGACTTAAA TATTAGTTAT CAACTCAACC CTTGCTTAGT TCGCGGCCTA
GATTACTATA CCCATACTGC TTTTGAAATT CAATCCGATG ATTTAGGGGC TCAAGCAACG
GTTTGCGGCG GCGGACGGTA TGATGGGTTA GTGGCCGAAC TAGGGGGACC TGATACCCCT
GCGGTGGGAT GGGCGATCGG AATGGAACGA CTGATTATTC TCCTCAAACA ACGCCAAACT
GTCCCCCATT GCGTCCCTGA TATCTATATC GTGTCTAAGG GAGAACAAGC AGAAGCACAG
GCGTTAATTT TAGCCCAAAA ACTGCGCTTT GAAGGATTAA CCGTAGAATT AGACCTCAGT
GGAAGTGCCT TCGGGAAGCA ATTTAAACGG GCCGATCGCA GTGGGGCGAT CGCTTGTATT
GTCTTAGGAG ATGAAGAAGC AGGTACCCAA ACCCTTCAAC TCAAATGGTT AGCGACGAAA
CAACAAGAAA CCTTGGATCA AACTCAGTTA GTTAGTGAAA TTGGGGAACT AAAAGCCCAA
TTAGCCCGAC ATAAACAAGC AACTGGCTAA
 
Protein sequence
MGTIQTLPGT RDILPEEIGY WQYVETITTE ILSRAMYQEI RAPIFEQTSL FERGIGEATD 
VVGKEMYTFK DRGDRSVTLR PEGTAGVVRA YLQNNLYATG GVQRLWYQGP MFRYERPQAG
RQRQFHQIGL ELLGSGDPRA DVEVIALATD ILKKLGLQSL KLDLNSVGDR NDRQRYREAL
VNYFLPYKNE LDPDSQDRLE RNPLRILDSK NQRTKEINQN APSILQYLGD QSKKHFDQVQ
QLLTDLNISY QLNPCLVRGL DYYTHTAFEI QSDDLGAQAT VCGGGRYDGL VAELGGPDTP
AVGWAIGMER LIILLKQRQT VPHCVPDIYI VSKGEQAEAQ ALILAQKLRF EGLTVELDLS
GSAFGKQFKR ADRSGAIACI VLGDEEAGTQ TLQLKWLATK QQETLDQTQL VSEIGELKAQ
LARHKQATG