Gene Cyan8802_1211 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCyan8802_1211 
SymbolhisS 
ID8390522 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8802 
KingdomBacteria 
Replicon accessionNC_013161 
Strand
Start bp1231314 
End bp1232603 
Gene Length1290 bp 
Protein Length429 aa 
Translation table11 
GC content44% 
IMG OID644979223 
Producthistidyl-tRNA synthetase 
Protein accessionYP_003136974 
Protein GI257059086 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0124] Histidyl-tRNA synthetase 
TIGRFAM ID[TIGR00442] histidyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTACAA TTCAAACATT ACCAGGAACA AGAGATATTT TGCCAGAAGA GATCGGATAC 
TGGCAATACG TCGAGACAAT TACTACCGAA ATACTCAGTC GCGCGATGTA TCAAGAAATT
CGTGCCCCTA TTTTTGAGCA AACCTCCTTG TTTGAACGGG GGATTGGAGA AGCCACTGAC
GTAGTGGGCA AAGAAATGTA TACTTTTAAA GATAGGGGCG ATCGCTCTGT TACTCTGCGT
CCCGAAGGAA CCGCAGGAGT GGTTCGTGCT TATCTGCAAA ATAACCTCTA TGCAACTGGA
GGAGTGCAAC GTCTTTGGTA TCAAGGCCCC ATGTTTCGTT ACGAACGTCC CCAAGCTGGT
CGTCAACGAC AATTTCATCA AATTGGTTTA GAATTATTAG GAAGTGGTGA TCCTCGCGCG
GATGTAGAAG TCATTGCCTT AGCAACCGAT ATCCTCAAAA AATTGGGGCT ACAAAGTTTA
AAATTAGACC TGAATTCTGT AGGTGATCGC AACGATAGAC AACGCTATCG AGAAGCTTTA
GTTAACTATT TTTTACCCTA TAAAAATGAG TTAGATCCTG ACTCCCAAGA TCGCTTAGAA
AGAAACCCCC TACGCATCCT TGATAGCAAA AATCAACGCA CAAAAGAAAT CAATCAAAAT
GCCCCCAGTA TTTTACAGTA TCTAGGGGAT CAATCTAAGA AACATTTTGA TCAAGTTCAG
CAATTATTAA CTGACTTAAA TATTAGTTAT CAACTCAACC CTTGCTTAGT TCGCGGCCTA
GATTACTATA CCCATACTGC TTTTGAAATT CAATCCGATG ATTTAGGGGC TCAAGCAACG
GTTTGCGGCG GCGGACGGTA TGATGGGTTA GTGGCCGAAC TAGGGGGACC TGATACCCCT
GCGGTGGGAT GGGCGATCGG AATGGAACGA CTGATTATTC TCCTCAAACA ACGCCAAACT
GTCCCCCATT GCGTCCCTGA TATCTATATC GTGTCTAAGG GAGAACAAGC AGAAGCACAG
GCGTTAATTT TAGCCCAAAA ACTGCGCTTT GAAGGATTAA CCGTAGAATT AGACCTCAGT
GGAAGTGCCT TCGGGAAGCA ATTTAAACGG GCCGATCGCA GTGGGGCGAT CGCTTGTATT
GTCTTAGGAG ATGAAGAAGC AGCTACCCAA ACCCTTCAAC TCAAATGGTT AGCGACGAAA
CAACAAGAAA CCTTGGATCA AACTCAGTTA GTTAGTGAAA TTGGGGAACT AAAAGCCCAA
TTAGCCCGAC ATAAACAAGC AACTGGCTAA
 
Protein sequence
MGTIQTLPGT RDILPEEIGY WQYVETITTE ILSRAMYQEI RAPIFEQTSL FERGIGEATD 
VVGKEMYTFK DRGDRSVTLR PEGTAGVVRA YLQNNLYATG GVQRLWYQGP MFRYERPQAG
RQRQFHQIGL ELLGSGDPRA DVEVIALATD ILKKLGLQSL KLDLNSVGDR NDRQRYREAL
VNYFLPYKNE LDPDSQDRLE RNPLRILDSK NQRTKEINQN APSILQYLGD QSKKHFDQVQ
QLLTDLNISY QLNPCLVRGL DYYTHTAFEI QSDDLGAQAT VCGGGRYDGL VAELGGPDTP
AVGWAIGMER LIILLKQRQT VPHCVPDIYI VSKGEQAEAQ ALILAQKLRF EGLTVELDLS
GSAFGKQFKR ADRSGAIACI VLGDEEAATQ TLQLKWLATK QQETLDQTQL VSEIGELKAQ
LARHKQATG