Gene PCC8801_2914 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_2914 
Symbol 
ID7104459 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp3004707 
End bp3006362 
Gene Length1656 bp 
Protein Length551 aa 
Translation table11 
GC content35% 
IMG OID643475950 
Productpseudouridine synthase 
Protein accessionYP_002373066 
Protein GI218247695 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0564] Pseudouridylate synthases, 23S RNA-specific 
TIGRFAM ID[TIGR00005] pseudouridine synthase, RluA family 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATCAAG TAATTCTTGA GAAGATATCA GAATTTGTGA CAGAAAAGAC AGCTTTAAAG 
GATTTAGCGG TTAATTATTG GTATGAAGGA TATTGTCCCC AAAGTGGTGA ATTTTTAAGA
CTTCCCCGTA ATAGAATGAT AGAAGCGATC GCCCTCGGTT TAATGAAACA ATTAGCTGAA
GATAATCGCT ATAGTTATGA AGGAAAAATG TATGGTGTTT TATTAGTAGA AACCCCACAA
GGAGAACTGG CAGTACTAAA AGCGTTTTCG GGTCTTCTTT TAGGAAAAAA TGTTGTTGAG
GGATGGGTTC CTTCTCTATT GGGAAAGGAA AAAATAACTT TAGAAGAAAT TCAAACATTA
GAACAACTAG AAAATCTCAA ACATCAGATA GTTGCCTTAC AAAAAATTTC CGAAAGACAG
GATTATCAAG ACTTATCTAA AGAATGGAAA ACCCGTTTAA ACAATTTAGC AATTATTCAT
CGTGAACGTA AATTAAAAAG ACAAGAAAAA CGCAAAAATT TACTAAAAAC CTTCCAAGAT
AATGATTTAA AGCTTGTTTT GGATAATCTC AACAAAGAAA GTCAAAAAGA CGGCATAGAG
AAGCGAAAAT TAAAACAAAA AAGAGATAAA ATATTAAACC CATTAAAGCA AAAAATTGAT
CAAGCAGATG CTCAAATATT AGAATTAAAA CAACAGCGTA AAGAATTATC CCGTCAGCTT
CAAGCACAGA TGAATCAAGT TTATTCTCTA AGCAATTTTG CGGGACAATC AAACTCATTA
CAAAGCTTAA TACCCACAGG TGGTTTACTG ACAGGAACGG GAGAGTGTTG TGCGCCAAAA
TTACTAAATT ATGCAGCACA ACATCACTTA AAACCTTTAG CTATGGCTGA ATTTTGGTGG
GGAGAAGCGT CTAACAATGG AGATAAAATT CCTGGTCAAT TTTATCCTGC GTGTCAGGAA
AGATGTCAGC CTTTGATGGG ATTTTTACTG TCTGGATTAG GGAACAATCA ATCTTTTTTT
AAAAGTGAAA TCAAGGTAAT TTATGAAGAT CAATGGATAA TTGCTATTGA TAAGCCGAGC
AGTTTATTAT CAGTACCAGG TCGTTATTTT GAGACCTTTG ATAGTGTCTT AACCCGCTTA
CAAAATAGCT TACCTGATGC TCAAGAATTA AGAACTGTAC ATCGATTAGA TCAAGACACT
TCGGGGATTC TTTTATTAGC ACGCGATCGC TACACCCATC GTCACCTTAG TCAACAATTT
GCACAACGAA AAGTTGAGAA AATTTATGAA GCAATCTTAG CTGGATCGGT TATGATGAAT
GAAGGAGTAA TTCAATTACC TTTATGGGGA GATCCCAATA ATCGTCCTTA CCAAAAAGTT
GATTGGGAAC TGGGAAAACC TAGTATTACT CAGTTTAAAG TTATTACAAC ACAAGAAAAC
TTGACCCGTA TTCAATTTAT TCCCCTAACA GGACGTACCC ATCAAATCAG GGTTCATGCA
GTGGATACAC AAGGACTAGG AAGCGTCATT TTAGGCGATT ATCTTTATGG GTGTAATGCT
GGTGTAAGTC GTTTACATTT ACACGCTAGA GAATTAAAAT TTGAGCACCC TCAGCAACAA
AAGACTGTTC ATCTTTATTT AGAAACACCA TTTTAA
 
Protein sequence
MDQVILEKIS EFVTEKTALK DLAVNYWYEG YCPQSGEFLR LPRNRMIEAI ALGLMKQLAE 
DNRYSYEGKM YGVLLVETPQ GELAVLKAFS GLLLGKNVVE GWVPSLLGKE KITLEEIQTL
EQLENLKHQI VALQKISERQ DYQDLSKEWK TRLNNLAIIH RERKLKRQEK RKNLLKTFQD
NDLKLVLDNL NKESQKDGIE KRKLKQKRDK ILNPLKQKID QADAQILELK QQRKELSRQL
QAQMNQVYSL SNFAGQSNSL QSLIPTGGLL TGTGECCAPK LLNYAAQHHL KPLAMAEFWW
GEASNNGDKI PGQFYPACQE RCQPLMGFLL SGLGNNQSFF KSEIKVIYED QWIIAIDKPS
SLLSVPGRYF ETFDSVLTRL QNSLPDAQEL RTVHRLDQDT SGILLLARDR YTHRHLSQQF
AQRKVEKIYE AILAGSVMMN EGVIQLPLWG DPNNRPYQKV DWELGKPSIT QFKVITTQEN
LTRIQFIPLT GRTHQIRVHA VDTQGLGSVI LGDYLYGCNA GVSRLHLHAR ELKFEHPQQQ
KTVHLYLETP F