Gene PCC8801_1999 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_1999 
SymbolprfC 
ID7104770 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp2073747 
End bp2075366 
Gene Length1620 bp 
Protein Length539 aa 
Translation table11 
GC content46% 
IMG OID643475060 
Productpeptide chain release factor 3 
Protein accessionYP_002372192 
Protein GI218246821 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG4108] Peptide chain release factor RF-3 
TIGRFAM ID[TIGR00231] small GTP-binding protein domain
[TIGR00503] peptide chain release factor 3 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGAGCG AACTTCAGAC AGAAATCCAA GAAGCCGTTG AGAAACGGCG TAATTTTGCC 
ATTATTTCCC ACCCAGACGC AGGAAAAACC ACTTTAACCG AAAAACTCCT ACTCTACGGA
GGAGCCATTC ACCAAGCAGG AGCCGTCAAA GCCAGACGCG ATCAGCGTAA AGCTACCTCT
GACTGGATGG AAATGGAAAA ACAACGGGGA ATCTCCATTA CCTCCACCGT TCTCCAATTT
GAGTACCGAA ACTTCCAAAT TAATCTCCTC GATACCCCCG GACACCAAGA CTTTAGCGAA
GATACCTATC GTACCCTCGC CGCGGCCGAT AATGCCGTCA TGTTAATCGA TGCAGCCAAG
GGGTTAGAAC CTCAAACCCG CAAACTGTTT GAAGTCTGTC GTCTGCGGGG ACTTCCTATT
TTTACCTTTA TTAATAAACT CGATCGCCCC ACCAGAGAAC CTCTAGAATT ACTCGATGAA
ATTGAACAGG AATTAGGACT GAAAACCTAT GCCGTGAATT GGCCTATCGG AACAGGCGAT
CGCTTTAAAG GCGTTTTTGA CCGCCGTCAC CAGGGTATAC ACCTGTTTGA ACGTCGTGCC
CACGGTAGCC AACAGGCTCA GGAAACCGCC ATTAAGCTTG GCGACCCCAA AATTGAAGCA
CACCTCGAAC AAGAGCTTTA TTATCAGCTA AAAGAAGAGT TAGAAATCCT CCAAGAGTTG
GGGGGAGATC TGGACTTAAA AGAACTCCAC GATGGCCAGA TAACCCCCGT CTTTTTTGGT
TCAGCCATGA CCAACTTTGG CGTGCAGTTA TTCCTCGAAG CCTTCCTAGA ATACGCTTTA
CAACCCGAAG GACGCAATTC TTCTGTGGGA GTGGTTGACC CCACCCATCC CGAATTTAGT
GGCTTTGTTT TCAAATTACA GGCCAATATG GACCCGAAAC ACCGCGATCG CGTTGCATTT
GTGCGGGTTT GTACAGGTAA ATTCGAGAAG GACATGACCG TTAGTCACGC CAGAACGGGG
AAAACCGTCC GTTTATCCCG TCCTCAAAAG CTTTTTGCCC AAGATCGGGA GTCCATCGAG
GAAGCCTATG GGGGGGATGT AATTGGGTTA AATAACCCTG GAGTCTTTGC CATTGGCGAT
ACTATCTATA GCGGAACTAA GTTGGAATAC GAAGGTATCC CTTGCTTTTC GCCTGAAATA
TTCGCCTATC TGAAAAATCC TAATCCGTCT AAATTCAAAC AATTTCAAAA AGGTATTCAG
GAACTGCGCG AAGAGGGAGC AATCCAGATT ATGTACTCTA CCGACGACTT TAAACGCGAT
CCCATTTTAG CGGCTGTTGG ACAGTTGCAG TTTGAAGTGG TGCAGTTTCG GATGTTAAGC
GAATATGGGG TAGAAACGAA CCTAGAACCG CTTCCCTACA GTGTTGCCCG TTGGGTAACA
GGAGGATGGA CTGCATTAGA AAAAGCTGGA CGCATCTTTA ATAGTATGAC CGTTAAAGAT
AATTGGGATC GTCCCGTCTT ATTGTTTAAA AATGAATGGA ATTTAAACCA AGTTAAGGCA
GATAAACCCG AATTAGGCTT AAGTTCTACT GCCCCCGTTG GTTCAGGAAT AAACGAATAA
 
Protein sequence
MSSELQTEIQ EAVEKRRNFA IISHPDAGKT TLTEKLLLYG GAIHQAGAVK ARRDQRKATS 
DWMEMEKQRG ISITSTVLQF EYRNFQINLL DTPGHQDFSE DTYRTLAAAD NAVMLIDAAK
GLEPQTRKLF EVCRLRGLPI FTFINKLDRP TREPLELLDE IEQELGLKTY AVNWPIGTGD
RFKGVFDRRH QGIHLFERRA HGSQQAQETA IKLGDPKIEA HLEQELYYQL KEELEILQEL
GGDLDLKELH DGQITPVFFG SAMTNFGVQL FLEAFLEYAL QPEGRNSSVG VVDPTHPEFS
GFVFKLQANM DPKHRDRVAF VRVCTGKFEK DMTVSHARTG KTVRLSRPQK LFAQDRESIE
EAYGGDVIGL NNPGVFAIGD TIYSGTKLEY EGIPCFSPEI FAYLKNPNPS KFKQFQKGIQ
ELREEGAIQI MYSTDDFKRD PILAAVGQLQ FEVVQFRMLS EYGVETNLEP LPYSVARWVT
GGWTALEKAG RIFNSMTVKD NWDRPVLLFK NEWNLNQVKA DKPELGLSST APVGSGINE