Gene PCC8801_0213 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_0213 
Symbol 
ID7103547 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp208470 
End bp209696 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content37% 
IMG OID643473326 
Producthypothetical protein 
Protein accessionYP_002370472 
Protein GI218245101 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTCGAAA AATGGGAATT AAAGGAAAAT CCTTTCCGGT CTACTCCCCC CGATGATCCC 
GAAAAATTAG CCCAAATTTT CTATGGACGG GCTCAGATTT TAGATGTAGC TATTCCGACC
CTTTATGAAG GAAGAAATAT CTTAATTAGG GGGGTTTGGG GAATTGGAAA AACCGCCTTG
ATTTTCAATT TAATTAATCA GTTACAGCAG GAAGTAGCTG AAATAAAGGA AAAAATGTTA
GTCCTGTATC TAAGTAGTAT TCCCGGAGAC AGTCCCCCAG AATTTTATCG TGCTTTATTG
TTAGCTATAG CCGATAGTTT AGCAGAAATT GATGAGGAAG CGAGAGACAT TGCTAATACG
CTTTTAGGCT ATTCTATTCA ACGGACTAAA ACCACCACAG AAGGACAAGT TAAACTTGGA
ATAATATCCT TTGGAAGACG ACAAGAATCT CCCGCAAATT TAGTGAGTCC TACTGATAAA
ATTGACCCCT ATCCCTTACT AATCAAACTC CTTAGTAAAG CAGAAGAAAA ATATCATCGT
ATTGTCATTG CTATTGATGA TTTTGACAAA AAAGATCCCA TTATTGTCCA GACAATTTTA
GAAAGTAGTT TAGATTTATT TCGCATGGGA AAGAATCGAG GATTTATTAT GACAGGAAGA
GGGTTTACCG ATCTCCAAGA AGCTACCTTA AAAGCTTTAG GGATTTTTTC GGAAGATATC
CCCCTCGAAC CCATGAGTCA AGATGATTTA CATCATATTG TCATTAATTA TCTTAATAGC
GTTAGATATC AACCGCGAAA TGATACCTAT CCTTTTACAG AAGACGTAAT GAATCTAATT
ACTAATTATG CCCAAGGAGT CCCTAGACAA CTAAATGAAA TTTGCGAAAA AGTCTTACGC
AAAGCGGCTT CAGCAGGGTA TGAAACCATC GATCAACCTG CTTTTAATAC TATTTGGGAA
ACCCTACAAA AAGAATTTAC GAACCAGCTA AGTCCCCAGT TTCGTCATCT ATTATATGTT
GCCCATGAAG CAGGGGGAAT TAGTGAAAAT ATATCCGATC GCACCCTCGA TAAACTCGAC
GCACTTACCT TTGTTGAACT GTTACCCCAA CTGAAATTAT TGGAAGAACA AGGAGTATTA
ATTCGTCAAG AAGATCAAAA AGGATTTAAG TTTTTACCCT CTCAATTATT CCAACCGAAA
TTAGTATCCG AAGCGCAGGA AGAATAA
 
Protein sequence
MLEKWELKEN PFRSTPPDDP EKLAQIFYGR AQILDVAIPT LYEGRNILIR GVWGIGKTAL 
IFNLINQLQQ EVAEIKEKML VLYLSSIPGD SPPEFYRALL LAIADSLAEI DEEARDIANT
LLGYSIQRTK TTTEGQVKLG IISFGRRQES PANLVSPTDK IDPYPLLIKL LSKAEEKYHR
IVIAIDDFDK KDPIIVQTIL ESSLDLFRMG KNRGFIMTGR GFTDLQEATL KALGIFSEDI
PLEPMSQDDL HHIVINYLNS VRYQPRNDTY PFTEDVMNLI TNYAQGVPRQ LNEICEKVLR
KAASAGYETI DQPAFNTIWE TLQKEFTNQL SPQFRHLLYV AHEAGGISEN ISDRTLDKLD
ALTFVELLPQ LKLLEEQGVL IRQEDQKGFK FLPSQLFQPK LVSEAQEE