Gene PCC8801_2988 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_2988 
Symbol 
ID7104423 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp3084485 
End bp3085645 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content32% 
IMG OID643476017 
Productrestriction modification system DNA specificity domain protein 
Protein accessionYP_002373132 
Protein GI218247761 
COG category[V] Defense mechanisms 
COG ID[COG0732] Restriction endonuclease S subunits 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGATG AATCGCAGAA GTTTATTAAA CTTGGTAACT TGATCAAGTT TAAATATGGA 
AAATCTCTAC CGAATAGAGA AAGAGATCCA GATGGAAAAT ATTTAGTCTT TGGATCTGGT
GGTAAAATAG GATTACACAA TAGCTATTTA ACTGAATCAC CTGTAATTGT TGTTGGACGA
AAAGGTTCAA TTGGTTCAAC TTTTTATTCG GATAATCCTT GTTGGTGTAT AGATACAACT
TACTATGTAG ATCAATTTTC TTCTAATTTA TATTCCAAAT ATTTATATTA TTTTCTCAAT
ACTTTAAAAT TAGATCGTCT GAATCGCGCA GCAACAATTC CCGGATTAAG TAGAGATGAT
TTATATACTT TTTCTATCCC TATTCCCTAT CCCAATAATC CTAAACTCTC CTTAGATATA
CAACAGCGAA TTGTAGCGAG AATTGAATCT TTATTCGGGG AAATTAAACG GAATCGTTTA
TTACTTGAAC AAATGCGTTT GGATAATGAT TTGTTGTTAC CTAATGCTTT AGATGAAGTG
GTTGAAAGAT TAGATTCCAA AAGACAAACG CTACTTGATG TTATTCAAGA AAAACCGAGA
AATGGATGGT CGCCAAAATG CGATAATGAT CCTAATGGTG TTCCTGTCTT AAAATTAGGT
GCAGTTTTAC GATTTCAGTA TAACCCAGAT GAGATAAAAC GAACTAGCTT ACCGACTGAT
GAAAATGCAC ATTACTGGTT AGAAGCAGGA GACATTTTAA TCTCTAGAAG TAATACTCTT
GATTTAGTGG GTCATGCGTC AATTTATTCT GGTATTCCTT ATCCTTGTAT TTATCCAGAT
TTAATAATGC GTTTTAGAGT GAATCCCAAC AAAGCAGATA GTAAATTCTT AATGTATTGG
TTACAATCAA AAGAAGTTCG TCATTATATA CAAACGAATG CTTCAGGTGC AAGTCCAACT
ATGAAGAAAA TCAAACAAGA GACTGTTTGT AATATTCCTT TTCCTATCAT TTCTTTAGAA
GAACAAAGTT ATTTTGCTTA TCACTTAGAT GCTATTCAAC AAGAAGTGAA TAAAATCAAT
AGAATAATAG AAGAAGATGA ACAAAACTTT AAGTATTTAG AACAAGCAAT TTTAGAAAAA
GCATTTAGGG GGGAATTGTA A
 
Protein sequence
MSDESQKFIK LGNLIKFKYG KSLPNRERDP DGKYLVFGSG GKIGLHNSYL TESPVIVVGR 
KGSIGSTFYS DNPCWCIDTT YYVDQFSSNL YSKYLYYFLN TLKLDRLNRA ATIPGLSRDD
LYTFSIPIPY PNNPKLSLDI QQRIVARIES LFGEIKRNRL LLEQMRLDND LLLPNALDEV
VERLDSKRQT LLDVIQEKPR NGWSPKCDND PNGVPVLKLG AVLRFQYNPD EIKRTSLPTD
ENAHYWLEAG DILISRSNTL DLVGHASIYS GIPYPCIYPD LIMRFRVNPN KADSKFLMYW
LQSKEVRHYI QTNASGASPT MKKIKQETVC NIPFPIISLE EQSYFAYHLD AIQQEVNKIN
RIIEEDEQNF KYLEQAILEK AFRGEL