Gene PCC8801_2214 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_2214 
Symbol 
ID7102459 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp2289667 
End bp2291040 
Gene Length1374 bp 
Protein Length457 aa 
Translation table11 
GC content30% 
IMG OID643475269 
Productrestriction modification system DNA specificity domain protein 
Protein accessionYP_002372398 
Protein GI218247027 
COG category[V] Defense mechanisms 
COG ID[COG0732] Restriction endonuclease S subunits 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACAAAT CAAAAAATAC TCAATTAGAA CTTAATTTGT GTTTAGAGGA TGAAAACTCT 
GGACATGAAA ATAAAATTTT AGCTCGTGAT TTTCCTCCTG AATGGCAATT AATACCACTT
AAAAATGCTG TAACTTATAT TGATTATGGT TATTCTCACT CAATTCCTAA AATACCTCCT
GAAAATGGAA TAAAAATTGT TAGTACAGCA GATATTAGTA AAACAGGAGA GTTGTTATAT
TCACAAATTA GAAAAGTTGA AGCCCCTTTA AAAACTATAC AACGATTAAC TTTACATGAT
GGAGATGTTT TATTTAATTG GCGCAATAGT TCTTATTTAA TTGGCAAAAC AACTATTTTT
GAAGAACAAT CAGAACCTCA TATATTTGCT TCTTTTGTTC TTAGACTGAA ATGTGATGAA
ATAAAATCAC ATAACTATTT TTTCAAATAC TTATTAAATT ACTATCGCTA TTCTGGAATT
TTTGAAAGTC TTGCTAGAAG GGCAGTTAAT CAAGCTAATT TTAATAAAAA TGAAGTATCA
GATTTAATTA TTCCCCTTCC CCCAATAGAA GAACAGCGAA AAATCGCCAG TGTATTAACA
TTAATACAAG AAGCCATCCA AGAACAAGAA AATGCGATCG CTTTAACAAC GGAACTCAAA
AAAGCCCTTA TGCAAAAGCT ATTCACCGAA GGAATTAATA ATGAACCGCA GAAAATGACG
GAAATTGGTC TTATTCCTGA GAGTTGGGAG GTTGTGAATT TAGGTAACCT GGCAAAATTA
AAATCGGGTG GTACTCCAAG CAGAAAAAAA ATAGAATATT GGGAAAATGG TTCTATTCCT
TGGGTAAAAA CAACTGAAAT TAATTATGAT TTAATAACCA CAACGGAAGA ATATATAACG
AAAGAAGGAC TGGTAAATTC TTCAGCAAAA ATGTTTTCTA AAGGTACTTT GTTAATGGCA
ATGTATGGAC AAGGTGTAAC AAGAGGACGA GTAGGAATTC TTGATATTGA TGCTACTACT
AATCAAGCTT GTGTTGCTAT TATGCCTAAT TCAGAGGATA AATTATCAAC TAAATTTCTG
TATCATTATT TTTCCTATCA CTATGAAAAA TTAAGAAATC AAGGACATGG TGCAAATCAA
AGTAACTTAA GTTCTACTAT TCTAAAAATG TTTCCTATTA CATTCCCTAA AATACAAGAA
CAATTAATAA TTATTAATCA TTTTGATACA TTAAATTTAA AACTAGAGCA ATCTCATAAA
AGAATAACTA TTTTACAAGA CTTATTTAGT ACCCTATTAC ATCAATTAAT GACCGCACAA
ATACGGGTAG ATGAACTAGA GTTATCAGTC TTAGAAAAGC AAATTAAGGA GTAA
 
Protein sequence
MNKSKNTQLE LNLCLEDENS GHENKILARD FPPEWQLIPL KNAVTYIDYG YSHSIPKIPP 
ENGIKIVSTA DISKTGELLY SQIRKVEAPL KTIQRLTLHD GDVLFNWRNS SYLIGKTTIF
EEQSEPHIFA SFVLRLKCDE IKSHNYFFKY LLNYYRYSGI FESLARRAVN QANFNKNEVS
DLIIPLPPIE EQRKIASVLT LIQEAIQEQE NAIALTTELK KALMQKLFTE GINNEPQKMT
EIGLIPESWE VVNLGNLAKL KSGGTPSRKK IEYWENGSIP WVKTTEINYD LITTTEEYIT
KEGLVNSSAK MFSKGTLLMA MYGQGVTRGR VGILDIDATT NQACVAIMPN SEDKLSTKFL
YHYFSYHYEK LRNQGHGANQ SNLSSTILKM FPITFPKIQE QLIIINHFDT LNLKLEQSHK
RITILQDLFS TLLHQLMTAQ IRVDELELSV LEKQIKE