Gene PCC8801_3940 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_3940 
Symbol 
ID7103884 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp4125145 
End bp4126506 
Gene Length1362 bp 
Protein Length453 aa 
Translation table11 
GC content35% 
IMG OID643476939 
Productrestriction modification system DNA specificity domain protein 
Protein accessionYP_002374040 
Protein GI218248669 
COG category[V] Defense mechanisms 
COG ID[COG0732] Restriction endonuclease S subunits 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCCTAA ATACCCTAAA GCAATGGAAA CCTTACCCAC ATTATAAACC TTCCGGGGTT 
GATTTCTTGG GGGATATTCC TGATGGGTGG GAGGTTAAAA GATTAAAATG GATTGTATCA
AAAATTGGTA GCGGTAAAAC TCCTAAAGGT GGTGCAGAAA TTTACTCTGA TTCTGGTATT
ATTTTTTTGC GTAGCCAGAA TATTCATTTT GATGGTTTAA GATTAGATGA CGTTGTTTAT
ATAAATAAAG ATATTGATAA AGCAATGTCA TCTTCTAGAG TAAAACCGCT TGACATTCTT
TTAAATATAA CAGGCGCATC TTTAGGGAGA TGTATGATTA TTCCTAAAGA TTTCCCGTCA
TCTAATGTTA ATCAGCACGT TTGCATTCTT AGACCTATTG TAACCCGTAT CAACCCTTAT
TTTTTAAATA GAGTAATGTC CTCTAATGCA ATTCAAAATC AAATATTTTC TTCTGAAGTT
GGTGTTTCCC GTGAAGGTTT AACTTTTGCT CAAGCTGGTA ATTTAATTTC AGTATTTCCC
TCCCTACCCG AACAAGAAAA AATCGCTCAA TTTCTGGATG AAGAAACCGC GAAAATAGAT
AAACTCATCA CCCACAAACA AAGACTAATT GAATTATTAA AAGAAAAGCG CACAGCTTTA
ATTAGTCATG CTGTCACCAA AGGACTTAAC CCCGATGTCC CGATGAAAGA TTCTGGGGTA
GAATGGTTAG GGTTTATTCC TGAACATTGG GAGGTTAAGA AAATTAAGAG GTTATCCTTA
GTAAAAAGGG GCGCATCACC TAGACCAATT GACGACCCAA TATATTTTGA TGATAATGGA
GAATATGTAT GGGTTAGAAT TTCTGATGTA ACAGCTAGTA ATAAATATTT ATTAGAAGCT
GAACAAAAAT TATCCGAGAT AGGAAAGAGG AAAAGTGTTC CTTTACAACC TAATGAACTA
TTCTTAAGCA TTTGCGCTAG TGTTGGAAAA CCAATCATTA CCAAAATTAA ATGCTGTATT
CATGATGGTT TTGTGTATTT TCCAGAATTG AAAGAAAATA GAGAATATTT ATATTATATT
TTTCTGGGAG GAGAATTATA TAAAGGTTTA GGTAAAATGG GAACACAGTT AAATCTTAAT
ACGGAGATTA TTGGAGATGT TAAATTACCA ATTCCTCCCG TTTCCGAACA ACAAAAAATC
GCAGAATACT TAGACGAAAA AACCGAACAA ATAGACCCAA TAATTAAGAA AACCCGTGAG
AGTATCGAGT ATTTAAAAGA ATATCGAACC GCGTTAATAT CTGCTGCCGT AACAGGTAAA
ATAGATGTGA GGCAGTGGGG ATGTGAGGAG GTGAGGGAAT GA
 
Protein sequence
MTLNTLKQWK PYPHYKPSGV DFLGDIPDGW EVKRLKWIVS KIGSGKTPKG GAEIYSDSGI 
IFLRSQNIHF DGLRLDDVVY INKDIDKAMS SSRVKPLDIL LNITGASLGR CMIIPKDFPS
SNVNQHVCIL RPIVTRINPY FLNRVMSSNA IQNQIFSSEV GVSREGLTFA QAGNLISVFP
SLPEQEKIAQ FLDEETAKID KLITHKQRLI ELLKEKRTAL ISHAVTKGLN PDVPMKDSGV
EWLGFIPEHW EVKKIKRLSL VKRGASPRPI DDPIYFDDNG EYVWVRISDV TASNKYLLEA
EQKLSEIGKR KSVPLQPNEL FLSICASVGK PIITKIKCCI HDGFVYFPEL KENREYLYYI
FLGGELYKGL GKMGTQLNLN TEIIGDVKLP IPPVSEQQKI AEYLDEKTEQ IDPIIKKTRE
SIEYLKEYRT ALISAAVTGK IDVRQWGCEE VRE