Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cyan8802_3989 |
Symbol | |
ID | 8393339 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8802 |
Kingdom | Bacteria |
Replicon accession | NC_013161 |
Strand | + |
Start bp | 4102957 |
End bp | 4104327 |
Gene Length | 1371 bp |
Protein Length | 456 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 644981913 |
Product | restriction modification system DNA specificity domain protein |
Protein accession | YP_003139627 |
Protein GI | 257061739 |
COG category | [V] Defense mechanisms |
COG ID | [COG0732] Restriction endonuclease S subunits |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.0737122 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCTAA ACACCCTAAA ACAATGGAAA CTCTACCCAA ATTATAAACC TTCTGGGGTT GATTGGTTGG GGGATATTCC TGATAGTTGG GAGGTTAAAA GATTAAGATA TTTAAGCAAA AAGATAACAG CCGGTCCCTT TGGTTCTAAT TTGACTAAAA ATATTTATAC ATCTACAGGA TATAAAATTT ATGGACAAGA ACAAGTAATT GCTTCTGATT TTTCCATAGG TGATTATTAC ATCTCTAAAG AAAAATATGA CCAAATGAGT CAATATAAAA TAAATTCTGG AGATATATTA ATAAGCTGTG TAGGAACTTT TGGAAAGGTT GCAGTAGTTC CTAAAAACAT AGAACAGGGT ATAATCAATC CTCGCCTTAT AAAACTCATC CCTATTACTG AATATATTAA CTCTGTTTAT TTAGAAAAAT TATTAAAAAG TGTTGTTGCT TTTGAACAGA TGGAAAAATT AAGCAGAGGA GGAACAATGG GAGTAATTAA CATTGGATTA CTTTCTGATA TTTTACTACC TATTCCCCCC CTTCCCGAAC AAGAAAAAAT CGCTCAATTT CTGGATAAAG AAACGGCGAA AATAGATAAA CTCATCACCC TCAAAGAAAG ACTAATTGAA TTATTAAAAG AAAAGCGCAC AGCTTTAATT AGTCATGCTG TCACCAAAGG ACTTAACCCC GATGTCCCCA TGAAAGATTC TGGGGTAGAA TGGTTAGGGT TTATTCCTGA ACATTGGGAG GTTAAAAGGT TAAAATATAT AGTCCCTAAT ATTACCGTAG GTATTGTAGT TACTCCTGCT AAATATTATG TAGAATCAGG AATACCATGT TTACGTTCTG TAAATATATC TTCGGGAAAA ATTGATAATT CTAATTTAGT TTTTATTAGT TCTCAAAGTA ACGAACTTCA TCAAAAATCT AAAATCTATA AAGGTGATTT AGTTTTAGTA AGAACTGGTG TCACTGGAAC AGCTGCGATT GTTACAGATA ATTTTGATGG GGCAAACTGT GTTGATTTAT TAATTATTCG TAATTCTAGA TTAATTTTAA CACTATATCT ATACTATTAT CTTAATTCTT CAACAACGTC TTATCAAGTT AATAATTATT CAGTAGGTGC TATTCAAGCC CACTATAATA CGTCAACATT ATCAGAACTA ATCATTACTT TTCCTCCCCC TCAAGAACAA CAAAAAATCG CTGAATACTT AGACAGAAAA ACCGAACAAA TAGACCAAAT AATTAACAAA ACCCGTGAGA GTATTGAATA TTTAAAAGAA TATCGAACCG TGTTAATATC TGCTGCCGTA ACAGGTAAAA TAGATGTGAG GCAGTGGGGA GGTGAGGAGG TGAGGGAATG A
|
Protein sequence | MTLNTLKQWK LYPNYKPSGV DWLGDIPDSW EVKRLRYLSK KITAGPFGSN LTKNIYTSTG YKIYGQEQVI ASDFSIGDYY ISKEKYDQMS QYKINSGDIL ISCVGTFGKV AVVPKNIEQG IINPRLIKLI PITEYINSVY LEKLLKSVVA FEQMEKLSRG GTMGVINIGL LSDILLPIPP LPEQEKIAQF LDKETAKIDK LITLKERLIE LLKEKRTALI SHAVTKGLNP DVPMKDSGVE WLGFIPEHWE VKRLKYIVPN ITVGIVVTPA KYYVESGIPC LRSVNISSGK IDNSNLVFIS SQSNELHQKS KIYKGDLVLV RTGVTGTAAI VTDNFDGANC VDLLIIRNSR LILTLYLYYY LNSSTTSYQV NNYSVGAIQA HYNTSTLSEL IITFPPPQEQ QKIAEYLDRK TEQIDQIINK TRESIEYLKE YRTVLISAAV TGKIDVRQWG GEEVRE
|
| |