Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cyan8802_0725 |
Symbol | |
ID | 8390031 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8802 |
Kingdom | Bacteria |
Replicon accession | NC_013161 |
Strand | + |
Start bp | 727030 |
End bp | 730047 |
Gene Length | 3018 bp |
Protein Length | 1005 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 644978743 |
Product | type III restriction protein res subunit |
Protein accession | YP_003136499 |
Protein GI | 257058611 |
COG category | [V] Defense mechanisms |
COG ID | [COG4096] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.147192 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCGTA GTGAAGCGAC AACTCGAAAA GACTTAATTG ATCCTCAACT TGCCAAAGCA GGATGGAATG TCAACGATAC TCATCAAGTT GGACTAGAAA TTCCTGTTGA TGGTTACGAT GCACAACCTT GGAATGGGGT AACCGATTAT TGTTTATATT CCCCAAATGG GGATGTCATA GCGGTAGTAG AAGCAAAACG TCAAAGTCGA GATCCCAGAG TCGCTGAACA ACAAGTTCGC CATTATGTGA CCGAAATTGA GAAGCATCAA AGCTTTCGTC CTTTTGCTTT TCTCACCAAT GGCAATATAA CTTATTTCTG GGATGTGGAA AATTCGCCTA AACGACAGGT GGCCGGGTTT TTCTCCTTAC GAGATTTAGA AAATTTGCTG TATATTCGTC AAAACAAAAT TAATCCCTCC ACTTTAGTCG TTAATCGCCA AATTTCAGGA AGAACTTATC AACAAGAAGC CATTCAGAGG GTCATTGAAC GGTTTGAGGA CGGATTTCGT CGTGGGTTGA TCGTTATGGC CACAGGAACC GGAAAAACTC GCACCATCAT GGGGTTAATT GACCTCTTAA TTCGCTCAAA TCACCTCAGA ACGGTTCTTT TTGTGGCTGA TCGGGATGTT TTGGTTAGAC AAGCTTTAGA TGACAATTTT CGGGTGTATT TACCCGATGA ACCTAGCGAT CGCATCCGAA GCTATAACAT TGATTATAGT AAGCGGTTGT ATGTGGGAAC CTTACAAACC CTAACCAAAT GCTGTCGGAA TTTTACCCCT GGTTTTTTTG ACTTAATTAT CTTTGATGAG TGTCATCGCT CGATTTATAA CCAATTTAGG GAAGTTTTAG ACTATTTTGA CGGGAAAATC TTAGGATTAA CCGCTACACC TGCTACAGCG ATTGATCGCA ATACGTTTCA GGCGTTTCAC TGTTTTGAGG GTATTCCGAC GTTTTTGTAT GAATATCAAC AGGCGATCAA TGATGGGAAT TTAGTCGATT ATAGCCTCTA TCAAGCACAA ACTCACTTTC AACGGGAGGG AATACGGGGG GCAAATTTGG ATGAAGAAGA CCGAAATATG TTGATTGAAC AGGGAATTGA TCCCGATGAT ATTGATTATG AGGGGACAGA ATTAGAAAAA AAGGTCAGTA ACCGAGATAC CTTAAGAAAA CAATGGACAG AAATCATGGA AGTCTGTCAT AAGGACGAAT CGGGGCAATT ACCAGGGAAA ACAATTATTT TTGCTGTGAC ACAAAAACAC GCCCATCGTC TTAAAGAAGT ATTTGATGAA ATGTATCCCC AATTTTCGGA GATGGTTCAG GTAATTGTCT CAGAAATGGA AAAAACTAAG GATTTAATCG ATAAATTTAA GAAGGAAGAA ATGCCTCGTA TTGCGATCTC GGTGGATTTA ATGGATACGG GGGTAGATAT TCCAGAGGTG GTTAATTTGG TGTTTATGAA ACCTGTTCAG TCGTTTATTA AGTTACAGCA AATGATTGGA CGGGGAACGC GCAACCATGA AGCTTGTAAA TATCTTAACC GTCTTCCTAA TGGTAAAAAA GATGAATTTT TGATTATCGA TTTTTGGGAA AATGAGTTTG ATCGCGATCC CAGTGATGAA GTCATTAGTC AAAATTTACC CATTACGGTG AAGCTATTTA ATACTCGTTT GCGATTACTA GGGTATTATT TAGATAATCA AGAATCCTCC GATGAACAGA ATCAAGATGT AAAAGGGGAT TTATACGAAT ATTTATTAGG AAAACTCAAT ATTTCCGGTA GAAACGGACA ATTTCGCACC CCTCGTCATA TTATTCGTCT CATGGTGGAA ATGGTTGACC CGAAACCCAA TGAACGAATA GGAGACTTGG CCGCAGGAAC TTGTGGCTTT TTGGTCAATA GTTATCAATA TATCCTTGAA AAATTCACCA GTCCTGAGAT TTTGCTGGAT GAGATGGGAA ATAAACACCC TATCGGCGAT TTACTCACCC CTGAAGAGTC GGAATTTTTG GAAAAAGAGG CGTTTACCGC TTATGATAAT GATTCGGGGA TGACTATGTT ACGCATTGGG TCGATGAATC TGATGTTACA TGGAATTAAA TATCCTCGGT TTTTCTATCA GGATACGTTA TCAAAAGAGT TTAAGGATGA GAAAAGCTTA GATGTTGCGC TGATGAATCC CCCTTTTAAG GGTAAAATGG ATGAAAAAGA TATAAATCCC TATTTACCGA CTAAATGCAA GAAAACAGAG TTATTGTTTC TCTATCAAAT TTTGCGGGTG TTGGAGATGG GGGGACGTTG TGGGGTGATT GTTCCTGATG GGGTATTATT TGGATCGAGT AAACAGCATC AAGACATCCG ACAGAAGTTA ATCGAGGAAA ACCGCTTAGA TGGGGTGGTT TCCATGCCTT CAGGGGTATT TAAGCCTTAT GCGGGGGTTT CTACGGCTAT TTTACTGTTT ACGAAGGGGG CAACTACCGA TCGCATTTGG TTCTATGATA TGGAACATGA TGGCTTTTCC CTTGATGATA AACGTCAACC GATAGAGGAA AATGATATCC CTGATATTTT GGATTGTTGG CGCAATCGCT TTGATAATGG GTTTTCTGCG CTGCGGGAGT CAATGAAAGC TGAGTTAAGC GCGAAATTAC AACCCCTTAA GGAAAAACGC TTACAGTTAC AAGAGGAAAT TCATCGTCTG AGGTTTGAGG ATGCGATCGC GTCTGAAGAT GAGGAAACCC CCCGTCAGGT GTTAGAATCA GCAGAGGAGA CTTTAAAGGT CTTAGAGGAA GAAATAAAGC CATTACAGGG ACAAATTAAC CGTTTAAGTC GTCAATTTTG GGTGGATAAG CAGGTAGTTA AGGGAAATAA ATATGATTTA TCTGCGAGTC GTTATCGTCA CATTGAACAG GATGAGGTGT TTTATGAGTC TCCACATAAG ACAATGGAAA GATTACTTAA GTTAGAACAT TATATGGCCA ATGAAGTAAC AGAATTAAAT CAATTATTAG GGGAATAG
|
Protein sequence | MTRSEATTRK DLIDPQLAKA GWNVNDTHQV GLEIPVDGYD AQPWNGVTDY CLYSPNGDVI AVVEAKRQSR DPRVAEQQVR HYVTEIEKHQ SFRPFAFLTN GNITYFWDVE NSPKRQVAGF FSLRDLENLL YIRQNKINPS TLVVNRQISG RTYQQEAIQR VIERFEDGFR RGLIVMATGT GKTRTIMGLI DLLIRSNHLR TVLFVADRDV LVRQALDDNF RVYLPDEPSD RIRSYNIDYS KRLYVGTLQT LTKCCRNFTP GFFDLIIFDE CHRSIYNQFR EVLDYFDGKI LGLTATPATA IDRNTFQAFH CFEGIPTFLY EYQQAINDGN LVDYSLYQAQ THFQREGIRG ANLDEEDRNM LIEQGIDPDD IDYEGTELEK KVSNRDTLRK QWTEIMEVCH KDESGQLPGK TIIFAVTQKH AHRLKEVFDE MYPQFSEMVQ VIVSEMEKTK DLIDKFKKEE MPRIAISVDL MDTGVDIPEV VNLVFMKPVQ SFIKLQQMIG RGTRNHEACK YLNRLPNGKK DEFLIIDFWE NEFDRDPSDE VISQNLPITV KLFNTRLRLL GYYLDNQESS DEQNQDVKGD LYEYLLGKLN ISGRNGQFRT PRHIIRLMVE MVDPKPNERI GDLAAGTCGF LVNSYQYILE KFTSPEILLD EMGNKHPIGD LLTPEESEFL EKEAFTAYDN DSGMTMLRIG SMNLMLHGIK YPRFFYQDTL SKEFKDEKSL DVALMNPPFK GKMDEKDINP YLPTKCKKTE LLFLYQILRV LEMGGRCGVI VPDGVLFGSS KQHQDIRQKL IEENRLDGVV SMPSGVFKPY AGVSTAILLF TKGATTDRIW FYDMEHDGFS LDDKRQPIEE NDIPDILDCW RNRFDNGFSA LRESMKAELS AKLQPLKEKR LQLQEEIHRL RFEDAIASED EETPRQVLES AEETLKVLEE EIKPLQGQIN RLSRQFWVDK QVVKGNKYDL SASRYRHIEQ DEVFYESPHK TMERLLKLEH YMANEVTELN QLLGE
|
| |