Gene Information |
Locus tag | Cyan8802_3969 |
Symbol | |
ID | 8393319 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8802 |
Kingdom | Bacteria |
Replicon accession | NC_013161 |
Strand | - |
Start bp | 4086086 |
End bp | 4087771 |
Gene Length | 1686 bp |
Protein Length | 561 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 644981893 |
Product | group I intron endonuclease |
Protein accession | YP_003139607 |
Protein GI | 257061719 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01453] group I intron endonuclease |
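The coordinate and length fields above are arithmetically related, and a minimal Python sketch can cross-check them. It assumes the Start bp and End bp values are 1-based and inclusive, which is consistent with the reported gene length. The function names (`gene_length`, `protein_length`, `gc_percent`) are hypothetical helpers for illustration, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS: codon count minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return 100.0 * sum(b in "GC" for b in bases) / len(bases)


# Values copied from the record above.
length = gene_length(4086086, 4087771)
print(length, protein_length(length))  # prints: 1686 561
```

Passing the Gene sequence field from the Sequence section to `gc_percent` gives a value to compare against the reported GC content. The strand field does not affect any of these calculations.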
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.00182947 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCCTAGCA AGAAGACACA ACAGCAGCTA CTGGAAGAGT TCTATGAAGC TCATGGAGAC TATTACGATT ACTCATTAGT GAAATACGTA AATTCTTCGA CAAAGGTAAG AGTAGTTTGC CCTGTTCATG GTGAATTTCA AATTACTCCT GAACACCATA AAAATGGAGT TGGATGTCGT AAGTGCTATT TTGCCTCTCA GAGGATATCA AAGAAAGCAT TCATTCATCG AGTGCAGAAG CACTTTGGAG ATAGATATGA CTATTCACTT TTTGATGAGT TGCCTCAAAA TGGTGAGCAA ATCAAGATTT TATGTAAAGA ACATAACGAG ATTTTTTTAC AAGAACCAAG AAATCATATG CGAGGTCATA CAGGATGTCC TGTGTGCATA TCACTTAGGT TGTCTGGATT CCAAGAAAAA AGGGGCGAAA TTAAAACGAA AGGAAATTTA ACCACTGAGT TTATTGAACG AGCTAGGATT ATTCATGGTA ATAAGTATGA CTACAGCGAA TTTGAGTATT TAAGCTCAAG TAAAAAAGGA AAAATTTTAT GTCTGAAACA CGGTGTATTT TGGCAATCGC CAAGCAATCA TTTGAGAGGC AGCGATTGCC CAAAATGTTC TCGCGAGAGT CGAAAAGAAG CCACTTTCAA AAGTAAATGT AAGGAATTAG GCGTAGATTA CTGGAGAGCT TTGAAGAGGC GAGAGGCTGG ACTCCCTGAA GAAATAATAT TTGATAAAAA ATATGTCCGT AGCCTAAGGG AAACTGGAGA AATTATAATA TTTGGAGTCA GATATCCTAA TCTTGAAGAA GCGGTAAGGT GTTTAAATCC GCCAGCTAGC ACTCAAACTA TAGCTCGATG GATAAAAGAG GGTGTATCTC CTGAAGATGC TTTTAATCGT ATTCCTAATC CAGGCTATGC TAAAGGTATT ATATATTTGG TCACTCATAA AGAATCCGGC AAGCAATATG TTGGCTTGAC AATTCAGTCT TTAGAACGGC GATGGAAATA TCATGTAGAG CAGGCTTCTG CTGGACATAT AAAGGGAGAT AAATCATTAC ATCATGCAAT TCGAGAATAT GGTTCGGACG CTTTTGAAAT ATGTCAAATA GATCGAGGTA CTTCTAAGCA GGATCTTGAA AAAAAAGAGA AGGAATGGAT TAAGAAGCTA AAAACTTTGA CTCCTCACGG TTACAACATT TCTACAGGAG GAGTAAGTGG CGGGTCAAAC AAAAAGAGTA CTTATATTGA CGATATACGC TTTGAAAGCG TAAAAGCGGC AGCGGCATAT CTCTCTCAGA CTCGTGATAT CAGTCTTTCC GCAGCCAAGA AAAGAATAAG TCAGAATAAT ATTAATGTTA AAACACCTGC AAAGCCGGGT GAAAGCCTAG TTAAAACCAA AGCTTACAAA GCTTGGAGCC GAATTATTCA TGGAGCACTT AACCCTAACT CTAAAGAATA TATACCCGGA TTAGATATTT ATCCCAAATG GCGTGATTTC AAGCAATTTC TCAAAGATGT AGGTAATCCG CCTGAAGACA GCATGGCATT CTCTAGACTA GATAAAGATA AAGGTTTCTT CCCAGAAAAT TGCGCTTGGT TAACTAAGAG CGAAGCTAGT GTAATCAACG CTGAGTATAT GAAGAAAAAA GGCAAGTTTG GGAGAAAATC TCATGACAAA CAATGA
|
Protein sequence | MPSKKTQQQL LEEFYEAHGD YYDYSLVKYV NSSTKVRVVC PVHGEFQITP EHHKNGVGCR KCYFASQRIS KKAFIHRVQK HFGDRYDYSL FDELPQNGEQ IKILCKEHNE IFLQEPRNHM RGHTGCPVCI SLRLSGFQEK RGEIKTKGNL TTEFIERARI IHGNKYDYSE FEYLSSSKKG KILCLKHGVF WQSPSNHLRG SDCPKCSRES RKEATFKSKC KELGVDYWRA LKRREAGLPE EIIFDKKYVR SLRETGEIII FGVRYPNLEE AVRCLNPPAS TQTIARWIKE GVSPEDAFNR IPNPGYAKGI IYLVTHKESG KQYVGLTIQS LERRWKYHVE QASAGHIKGD KSLHHAIREY GSDAFEICQI DRGTSKQDLE KKEKEWIKKL KTLTPHGYNI STGGVSGGSN KKSTYIDDIR FESVKAAAAY LSQTRDISLS AAKKRISQNN INVKTPAKPG ESLVKTKAYK AWSRIIHGAL NPNSKEYIPG LDIYPKWRDF KQFLKDVGNP PEDSMAFSRL DKDKGFFPEN CAWLTKSEAS VINAEYMKKK GKFGRKSHDK Q
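The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is a minimal illustration, not the database's own pipeline. It builds the codon table from the conventional TCAG ordering; translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons. Since this gene begins with ATG, alternative start codons are not handled here. `CODON_TABLE` and `translate` are hypothetical names.

```python
from itertools import product

# Codon table in the conventional TCAG order (first base varies slowest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace between 10-base blocks is ignored. Alternative start
    codons allowed by table 11 are NOT converted to M in this sketch.
    """
    bases = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(bases) - len(bases) % 3, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGCCTAGCA AGAAGACACA ACAGCAGCTA"))  # prints: MPSKKTQQQL
```

The output matches the first block of the protein sequence. Translating the full gene sequence in the same way should give the complete 561-residue protein.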