Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cyan8802_4288 |
Symbol | |
ID | 8393640 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8802 |
Kingdom | Bacteria |
Replicon accession | NC_013161 |
Strand | + |
Start bp | 4423189 |
End bp | 4426482 |
Gene Length | 3294 bp |
Protein Length | 1097 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 644982198 |
Product | hypothetical protein |
Protein accession | YP_003139909 |
Protein GI | 257062021 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.400054 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTACAAT TAGGCAATTC ATCTCAACAG TTTCGCGGAA TTAACCGAAC TATTTGTATT GGATTAGGAG GAACTGGATT AGAAATTTTA ATGAGAATCC GGCGATCAAT TGTTAATAAG TATGGAGACT TAAACAACCT TCCTATCGTT AGTTTTGTTT ATATTGATAC CGATAAAGCT GGTTCACAAT CATCTATTTT ACGCACAGGA AATATTCATC ATGGGGTCGA TCTAAATTTA CGGGAATCTG AGAAAGTAAA CGCTACTATG ACAGCCATTG AAGTTGAGAA CTTTAAGAAA GGATTAGAAC GACGTTCTAG CTATGATAGA CAGGGTCCTT ATGACCATAT TAATCTCTGG TTTCCTCGTC AATTATCCAA CAATATTAAA GCTATTGAAG ATGGTGCAAA AGCAATTAGA CCAGTCGGAA GATTAGCCTT TTTTCACAAC TATCGAAAGA TTAAAACCTC TATAGATGCA GCAGAAAGAA GAACTAGAGA TCATAGTTCA AAACTATTAA AATCAGGTTT AAGAGTCGAT GATCAATTGA ACATTTTTGT CGTTGGGTCT TTGTCTGGTG GTACGGGAAG CGGAATGTTT TTAGATATAG CTTATAATCT GAGACGTGAT TATAGCAAAC AAGGAGTTCA AATCGTTGGT TATTTAGTTG TTGCTCCCAA TCTTTATACT TCTCCTGCCA ATCCTAATCC GACGATTTTT GCGAATACCT ATGCTGCTCT TAAAGAGCTT AATTATTATA GCACACCTGG AACTCAATTT AAAGCTTGCT ATGATATACA AAACATGGCA ATTGTAGACG AAGATCGTCC TCCTTTTGAC TATACTTATT TAGTTTCTAA TGGCAATAAT ACGGGATATC AAATATTAGA TAAACGTAAG CTTTGTAACG TTATTGCTCA TAAAATTGCC TTAGACTTTT CTGGAGAATT AGCTCCCGTA GTTAAAGGAC AACGGGATAA TTTTGCACAG CATTTACTCC AATTAGATGA TTATCCCCGT CCTAATGTTC AAAGATATCT ATCTTTTGGA CTAGCTGCCA TTTATTTTCC CCGCGATCTC ATCGTTCGTA TTGCTTTAAC TAGGATTAGT TTAAATCTAG TTAATTTTTG GTTGTATGGG TTTGGTCAAA GTCCCGATCC TAGGCAATTA CTCGAAGAAT TTTTATTACA AGAACGCTGG CATGATGATA TAGTAGAAAG GGATGGATTT ACCAAACGAC TAGAATCAAC TCCCCAAGAA GTTAATAAAA CTTTTAGTCA AACTCTTAGC AATTGGCGAA ATAAATTAGA AACCATAATT GATGATTGCA AAAGTAAGGA TGATCGGATA TCTTTGCGTC AGCAATTACC GAGAGAATTT CGAGAACAAT TTCGTAAAAC TCAACCAGGA GAAAACGAAA CAACAAGGGG TATTTGGTTG ACTAAAATAC AACAATCTCG ACCAGGAATT ACTAAAGAAC TAAAAGAAAC AATTGATCGG TTTATTGAAG CTTTATTGAC TCCGAACAAT AGTAATTTTT CTATTAAGAA TAGTCGAGAT TGGTTAGATG CACTTATTAG CGAACTACAT GATTATCAAC ACGATCTCGA AGAAAGGATT CAAAACTTTG GAGGGATGGA AAATATAGAA AGTATCGACA AAAAATGGCG AGATACAGAA CAAATTATTG AAGATATTGA ACAGAAATTT CAGTTATTTG GTCAAAAAAA TTCTGATATT CAAAATGAAG CTAAACGAAG TGTTCGACAA ATTAGCAATT TGGTTAAACA TAATTTTGAT TTAGTTGTGA ACCAAGAAGC GTTACAAATT GTCAAAGATT TACAGCAATA TGTTCAAGAT TGGTCGACCC AATTAGCGAG TTTTTATCGG TTAGTGGATA ACCTAAAAAG TGACTATCAA CGACAAGAAA CTGAATGGAA ACAACTGGAT ATTGATGAAA TGAGTGGGGA AGCAATTTTT GATGATCAAG ATATTGACAG TTGCTACAAT GAATTATTGC CAGAAAATGA CTATAAAGAT CAGTTAATTG TTTTAAGTAG AGATATAACT ACAGCATCAG TGCGTTCAAC ATCTCTGATT AATTTTGTTG AACGGACAAC TTTTGGTGCA GCTTATATTG AGAGCAAAAC CCAAAATAAA ATTCAACAAG ATATCAATCT AACTGTCAAT AGTTCTTTTG GTTTTCGTAG TCTTAAAATT GTCAAATCTG TTATTAAGCG ATTTCTCGAA CATTATACCT CTCTAGAACG ATCTATCCGT TTAGAGAGTA TCCTTAAAGA AGCAGAACCT CTGATTAATC TTAACTTAAA CGATCCGTAT TTTCGAGATA ATCCTGCTAA AAGAACTCAA ATCATTGGGT TTAAGGATAC TGATGAACAA GAAGTTAAAG ACTTCAAAGA AATTTTGCTA CGCGACTTAA AAAATATCAC CGAGAATGTC ACTAAACCAA CTCAAGCAGA AGATGAGATT TTAATGGTAA CAGAATATGC TTCTTTTCCT CTAAGATTGA TTGAAGGATT AACAGAAATG CAAAACTATT ACCTACGGGA AAAAAATATT AGTAATGTTT GTTTACATAA TGATCCCCCT GAACAGTTTC CCGATATTAT TCCTCCTGAT GTGAAAACAT TGGAAAGATT AGAAGAAATC TTTTATCCCT GCCTTGCGTT TGAATTATTG AAGGAAAACC CATCAACCCA AGAATTAGAA TTAGAACATT ATGATCAGAT TCGAGATACT TATTATACCG TCTCTCTTAG TCCTATCTGG AATCTAGCGT TAGAAACTCT TCATCAACAT CTTGACATAA TTAATGCGCT AGAAGAACTT CTAAAACAAG CAGAAAATGA GATAGAACGA GAACCTGAAC GTTGGCAAAA TTACTATCTC CCTAAATTAC GAGAATTTGT AAAGAAAGTT GATCAACTTT CGCCTGAACA TCCTAATTAT CCTTATAAGT CAACAGTTGT GGGAACTCAA GGAAATCTTG AAACCTTAGA CAAAGAAGGA GTTATTATTC GCTTTCAACG TCGCATGAAA GACAAGGTTA ATACTTTACA ATCTGATCAA AAAATACTCA ATACTCAAGA AAATGTTCAA AAGGTATTAT CGAGTGATTC TGATATTATT GATATTGAAA CAGATCCTCC AAAAACTCAA CAATTTTCGG ATGATTTTAT GGTAAAATTA CGAGAATTAG GGCAAATGCG TATAGATGGA TTGCTTACCG AAGAAGAATT TCAAATAGCT AAGAAAAAAC TGTTAGGTAG TTAA
|
Protein sequence | MVQLGNSSQQ FRGINRTICI GLGGTGLEIL MRIRRSIVNK YGDLNNLPIV SFVYIDTDKA GSQSSILRTG NIHHGVDLNL RESEKVNATM TAIEVENFKK GLERRSSYDR QGPYDHINLW FPRQLSNNIK AIEDGAKAIR PVGRLAFFHN YRKIKTSIDA AERRTRDHSS KLLKSGLRVD DQLNIFVVGS LSGGTGSGMF LDIAYNLRRD YSKQGVQIVG YLVVAPNLYT SPANPNPTIF ANTYAALKEL NYYSTPGTQF KACYDIQNMA IVDEDRPPFD YTYLVSNGNN TGYQILDKRK LCNVIAHKIA LDFSGELAPV VKGQRDNFAQ HLLQLDDYPR PNVQRYLSFG LAAIYFPRDL IVRIALTRIS LNLVNFWLYG FGQSPDPRQL LEEFLLQERW HDDIVERDGF TKRLESTPQE VNKTFSQTLS NWRNKLETII DDCKSKDDRI SLRQQLPREF REQFRKTQPG ENETTRGIWL TKIQQSRPGI TKELKETIDR FIEALLTPNN SNFSIKNSRD WLDALISELH DYQHDLEERI QNFGGMENIE SIDKKWRDTE QIIEDIEQKF QLFGQKNSDI QNEAKRSVRQ ISNLVKHNFD LVVNQEALQI VKDLQQYVQD WSTQLASFYR LVDNLKSDYQ RQETEWKQLD IDEMSGEAIF DDQDIDSCYN ELLPENDYKD QLIVLSRDIT TASVRSTSLI NFVERTTFGA AYIESKTQNK IQQDINLTVN SSFGFRSLKI VKSVIKRFLE HYTSLERSIR LESILKEAEP LINLNLNDPY FRDNPAKRTQ IIGFKDTDEQ EVKDFKEILL RDLKNITENV TKPTQAEDEI LMVTEYASFP LRLIEGLTEM QNYYLREKNI SNVCLHNDPP EQFPDIIPPD VKTLERLEEI FYPCLAFELL KENPSTQELE LEHYDQIRDT YYTVSLSPIW NLALETLHQH LDIINALEEL LKQAENEIER EPERWQNYYL PKLREFVKKV DQLSPEHPNY PYKSTVVGTQ GNLETLDKEG VIIRFQRRMK DKVNTLQSDQ KILNTQENVQ KVLSSDSDII DIETDPPKTQ QFSDDFMVKL RELGQMRIDG LLTEEEFQIA KKKLLGS
|
| |