Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PCC8801_2201 |
Symbol | |
ID | 7105657 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8801 |
Kingdom | Bacteria |
Replicon accession | NC_011726 |
Strand | - |
Start bp | 2274476 |
End bp | 2276893 |
Gene Length | 2418 bp |
Protein Length | 805 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 643475255 |
Product | protein of unknown function DUF608 |
Protein accession | YP_002372385 |
Protein GI | 218247014 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4354] Predicted bile acid beta-glucosidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAACAT TATTCTCTCT CCCCAATATT CCCCCTTGCG CTTGGAAACG CCCTATCGGT CAAGGATGGG ATAAGCCCTA TACCGTCCGT TATGCTAGTA ATTTAGACGA TGGACCGAAC CACGGAATGC CCCTTGGTGG GTTTGGTTCT GGTTGTATTG GACGTTCTCC CAGTGGTGAC TTTAACCTGT GGCATCTGGA TGGGGGTGAG CACGTTTTTC GCAGCATTCC ATCGTGTCAG TTTAGCGTCT TTGAACAGCC AGAAGGCGAA GATGCTCAAG TCTATGCCCT ATCTACCCAA CCTCCTGAAG AGGGAACCCT CTCCCGTTGG TCTTGGTATC CCCCGGAAAA AGGAACCTAT CACGCCCTTT ATCCCCGCAG TTGGTATCAG TATGAAGGGG TGTTTAAAAG TCAATTAATT TGTGAACAAT TTTCTCCCAT TATCCCCAAT AACTATCAAG AAACCAGCTA TCCTATCGGC ATTTTTGAAT GGACTGCCCA TAACCCCACG GATAAACCCA TTACCCTGAG TATTATGGTG ACGTGGCAAA ATATTGTGGG GTGGTTTACG AATGCCATTA AATCCCCTGA AATTACTGTT AGGGATGATG GCAGTCCTGA ATATCAATAT CAACCCCGAT GGGGAGACAG TACCGGCAAT TTTAATCAGT GGGTTCAAGA TAATTTTCGG GTCGGTTTTA TCCTTAACCG TATTCAACTT CATGACGAAA TTCAAGAAGG AGAAGGACAG ATTTGTATCG CTAGTGTCAC TAACCCCAGT GTGGAGGTTT TTTACTTAGG AAAATGGAAT CCTAAAGGCG ATGGTTCAGA GGTTTGGGAC TATTTTGCCA TGAATGGTTC CTTACCGGAT ACGGAAGATG AAACCCCGGC AGAACCGGGT GAACAAATTG CGGTTGCGAT GGCCATTCGG TTTACTATTC GTCCAGGGAG AACCAAGAAA ATTCCCTTTA TTTTAGCCTG GGATTTTCCG GTAACAGAAT TTGCGCCAGG GGTGCAATAT TATCGCCGCT ATACTGATTT TTATGGTCGC AATGGTAAGA ATGGGTGGGC GATGGTGAGA ACCGCTTTAA AACATTCTGA TGTGTGGTAT GAACGGGTCG AAGAATGGCA GTCTCCTATT TTGAAACGGG AGGACTTACC TGACTGGTTT AAGATGGCTT TATTCAATGA ATTATACCTA TTAACAGATG GGGGAACCCT GTGGACGGCT GCTTCAGAAA ATGACCCCGT AGGACAATTT GGGGTACTTG AATGTTTGGA CTATCGGTGG TATGAAAGTT TAGATGTTCG CTTGTATGGT TCCTTTGGGT TAATGATGCT TTGGCCTCGT TTAGAAAAGT CTGTTTTAGA AGCATTTGCT AGGGCAATTC CTACCCATGA TGATACCCCT AGAATTATTG GATATAACCA AGCAAAAGCC ATCCGAAAAG CGAAAGGGGC AACTCCCCAT GACTTAGGAG CACCCAATGA ACATCCTTGG CAAAAAACCA ATTACACTAG CTATCAAGAT TGTAATTTGT GGAAAGATTT AGGCAGTGAT TTTGTGCTCC AAGTGTATCG AGATTATTTA TTAACCGGGT CAGATGATAC GGATTTTTTG TGGGAATGTT GGCCAGCGAT TACTGAAACC TTAGACTATT TAAAGACGTT TGATTTAGAT AACGATGGCA TTCCCGAAAA TTCAGGAGCC CCGGATCAAA CCTTTGATGA TTGGAAATTA CAAGGAATTA GTGCCTATTG TGGTGGGTTG TGGATAGCTG CCTTAGAGGC GGCGATTAAA ATAGCAGAAA TTCTGCTCAA AAATGTGCCA ACTACGGAAG AATTACAGTC GAGAAATAAT CCTGAATCCA TCAAGCATTA TGTCAAAAAT CATCGTGATT GGCTAGAACA ATCTCGGTCA ATTTATCACG ATACGTTATG GAATGGAGAA TACTACAAAC TCGATAGTCA AAGCGGTTCT GATGTGGTGA TGGCGGATCA ATTATGCGGT CAATTTTACG CCCGTTTATT GAATTTCCCC GATGTCGTTG AAACTCAGTA TACAGAATCA GCTTTAAACA AGGTTTATGA GGCGTGTTTT TTGAAGTTTC AAGATGGAAA ATATGGTGCA GCTAATGGGA TGAAACCTGA TGGAACTCCC GAAGATCCTA ACTCAACCCA TCCCCAAGAA GTTTGGACAG GAATTAACTT TGGTTTAGCG GCATTTTTAT TACAAATGGG ACGCAAAGAT GCAGCTTTTA AGTTAACCGA AGCTGTTGTT AAACAAGTCT ATGAAAATGG CTTACAATTT CGCACTCCTG AAGCGATAAC GGCTGTGGGA ACCTTCCGTG CTAGTCATTA TTTACGGGCG ATGGCTATTT GGGGAATTTA TGGCATTTTA ACCCATTTTC GACCCTAA
|
Protein sequence | MKTLFSLPNI PPCAWKRPIG QGWDKPYTVR YASNLDDGPN HGMPLGGFGS GCIGRSPSGD FNLWHLDGGE HVFRSIPSCQ FSVFEQPEGE DAQVYALSTQ PPEEGTLSRW SWYPPEKGTY HALYPRSWYQ YEGVFKSQLI CEQFSPIIPN NYQETSYPIG IFEWTAHNPT DKPITLSIMV TWQNIVGWFT NAIKSPEITV RDDGSPEYQY QPRWGDSTGN FNQWVQDNFR VGFILNRIQL HDEIQEGEGQ ICIASVTNPS VEVFYLGKWN PKGDGSEVWD YFAMNGSLPD TEDETPAEPG EQIAVAMAIR FTIRPGRTKK IPFILAWDFP VTEFAPGVQY YRRYTDFYGR NGKNGWAMVR TALKHSDVWY ERVEEWQSPI LKREDLPDWF KMALFNELYL LTDGGTLWTA ASENDPVGQF GVLECLDYRW YESLDVRLYG SFGLMMLWPR LEKSVLEAFA RAIPTHDDTP RIIGYNQAKA IRKAKGATPH DLGAPNEHPW QKTNYTSYQD CNLWKDLGSD FVLQVYRDYL LTGSDDTDFL WECWPAITET LDYLKTFDLD NDGIPENSGA PDQTFDDWKL QGISAYCGGL WIAALEAAIK IAEILLKNVP TTEELQSRNN PESIKHYVKN HRDWLEQSRS IYHDTLWNGE YYKLDSQSGS DVVMADQLCG QFYARLLNFP DVVETQYTES ALNKVYEACF LKFQDGKYGA ANGMKPDGTP EDPNSTHPQE VWTGINFGLA AFLLQMGRKD AAFKLTEAVV KQVYENGLQF RTPEAITAVG TFRASHYLRA MAIWGIYGIL THFRP
|
| |