Gene PCC8801_2201 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_2201 
Symbol 
ID7105657 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp2274476 
End bp2276893 
Gene Length2418 bp 
Protein Length805 aa 
Translation table11 
GC content43% 
IMG OID643475255 
Productprotein of unknown function DUF608 
Protein accessionYP_002372385 
Protein GI218247014 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4354] Predicted bile acid beta-glucosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAACAT TATTCTCTCT CCCCAATATT CCCCCTTGCG CTTGGAAACG CCCTATCGGT 
CAAGGATGGG ATAAGCCCTA TACCGTCCGT TATGCTAGTA ATTTAGACGA TGGACCGAAC
CACGGAATGC CCCTTGGTGG GTTTGGTTCT GGTTGTATTG GACGTTCTCC CAGTGGTGAC
TTTAACCTGT GGCATCTGGA TGGGGGTGAG CACGTTTTTC GCAGCATTCC ATCGTGTCAG
TTTAGCGTCT TTGAACAGCC AGAAGGCGAA GATGCTCAAG TCTATGCCCT ATCTACCCAA
CCTCCTGAAG AGGGAACCCT CTCCCGTTGG TCTTGGTATC CCCCGGAAAA AGGAACCTAT
CACGCCCTTT ATCCCCGCAG TTGGTATCAG TATGAAGGGG TGTTTAAAAG TCAATTAATT
TGTGAACAAT TTTCTCCCAT TATCCCCAAT AACTATCAAG AAACCAGCTA TCCTATCGGC
ATTTTTGAAT GGACTGCCCA TAACCCCACG GATAAACCCA TTACCCTGAG TATTATGGTG
ACGTGGCAAA ATATTGTGGG GTGGTTTACG AATGCCATTA AATCCCCTGA AATTACTGTT
AGGGATGATG GCAGTCCTGA ATATCAATAT CAACCCCGAT GGGGAGACAG TACCGGCAAT
TTTAATCAGT GGGTTCAAGA TAATTTTCGG GTCGGTTTTA TCCTTAACCG TATTCAACTT
CATGACGAAA TTCAAGAAGG AGAAGGACAG ATTTGTATCG CTAGTGTCAC TAACCCCAGT
GTGGAGGTTT TTTACTTAGG AAAATGGAAT CCTAAAGGCG ATGGTTCAGA GGTTTGGGAC
TATTTTGCCA TGAATGGTTC CTTACCGGAT ACGGAAGATG AAACCCCGGC AGAACCGGGT
GAACAAATTG CGGTTGCGAT GGCCATTCGG TTTACTATTC GTCCAGGGAG AACCAAGAAA
ATTCCCTTTA TTTTAGCCTG GGATTTTCCG GTAACAGAAT TTGCGCCAGG GGTGCAATAT
TATCGCCGCT ATACTGATTT TTATGGTCGC AATGGTAAGA ATGGGTGGGC GATGGTGAGA
ACCGCTTTAA AACATTCTGA TGTGTGGTAT GAACGGGTCG AAGAATGGCA GTCTCCTATT
TTGAAACGGG AGGACTTACC TGACTGGTTT AAGATGGCTT TATTCAATGA ATTATACCTA
TTAACAGATG GGGGAACCCT GTGGACGGCT GCTTCAGAAA ATGACCCCGT AGGACAATTT
GGGGTACTTG AATGTTTGGA CTATCGGTGG TATGAAAGTT TAGATGTTCG CTTGTATGGT
TCCTTTGGGT TAATGATGCT TTGGCCTCGT TTAGAAAAGT CTGTTTTAGA AGCATTTGCT
AGGGCAATTC CTACCCATGA TGATACCCCT AGAATTATTG GATATAACCA AGCAAAAGCC
ATCCGAAAAG CGAAAGGGGC AACTCCCCAT GACTTAGGAG CACCCAATGA ACATCCTTGG
CAAAAAACCA ATTACACTAG CTATCAAGAT TGTAATTTGT GGAAAGATTT AGGCAGTGAT
TTTGTGCTCC AAGTGTATCG AGATTATTTA TTAACCGGGT CAGATGATAC GGATTTTTTG
TGGGAATGTT GGCCAGCGAT TACTGAAACC TTAGACTATT TAAAGACGTT TGATTTAGAT
AACGATGGCA TTCCCGAAAA TTCAGGAGCC CCGGATCAAA CCTTTGATGA TTGGAAATTA
CAAGGAATTA GTGCCTATTG TGGTGGGTTG TGGATAGCTG CCTTAGAGGC GGCGATTAAA
ATAGCAGAAA TTCTGCTCAA AAATGTGCCA ACTACGGAAG AATTACAGTC GAGAAATAAT
CCTGAATCCA TCAAGCATTA TGTCAAAAAT CATCGTGATT GGCTAGAACA ATCTCGGTCA
ATTTATCACG ATACGTTATG GAATGGAGAA TACTACAAAC TCGATAGTCA AAGCGGTTCT
GATGTGGTGA TGGCGGATCA ATTATGCGGT CAATTTTACG CCCGTTTATT GAATTTCCCC
GATGTCGTTG AAACTCAGTA TACAGAATCA GCTTTAAACA AGGTTTATGA GGCGTGTTTT
TTGAAGTTTC AAGATGGAAA ATATGGTGCA GCTAATGGGA TGAAACCTGA TGGAACTCCC
GAAGATCCTA ACTCAACCCA TCCCCAAGAA GTTTGGACAG GAATTAACTT TGGTTTAGCG
GCATTTTTAT TACAAATGGG ACGCAAAGAT GCAGCTTTTA AGTTAACCGA AGCTGTTGTT
AAACAAGTCT ATGAAAATGG CTTACAATTT CGCACTCCTG AAGCGATAAC GGCTGTGGGA
ACCTTCCGTG CTAGTCATTA TTTACGGGCG ATGGCTATTT GGGGAATTTA TGGCATTTTA
ACCCATTTTC GACCCTAA
 
Protein sequence
MKTLFSLPNI PPCAWKRPIG QGWDKPYTVR YASNLDDGPN HGMPLGGFGS GCIGRSPSGD 
FNLWHLDGGE HVFRSIPSCQ FSVFEQPEGE DAQVYALSTQ PPEEGTLSRW SWYPPEKGTY
HALYPRSWYQ YEGVFKSQLI CEQFSPIIPN NYQETSYPIG IFEWTAHNPT DKPITLSIMV
TWQNIVGWFT NAIKSPEITV RDDGSPEYQY QPRWGDSTGN FNQWVQDNFR VGFILNRIQL
HDEIQEGEGQ ICIASVTNPS VEVFYLGKWN PKGDGSEVWD YFAMNGSLPD TEDETPAEPG
EQIAVAMAIR FTIRPGRTKK IPFILAWDFP VTEFAPGVQY YRRYTDFYGR NGKNGWAMVR
TALKHSDVWY ERVEEWQSPI LKREDLPDWF KMALFNELYL LTDGGTLWTA ASENDPVGQF
GVLECLDYRW YESLDVRLYG SFGLMMLWPR LEKSVLEAFA RAIPTHDDTP RIIGYNQAKA
IRKAKGATPH DLGAPNEHPW QKTNYTSYQD CNLWKDLGSD FVLQVYRDYL LTGSDDTDFL
WECWPAITET LDYLKTFDLD NDGIPENSGA PDQTFDDWKL QGISAYCGGL WIAALEAAIK
IAEILLKNVP TTEELQSRNN PESIKHYVKN HRDWLEQSRS IYHDTLWNGE YYKLDSQSGS
DVVMADQLCG QFYARLLNFP DVVETQYTES ALNKVYEACF LKFQDGKYGA ANGMKPDGTP
EDPNSTHPQE VWTGINFGLA AFLLQMGRKD AAFKLTEAVV KQVYENGLQF RTPEAITAVG
TFRASHYLRA MAIWGIYGIL THFRP