Gene PCC8801_1218 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_1218 
Symbol 
ID7104913 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp1269441 
End bp1271045 
Gene Length1605 bp 
Protein Length534 aa 
Translation table11 
GC content38% 
IMG OID643474302 
Productglycoside hydrolase family 3 domain protein 
Protein accessionYP_002371440 
Protein GI218246069 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1472] Beta-glucosidase-related glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATTGGA CTGACGGAAC TCTTAAACAA CAACTCGGAC AAATGATTGT TGTTCGCGCT 
TCAGGATATT TATTTGATCA CCAAATTCGT TATCCAGCAT GGGAACCCTC TAATGAAAAA
TTGCGCCATT GGATAGAAAC CCTCAACCTA GGAGGAGTCA TCTTATTAGG AGGAAGCGCA
GGGGAATTAA GCTTAAGAAC CCAACAACTT CAACAATGGT CAAAAAATCC CCTTTTAATA
GCAGCAGATA TTGAAGAAGG AGTCGGACAA AGATTTCCGG GTGCAACGTG GTTTCCCCCT
CCCATGGCCT TAGGAGAAAT AGCCAAAAAA GACTTGACTC AAGCCAAAGA ATATGCTACA
CAAATGGGAG TTATTATTGC CCAAGAAGCC TTAGCAGTTG GCATTAATTG GGTTTTAGCT
CCCGTCGTTG ATGTTAATAA TAATCCCAAA AATCCTGTCA TTAATATACG CTCTTTTAGT
GATGATCCTA AAATCGTTAG TGAGTTAGCA GTAGCGTTTC TTGAAGGAGC AAAAACCTAT
CCCGTTTTAA CCTCAGCTAA GCATTTTCCT GGCCATGGTG ATACGAGCAA TGATTCCCAC
ATTGATCTAC CAGTTATTCC TCATGAAATA TCCCGATTAG AAGAGATAGA ATTAGTCCCT
TTTAGAGCAA CAATTGGGGC AAATGTTGAT AGTATTATGA CGGCACATTT ATTAATTCCT
GCTTGGGATA AAGACCGTCC AGCGACTCTT TCTAAAGCCA TTTTAACAGG GGAATTACGA
GCAAGACTAG GCTTTAAAGG ATTAATTGTT ACCGATGCTT TAATTATGGG AGGAGTCGCT
AATTATGCTT CCCCCGAAGA AGTCGCAGTG ATGGCAGTAG AAGCGGGAGT TGATATTTTA
TTAATGCCAA AAGATCCCGA AAAAACCCTT GAAGCATTAG TCAAAGCAGT GGAAACAGGA
CGCATTCCAA GAGAACAAAT AGAAGCTTCT TTAAACCGTA TTTATCAAGC TAAGCAAAAG
GTTTTTAAAA ACTCAAAAAC TACTTTTAAT AATCCTCTTT ATTGTGTTGG GGAATTGTCT
CAAAAAAGAG CAAAAGAAAC AGTTAAAAAT ATACTAAATA GTTCTCTGGA AAAAGGAAAT
AATATTACCC TAAAACCCAA AAAACGCAAT TTAATTGTCG TTGATGATCT CCTAACTTGC
ACCTTTTTAG ATCGCCAAAC TCCTGGGGTA ACAATTCCTC AACAATTAGG CTATGATTGT
CAAATTGCAG AACTCAATAC TTTAAAGTTT TTCTTAGAAG ATGATTGCAG CACCTTATTA
CAGGTTTTCA TTAGAGCAAG TGCCTTTAGA GGGAATGCAG GGTTAAGTGA AGAAGTCCAG
AAAATCTATA AAAAATTGCT CAAAAATAAA ATAGTGAAGG GATTAATTAT TTATGGAAGT
CCCTATGCTA AAGATTGGTT TTTAACTAAC ACAAACTTAC TTAAAAATCA AGTACCTTGG
GTCTTTTCCT ACGGACAAAT GGCAGACAGT CAAAAAATCG CCTGTGAGAC ATTATTTAAT
CTGTCAGAAG TCCCTGACAA TTGGGTAGAT AGGTTTGAAA ATTAA
 
Protein sequence
MNWTDGTLKQ QLGQMIVVRA SGYLFDHQIR YPAWEPSNEK LRHWIETLNL GGVILLGGSA 
GELSLRTQQL QQWSKNPLLI AADIEEGVGQ RFPGATWFPP PMALGEIAKK DLTQAKEYAT
QMGVIIAQEA LAVGINWVLA PVVDVNNNPK NPVINIRSFS DDPKIVSELA VAFLEGAKTY
PVLTSAKHFP GHGDTSNDSH IDLPVIPHEI SRLEEIELVP FRATIGANVD SIMTAHLLIP
AWDKDRPATL SKAILTGELR ARLGFKGLIV TDALIMGGVA NYASPEEVAV MAVEAGVDIL
LMPKDPEKTL EALVKAVETG RIPREQIEAS LNRIYQAKQK VFKNSKTTFN NPLYCVGELS
QKRAKETVKN ILNSSLEKGN NITLKPKKRN LIVVDDLLTC TFLDRQTPGV TIPQQLGYDC
QIAELNTLKF FLEDDCSTLL QVFIRASAFR GNAGLSEEVQ KIYKKLLKNK IVKGLIIYGS
PYAKDWFLTN TNLLKNQVPW VFSYGQMADS QKIACETLFN LSEVPDNWVD RFEN