Gene PCC8801_2108 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_2108 
Symbol 
ID7104341 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp2178184 
End bp2179665 
Gene Length1482 bp 
Protein Length493 aa 
Translation table11 
GC content44% 
IMG OID643475165 
Productalpha amylase catalytic region 
Protein accessionYP_002372296 
Protein GI218246925 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTAATA ATCTTTATCC CTCTCTTTAT CAAATTAATA CCCGTGTTTG GCTCAATCAA 
CTCTCTGGCC AACTCGGTCG TCCAGCTACC CTAGATGACA TTCCCGATAC AGAACTCGAC
AAACTCGCTA ATTTTGGGTT TGATTGGGTT TATTTTTTGA GTGTTTGGCA AACGGGAGAG
GCCGCACGTC AAGTATCCAT GAGTAATCCC CAATGGTTAG CCGAATATCA CGAACTGTTA
CCCGATTTGC AAGATGAAGA TATTGTCGGC TCAGGATTTG CTATCAAAGA TTATACCTTA
AATACCCGTT TAGGGACATC AGCCTCATTA ATTCGTCTGC GCGATCGCCT CCATCAACGA
AACCTCAAAT TAATGTTAGA TTTCGTTCCT AATCATACTG CTCCCGATCA TGCTTGGGTT
AACTCCCATC CTGAGTATTA TCTTGCTGGA AATGAAAGTC TATTGGCTGA ACAGCCCCAA
AATTATACTA AAATTGACTT GCCTGAAGGA TCAAGAATTT TCGCCTATGG ACGAGATCCC
TATTTTGATG GTTGGCCAGA CACCCTACAA CTCAATTATG GCAATCGGGA CCTGCAAACA
GCCCTAATCA ACGAATTATT AAGGATTTCT CAATGGTGTG ATGGCTTACG CTGTGATATG
GCCATGCTAG TCTTACCGGA AATTTTTCAA CGAACTTGGG GTATTACGAC TGAACCCTTC
TGGCCTAAAG CCATCCCCCA AATTAAAGAA CAACAGCCCA ATTTTGTCTT TATGGCCGAG
GTTTATTGGG ATATGGAATG GACGCTGCAA CAACAGGGGT TTGACTATAC CTATGATAAG
CGATTATACG ATCGCCTGAG AGAACAGATT TCCCGTCCCA TTCGAGAGCA TTTTTGGGCT
GATCTTGACT ACCAAAACAA ATCAACCCGT TTTTTAGAAA ATCACGACGA ACCTCGCGCG
GCTGCTACCT TTCCATCGGG TATTCACCAA GCAGCCGCCA TTTTGACCTT TTTCTGTCCA
GGGTTGCGCT TTTTCCACCA AGGACAGTTA CAGGGATGGA CAAAACGCAT CTCGGTTCAC
TTGGGACGGG GGCCAGACCA ACCCACTGAT CCTAACGTAG AACAGTTTTA TAGCCAATTG
ATCGAAAGTT TACAGTTTAA GGCCTTTCAG GAGGGACAAT GGCAATTACT CGAATGTCAT
CCCGCTTGGT CTGATAATTG GACGTGGGAC TGTTTTATTG CCTTTGCTTG GCAAGGAAAG
GAAGAAGAAC AGGCGATCGT TGTGGTTAAT TATGCGGGAA ACCAAAGTCA AGGTTATATT
TCCGTTCCTT GGTCAAATTT AGCTGGCCAA CACTTTCACC TGCAAGACAT GATGAGTGAT
ACGGTTTACG AGGTTGAGGG TGATAATTTA TTTTCCCCCG GTCTTTATGT AGATTATTCC
CCCTGGGAAT ATCATGTATT TAAGCTAGTT AAAAAAGGAT AA
 
Protein sequence
MSNNLYPSLY QINTRVWLNQ LSGQLGRPAT LDDIPDTELD KLANFGFDWV YFLSVWQTGE 
AARQVSMSNP QWLAEYHELL PDLQDEDIVG SGFAIKDYTL NTRLGTSASL IRLRDRLHQR
NLKLMLDFVP NHTAPDHAWV NSHPEYYLAG NESLLAEQPQ NYTKIDLPEG SRIFAYGRDP
YFDGWPDTLQ LNYGNRDLQT ALINELLRIS QWCDGLRCDM AMLVLPEIFQ RTWGITTEPF
WPKAIPQIKE QQPNFVFMAE VYWDMEWTLQ QQGFDYTYDK RLYDRLREQI SRPIREHFWA
DLDYQNKSTR FLENHDEPRA AATFPSGIHQ AAAILTFFCP GLRFFHQGQL QGWTKRISVH
LGRGPDQPTD PNVEQFYSQL IESLQFKAFQ EGQWQLLECH PAWSDNWTWD CFIAFAWQGK
EEEQAIVVVN YAGNQSQGYI SVPWSNLAGQ HFHLQDMMSD TVYEVEGDNL FSPGLYVDYS
PWEYHVFKLV KKG