Gene PCC8801_1944 details


Gene Information

Locus tag: PCC8801_1944
Symbol:
ID: 7102892
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 8801
Kingdom: Bacteria
Replicon accession: NC_011726
Strand:
Start bp: 2021169
End bp: 2022629
Gene Length: 1461 bp
Protein Length: 486 aa
Translation table: 11
GC content: 39%
IMG OID: 643475006
Product: alpha amylase catalytic region
Protein accession: YP_002372138
Protein GI: 218246767
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0366] Glycosidases
TIGRFAM ID:
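
The coordinates and lengths listed above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that contributes no residue. The function names `gene_length` and `protein_length` are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(2021169, 2022629)
print(length)                  # 1461
print(protein_length(length))  # 486
```

Both values match the Gene Length and Protein Length fields of the record.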


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCAATTC ATACACCCGA CTGGGTTAAA CACGCGGTTT TCTATCAAAT TTATCCCGAT 
AGTTTTGCCC GAACAGTTCC TCCTGATCAA CAATGGTTAC TCAATATTCC CCTAGAAAAT
TGGCAAGCTT CTCCAACCTT TCAAGGATAC AAAGGGGGTA ATCTTTGGGG GGTAATTGAT
CAACTGGATT ATTTGCAAGA TCTGGGCATT ACAGCCCTCT ATTTTACCCC GATCTTTCAG
TCGGCTTCTA ACCATCGCTA CCATACCCAT GACTATTATC AAGTTGATCC TCTTTTAGGG
GGAAATGAAG CCTTCCACAA GATGCTAAAA GAAGCCCACA AACGTAATAT AAAAGTGGTC
TTGGATGGAG TTTTTAACCA TGCCAGTCGG GGCTTTTTCT TCTTTAATGA TATCTTAGAA
AATGGACCTA ATTCTCCTTG GTTAAATTGG TTTAAAATTA CCGGATGGCC GCTTTCGGCT
TATGATGGAA ATCTTCCCGC TAACTATGTT TCTTGGGTTA ATTATCGCGC CTTACCAGAG
TTTAATCACG ATAACCCTGC GGTGCGAGAA TATATTATGA AAGTAGCAGA ATATTGGTTA
CATCAAGGGA TTGATGGTTG GCGGTTAGAT GTGGCTGCTT GCATTAAAGC AGAGGGGTTT
TGGCAGGAGT TTCGACAACG GGTAAAAGCC ATTAATGCTG ACGCTTATAT TGTTGGAGAA
ATTGTCGATG ATGCTACCCA ATGGTTAGAT GGAACACAAT TTGATGGGGT GATGAATTAT
CCCTTTGGTC GTGCTACTAT TGCTTTTATT GCGGGCGATC GCGTCGTAAC AAATACCGTG
CCTTCTTTCT ATCAACCCTA TCCTGCCATA GACGCTGCCC AATATTCTGT AGAAATTAAT
AATTTATTAC AACGTCATCC TTGGGAAATT GAATTAACCC AATTAAACTT ACTCGATAGC
CACGATACTG CTAGATTAAT TAGCATTGCC GATGGCGATC AATCTACCGT AGAATTAGCA
ACATTATTAC TATTTACCTT TCCTGGTGCG CCTAATATTT TTTATGGCGA TGAAATTGGT
TTACCGGGAG GCCATGAACC AGACTGTCGT CGTGGTTTTC CTTCTGAGGA TCAATGGAAT
CAAGACGTTT TAAATTATCA CCGTCAATTC ATTAAATTAC GCCATCATTA TCCAGCTTTA
CGAATTGGAG AATATCACAC TCTTTATGCC CAACAACAAG TTTATATTTT TGCCCGGATT
TTAGGTAGAG AAGTTTTAAT TATTGCGATC AATGCTGATA GATCTTCTCA AGAAATAAAT
TTATCTTTAG CAGAAAAATT TCAGTCAATT ACTAACGTCA AACCTAATAA CATAGTTTAT
GGAAAAGGAA GCATTAATTG GGATGAATCT AGTATAAGTT TTAGTCTTCC TCCTCGTGAT
GGACTAATCA TTGCCTCTTA A
 
Protein sequence
MSIHTPDWVK HAVFYQIYPD SFARTVPPDQ QWLLNIPLEN WQASPTFQGY KGGNLWGVID 
QLDYLQDLGI TALYFTPIFQ SASNHRYHTH DYYQVDPLLG GNEAFHKMLK EAHKRNIKVV
LDGVFNHASR GFFFFNDILE NGPNSPWLNW FKITGWPLSA YDGNLPANYV SWVNYRALPE
FNHDNPAVRE YIMKVAEYWL HQGIDGWRLD VAACIKAEGF WQEFRQRVKA INADAYIVGE
IVDDATQWLD GTQFDGVMNY PFGRATIAFI AGDRVVTNTV PSFYQPYPAI DAAQYSVEIN
NLLQRHPWEI ELTQLNLLDS HDTARLISIA DGDQSTVELA TLLLFTFPGA PNIFYGDEIG
LPGGHEPDCR RGFPSEDQWN QDVLNYHRQF IKLRHHYPAL RIGEYHTLYA QQQVYIFARI
LGREVLIIAI NADRSSQEIN LSLAEKFQSI TNVKPNNIVY GKGSINWDES SISFSLPPRD
GLIIAS
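
The sequences above are printed in blocks of 10 characters, 60 per line, so the spacing has to be removed before any analysis. The sketch below shows one way to do that and to compute a GC fraction or check the start and stop codons. The helper names `join_blocks` and `gc_fraction` are hypothetical, and only the first and last lines of the gene sequence are used as a demonstration. Pasting in the full gene sequence would be needed to compare against the reported GC content.

```python
def join_blocks(text: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases in the sequence that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First and last lines of the gene sequence, copied from the record.
first_line = join_blocks(
    "ATGTCAATTC ATACACCCGA CTGGGTTAAA CACGCGGTTT TCTATCAAAT TTATCCCGAT"
)
last_line = join_blocks("GGACTAATCA TTGCCTCTTA A")

# Standard stop codons, which also apply under translation table 11.
STOP_CODONS = {"TAA", "TAG", "TGA"}

print(len(first_line))               # 60
print(first_line.startswith("ATG"))  # True
print(last_line[-3:] in STOP_CODONS) # True
```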