Gene Cyan8802_1971 details

Gene Information

Locus tag: Cyan8802_1971
Symbol:
ID: 8391287
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 8802
Kingdom: Bacteria
Replicon accession: NC_013161
Strand:
Start bp: 1991448
End bp: 1992908
Gene Length: 1461 bp
Protein Length: 486 aa
Translation table: 11
GC content: 39%
IMG OID: 644979952
Product: alpha amylase catalytic region
Protein accession: YP_003137697
Protein GI: 257059809
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0366] Glycosidases
TIGRFAM ID:
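
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state its coordinate convention) and that the last codon of the gene is a stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 1991448
END_BP = 1992908


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming the final codon is a stop codon."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon does not encode a residue


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length)                  # 1461
    print(protein_length(length))  # 486
```

Under these assumptions, both results agree with the Gene Length and Protein Length fields in the record.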


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0248141
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.0160527
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCAATTC ATACACCCGA CTGGGTTAAA CACGCGGTTT TTTATCAAAT TTATCCCGAT 
AGTTTTGCCC GAACAGTTCC TCCTGATCAA CAATGGTTAC TCAATATTCC CCTAGAAAAT
TGGCAAGCTT CTCCAACCTT TCAAGGATAC AAAGGGGGTA ATCTTTGGGG GGTAATTGAT
CAACTGGATT ATTTGCAAGA TCTGGGCATT ACAGCCCTCT ATTTTACCCC GATCTTTCAG
TCGGCTTCTA ACCATCGCTA CCATACCCAT GACTATTATC AAGTTGATCC TCTTTTAGGG
GGAAATGAAG CCTTCCACAA GATGCTAAAA GAAGCCCACA AACGTAATAT AAAAGTGGTC
TTGGATGGAG TTTTTAACCA TGCCAGTCGG GGCTTTTTCT TCTTTAATGA TATCTTAGAA
AATGGACCTA ATTCTCCTTG GTTAAATTGG TTTAAAATTA CCGGATGGCC GCTTTCGGCT
TATGATGGAA ATCTTCCCGC TAACTATGTT TCTTGGGTTA ATTATCGCGC CTTACCAGAG
TTTAATCACC ATAACCCTGC GGTCCGAGAA TATATTATGA AAGTAGCAGA ATATTGGTTA
CATCAAGGGA TTGATGGTTG GCGGTTAGAT GTGGCTTCTT GCATTAAAGC AGAGGGGTTT
TGGCAAGAAT TTCGACAACG GGTAAAAGCC ATTAATGCTG ACGCTTATAT TGTTGGAGAA
ATTGTCGATG ATGCTACCCA ATGGTTAGAT GGAACACAAT TTGATGGGGT GATGAATTAT
CCCTTTGGTC GTGCTACTAT TGCTTTTATT GCGGGCGATC GCGTCGTAAC AAACACGGTT
CCTTCTTTCT ATCAACCCTA TCCTGCCATA GACGCTCCCC AATATTCTGT AGAAATTAAT
AATTTATTAC AACGTCATCC TTGGGAAATT GAATTAACCC AATTAAACTT ACTCGATAGC
CACGATACTG CTAGATTAAT TAGCATTGCC GATGGCGATC AATCTACCGT AGAATTAGCA
ACATTATTAC TATTTACCTT TCCTGGTGCG CCTAATATTT TTTATGGCGA TGAAATTGGT
TTACCGGGAG GCCATGAACC AGACTGTCGT CGTGGTTTTC CTTCTGAGGA TCAATGGAAT
CAAGACGTTT TAAATTATCA CCGTCAATTC ATTAAATTAC GCCATCATTA TCCAGCTTTA
CGAATTGGAG AATATCACAC TCTTTATGCC CAACAACAAG TTTATATTTT TGCCCGGATT
TTAGGTAGAG AAGTTTTAAT TATTGCGATC AATGCTGATA GATCTTCTCA AGAAATAAAT
TTATCTTTAG CAGAAAAATT TCAGTCAATT ACTAACGTCA AACCTAATAA CATAGTTTAT
GGAAAAGGAA GCATTAATTG GGATGAATCT AGTATAAGTT TTAGTCTTCC TCCTCGTGAT
GGACTAATCA TTGCCTCTTA A
 
Protein sequence
MSIHTPDWVK HAVFYQIYPD SFARTVPPDQ QWLLNIPLEN WQASPTFQGY KGGNLWGVID 
QLDYLQDLGI TALYFTPIFQ SASNHRYHTH DYYQVDPLLG GNEAFHKMLK EAHKRNIKVV
LDGVFNHASR GFFFFNDILE NGPNSPWLNW FKITGWPLSA YDGNLPANYV SWVNYRALPE
FNHHNPAVRE YIMKVAEYWL HQGIDGWRLD VASCIKAEGF WQEFRQRVKA INADAYIVGE
IVDDATQWLD GTQFDGVMNY PFGRATIAFI AGDRVVTNTV PSFYQPYPAI DAPQYSVEIN
NLLQRHPWEI ELTQLNLLDS HDTARLISIA DGDQSTVELA TLLLFTFPGA PNIFYGDEIG
LPGGHEPDCR RGFPSEDQWN QDVLNYHRQF IKLRHHYPAL RIGEYHTLYA QQQVYIFARI
LGREVLIIAI NADRSSQEIN LSLAEKFQSI TNVKPNNIVY GKGSINWDES SISFSLPPRD
GLIIAS
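
The gene and protein sequences above can be checked against each other by translating the nucleotide sequence. The following is a minimal sketch, not the method used by the database. It assumes the sequence is given in the coding orientation, removes the spaces that separate the 10-base blocks, and uses the standard amino acid assignments, which translation table 11 shares with the standard code. Table 11's alternative start codons are ignored, which does not matter here because the gene starts with ATG. All names in the code are illustrative.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (first, second, third base).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block-separating whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


if __name__ == "__main__":
    # First 30 bases of the gene sequence, copied from the record.
    print(translate("ATGTCAATTC ATACACCCGA CTGGGTTAAA"))  # MSIHTPDWVK
```

Passing the full gene sequence to `translate` and `gc_percent` gives values to compare with the protein sequence and the GC content field.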