Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cyan8802_1971 |
Symbol | |
ID | 8391287 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8802 |
Kingdom | Bacteria |
Replicon accession | NC_013161 |
Strand | - |
Start bp | 1991448 |
End bp | 1992908 |
Gene Length | 1461 bp |
Protein Length | 486 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 644979952 |
Product | alpha amylase catalytic region |
Protein accession | YP_003137697 |
Protein GI | 257059809 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0248141 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0160527 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCAATTC ATACACCCGA CTGGGTTAAA CACGCGGTTT TTTATCAAAT TTATCCCGAT AGTTTTGCCC GAACAGTTCC TCCTGATCAA CAATGGTTAC TCAATATTCC CCTAGAAAAT TGGCAAGCTT CTCCAACCTT TCAAGGATAC AAAGGGGGTA ATCTTTGGGG GGTAATTGAT CAACTGGATT ATTTGCAAGA TCTGGGCATT ACAGCCCTCT ATTTTACCCC GATCTTTCAG TCGGCTTCTA ACCATCGCTA CCATACCCAT GACTATTATC AAGTTGATCC TCTTTTAGGG GGAAATGAAG CCTTCCACAA GATGCTAAAA GAAGCCCACA AACGTAATAT AAAAGTGGTC TTGGATGGAG TTTTTAACCA TGCCAGTCGG GGCTTTTTCT TCTTTAATGA TATCTTAGAA AATGGACCTA ATTCTCCTTG GTTAAATTGG TTTAAAATTA CCGGATGGCC GCTTTCGGCT TATGATGGAA ATCTTCCCGC TAACTATGTT TCTTGGGTTA ATTATCGCGC CTTACCAGAG TTTAATCACC ATAACCCTGC GGTCCGAGAA TATATTATGA AAGTAGCAGA ATATTGGTTA CATCAAGGGA TTGATGGTTG GCGGTTAGAT GTGGCTTCTT GCATTAAAGC AGAGGGGTTT TGGCAAGAAT TTCGACAACG GGTAAAAGCC ATTAATGCTG ACGCTTATAT TGTTGGAGAA ATTGTCGATG ATGCTACCCA ATGGTTAGAT GGAACACAAT TTGATGGGGT GATGAATTAT CCCTTTGGTC GTGCTACTAT TGCTTTTATT GCGGGCGATC GCGTCGTAAC AAACACGGTT CCTTCTTTCT ATCAACCCTA TCCTGCCATA GACGCTCCCC AATATTCTGT AGAAATTAAT AATTTATTAC AACGTCATCC TTGGGAAATT GAATTAACCC AATTAAACTT ACTCGATAGC CACGATACTG CTAGATTAAT TAGCATTGCC GATGGCGATC AATCTACCGT AGAATTAGCA ACATTATTAC TATTTACCTT TCCTGGTGCG CCTAATATTT TTTATGGCGA TGAAATTGGT TTACCGGGAG GCCATGAACC AGACTGTCGT CGTGGTTTTC CTTCTGAGGA TCAATGGAAT CAAGACGTTT TAAATTATCA CCGTCAATTC ATTAAATTAC GCCATCATTA TCCAGCTTTA CGAATTGGAG AATATCACAC TCTTTATGCC CAACAACAAG TTTATATTTT TGCCCGGATT TTAGGTAGAG AAGTTTTAAT TATTGCGATC AATGCTGATA GATCTTCTCA AGAAATAAAT TTATCTTTAG CAGAAAAATT TCAGTCAATT ACTAACGTCA AACCTAATAA CATAGTTTAT GGAAAAGGAA GCATTAATTG GGATGAATCT AGTATAAGTT TTAGTCTTCC TCCTCGTGAT GGACTAATCA TTGCCTCTTA A
|
Protein sequence | MSIHTPDWVK HAVFYQIYPD SFARTVPPDQ QWLLNIPLEN WQASPTFQGY KGGNLWGVID QLDYLQDLGI TALYFTPIFQ SASNHRYHTH DYYQVDPLLG GNEAFHKMLK EAHKRNIKVV LDGVFNHASR GFFFFNDILE NGPNSPWLNW FKITGWPLSA YDGNLPANYV SWVNYRALPE FNHHNPAVRE YIMKVAEYWL HQGIDGWRLD VASCIKAEGF WQEFRQRVKA INADAYIVGE IVDDATQWLD GTQFDGVMNY PFGRATIAFI AGDRVVTNTV PSFYQPYPAI DAPQYSVEIN NLLQRHPWEI ELTQLNLLDS HDTARLISIA DGDQSTVELA TLLLFTFPGA PNIFYGDEIG LPGGHEPDCR RGFPSEDQWN QDVLNYHRQF IKLRHHYPAL RIGEYHTLYA QQQVYIFARI LGREVLIIAI NADRSSQEIN LSLAEKFQSI TNVKPNNIVY GKGSINWDES SISFSLPPRD GLIIAS
|
| |