Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cyan8802_1248 |
Symbol | |
ID | 8390559 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8802 |
Kingdom | Bacteria |
Replicon accession | NC_013161 |
Strand | + |
Start bp | 1270303 |
End bp | 1271907 |
Gene Length | 1605 bp |
Protein Length | 534 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 644979256 |
Product | glycoside hydrolase family 3 domain protein |
Protein accession | YP_003137007 |
Protein GI | 257059119 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0180507 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATTGGA CTGACGGAAC TCTTAAACAA CAACTCGGAC AAATGATTGT TGTTCGCGCT TCAGGATATT TATTTGATCA CCAAATTCGT TATCCAGCAT GGGAACCCTC TAATGAAAAA TTGCGCCATT GGATAGAAAC CCTCAACCTA GGAGGAGTCA TCTTATTAGG AGGAAGCGCA GGGGAATTAA GCTTAAGAAC CCAACAACTT CAACAATGGT CAAAAAATCC CCTTTTAATA GCAGCAGATA TTGAAGAAGG AGTCGGACAA AGATTTCCGG GTGCAACGTG GTTTCCCCCT CCCATGGCCT TAGGAGAAAT AGCCAAAAAA GACTTGACTC AAGCCAAAGA ATATGCTACA CAAATGGGAG TTATTATTGC CCAAGAAGCC TTAGCAGTTG GCATTAATTG GGTTTTAGCT CCCGTCGTTG ATGTTAATAA TAATCCCAAA AATCCTGTCA TTAATATACG CTCTTTTAGT GATGATCCTA AAATCGTTAG TGAGTTAGCA GTAGCGTTTC TTGAAGGAGC AAAAACCTAT CCCGTTTTAA CCTCAGCTAA GCATTTTCCT GGCCATGGTG ATACGAGCAA TGATTCCCAC ATTGATCTAC CAGTTATTCC TCATGAAATA TCCCGATTAG AAGAGATAGA ATTAGTCCCT TTTAGAGCAA CAATTGGGGC AAATGTTGAT AGTATTATGA CGGCACATTT ATTAATTCCT GCTTGGGATA AAGACCGTCC AGCGACTCTT TCTAAAGCCA TTTTAACAGG GGAATTACGA GCAAGACTAG GCTTTAAAGG ATTAATTGTT ACCGATGCTT TAATTATGGG AGGAGTCGCT AATTATGCTT CCCCCGAAGA AGTCGCAGTG ATGGCAGTAG AAGCGGGAGT TGATATTTTA TTAATGCCAA AAGATCCCGA AAAAACCCTT GAAGCATTAG TCAAAGCAGT GGAAACAGGA CGCATTCCAA GAGAACAAAT AGAAGCTTCT TTAAACCGTA TTTATCAAGC TAAGCAAAAG GTTTTTAAAA ACTCAAAAAC TACTTTTAAT AATCCTCTTT ATTGTGTTGG GGAATTGTCT CAAAAAAGAG CAAAAGAAAC AGTTAAAAAT ATACTAAATA GTTCTCTGGA AAAAGGAAAT AATATTACCC TAAAACCCAA AAAACGCAAT TTAATTGTCG TTGATGATCT CCTAACTTGC ACCTTTTTAG ATCGCCAAAC TCCTGGGGTA ACAATTCCTC AACAATTAGG CTATGATTGT CAAATTGCAG AACTCAATAC TTTAAAGTTT TTCTTAGAAG ATGATTGCAG CACCTTATTA CAGGTTTTCA TTAGAGCAAG TGCCTTTAGA GGGAATGCAG GGTTAAGTGA AGAAGTCCAG AAAATCTATA AAAAATTGCT CAAAAATAAA ATAGTGAAGG GATTAATTAT TTATGGAAGT CCCTATGCTA AAGATTGGTT TTTAACTAAC ACAAACTTAC TTAAAAATCA AGTACCTTGG GTCTTTTCCT ACGGACAAAT GGCAGACAGT CAAAAAATCG CCTGTGAGAC ATTATTTAAT CTGTCAGAAG TCCCTGACAA TTGGGTAGAT AGGTTTGAAA ATTAA
|
Protein sequence | MNWTDGTLKQ QLGQMIVVRA SGYLFDHQIR YPAWEPSNEK LRHWIETLNL GGVILLGGSA GELSLRTQQL QQWSKNPLLI AADIEEGVGQ RFPGATWFPP PMALGEIAKK DLTQAKEYAT QMGVIIAQEA LAVGINWVLA PVVDVNNNPK NPVINIRSFS DDPKIVSELA VAFLEGAKTY PVLTSAKHFP GHGDTSNDSH IDLPVIPHEI SRLEEIELVP FRATIGANVD SIMTAHLLIP AWDKDRPATL SKAILTGELR ARLGFKGLIV TDALIMGGVA NYASPEEVAV MAVEAGVDIL LMPKDPEKTL EALVKAVETG RIPREQIEAS LNRIYQAKQK VFKNSKTTFN NPLYCVGELS QKRAKETVKN ILNSSLEKGN NITLKPKKRN LIVVDDLLTC TFLDRQTPGV TIPQQLGYDC QIAELNTLKF FLEDDCSTLL QVFIRASAFR GNAGLSEEVQ KIYKKLLKNK IVKGLIIYGS PYAKDWFLTN TNLLKNQVPW VFSYGQMADS QKIACETLFN LSEVPDNWVD RFEN
|
| |