Gene Information |
Locus tag | PCC8801_3593 |
Symbol | |
ID | 7103288 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8801 |
Kingdom | Bacteria |
Replicon accession | NC_011726 |
Strand | + |
Start bp | 3743691 |
End bp | 3744878 |
Gene Length | 1188 bp |
Protein Length | 395 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 643476603 |
Product | glucose sorbosone dehydrogenase |
Protein accession | YP_002373711 |
Protein GI | 218248340 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2133] Glucose/sorbosone dehydrogenases |
TIGRFAM ID | |
| |
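The coordinate, gene length, and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon. The function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length_bp // 3 - 1


length = gene_length(3743691, 3744878)
print(length)                  # 1188
print(protein_length(length))  # 395
```

Both values agree with the Gene Length and Protein Length fields in this record.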
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATAGAC TATCATTAAC CGTTAAACCA GTTTTCAACT CTGGACTCGG AGTGCTTATT TTTTTACTGC TAGGAGGATG TGTATTTAAA ACCAATTCCA CTGCCTCCCA ACCGTCTCAA ATCCAAGAAT CCCAGAGAGA AGTCAATCAA GTTACTGTTG TTGAAGGATT AGAGCATCCC TGGAGTATGG CCTGGCTTCC CAATGGGGAT ATGTTGATTA CAGAACGCCC TGGACGACTT CGGTTAGTGA AAAATGGGGT ACTACAACCC ACTCCTATTG CTGGCGTGAT GGAGGTGCTT CAATTGGGAC AGGGAGGGTT AATGGAGGTG TCATTACACC CTAACTTTAG CGAAAATCGC CTTGTTTATT TCACCTATGC CCATGGAACT GCTGAAGCTA ATCGTACCCG CATTGCTCGT GCTACCTTTG ATGGACAAGC GTTACGAGAC GTACAGGTGA TCTTTGAAGT GACTCCCGCT AAACCAGGGG GACAACATTT TGGTTCTCGT TTAGTTTGGC TCAAAGACCA AACGATGTTG ATTTCCATTG GAGATGGCGG AAATCCTCCG CTTTCTCTTG ATGGTGAACT GATTCGTCTA CAAGCACAAA ATCTCGGTAA TCCTCTGGGT AAAATTATTC GCTTGAAGGA CGATGGCAGT ATTCCCGATG ATAATCCCTT TGTTGGACAA AAGGAGGCAC AAAAAGCCAT TTGGAGCTAT GGACATCGTA ATATTCAGGG ATTAGCGGTT AATCAAGCTA CTGGGCAAAT TTGGGCGACA GAACACGGTT CTCGGGGTGG GGATGAACTT AACGCCATTA AAGGGGGTCA GAACTATGGT TGGCCGCTGG TGACGCACAG TGAAGAATAT TTTGGGGGTG AAATTTCCAG TGAACGCTCC CGCTCAGGGA TGATTGATCC TTTAATTGTT TGGACTCCTG CGATCGCTCC GTCTGGGTTA GCTATTTATC AGGGGACTCG CTTTCCCCAG TGGCAAGGTG ATTTATTTGC GGGAGGATTG GTTGGTAAAG AAGTTCGTCA TATTGACTTG GATAGTTCTG GTCAAGTGAT AGAACAAAAA TCAATTCCCT TCTCTCAAAG AGTCCGTGAT GTTAAACAGG GTCCTGACGG GTTTCTTTAT GTTTTAACCG ATGCTCCCAA TGGCAAGTTA ATCCGTCTTG AACCCTAA
Protein sequence | MNRLSLTVKP VFNSGLGVLI FLLLGGCVFK TNSTASQPSQ IQESQREVNQ VTVVEGLEHP WSMAWLPNGD MLITERPGRL RLVKNGVLQP TPIAGVMEVL QLGQGGLMEV SLHPNFSENR LVYFTYAHGT AEANRTRIAR ATFDGQALRD VQVIFEVTPA KPGGQHFGSR LVWLKDQTML ISIGDGGNPP LSLDGELIRL QAQNLGNPLG KIIRLKDDGS IPDDNPFVGQ KEAQKAIWSY GHRNIQGLAV NQATGQIWAT EHGSRGGDEL NAIKGGQNYG WPLVTHSEEY FGGEISSERS RSGMIDPLIV WTPAIAPSGL AIYQGTRFPQ WQGDLFAGGL VGKEVRHIDL DSSGQVIEQK SIPFSQRVRD VKQGPDGFLY VLTDAPNGKL IRLEP
| |
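The gene and protein sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that grouping, translate the DNA, and compute GC content. It is an illustration only: the helper names (`clean`, `translate`, `gc_percent`) are hypothetical, and the codon table is built by hand rather than taken from a library. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code (it differs in which codons may act as starts), so the standard assignments are used here.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group the sequence into blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First seven codons and last six codons of the gene sequence in this record.
print(translate("ATGAATAGAC TATCATTAAC C"))  # MNRLSLT
print(translate("ATCCGTCTTG AACCCTAA"))      # IRLEP
```

The two fragments translate to the first seven and last five residues of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the whole record to be checked in the same way.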