Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PCC8801_3334 |
Symbol | |
ID | 7102995 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8801 |
Kingdom | Bacteria |
Replicon accession | NC_011726 |
Strand | - |
Start bp | 3480579 |
End bp | 3481739 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 643476350 |
Product | CDP-glucose 4,6-dehydratase |
Protein accession | YP_002373460 |
Protein GI | 218248089 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0451] Nucleoside-diphosphate-sugar epimerases |
TIGRFAM ID | [TIGR02622] CDP-glucose 4,6-dehydratase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAAGTA TGGTAAAGTC ATCATTTTGG TTAGATAAAA AAGTTTTTAT CACGGGACAC ACAGGATTTA AAGGCTCTTG GTTGTCTTTG TGGTTACTAC AGCTAGGGGC TAAGATCAAA GGACTTAGTT TAGAACCCAA TACTCATCCC GCTTTATTTG AACAGCTAGG ACTAGCTGAG CAACTGTCTC ACCACATCGG CGATATTCGG GATAGAGAGT TGGTGAGTCG TTTAATTCAT CAATGGCAAC CGGATGTTAT TTTTCATTTG GCTGCCCAAC CTTTAGTCAG ACGTTCCTAC CTTGAATCGG TAGAAACATG GAATATTAAC GTTATGGGAA CAGTCTATGT TTTGGAAGCT TTGAAGTCCC TCACAACTCC CTGTGCGTCT ATTTTTATTA CGACGGATAA ATGCTATGAC AATCGAGAAT GGCTCTATGG TTACCGAGAA AATGACCCGT TAGGGGGTTA TGATCCCTAT AGTTCCAGCA AAGCGGGTGC AGAGTTAGCA ATCGCTTCTT GGCGTAATTC TTTCTTTAAA AATAGTCAAA CCCCGATTGG AATAGCCAGC GTGCGGGCTG GCAATGTTAT TGGAGGAGGA GACTGGGCTG AAAATCGAAT TGTTCCGGAT GCTATGCGAG CTTTAATGGC TAAGCAGGCG ATTCCTGTGC GTAATCCCCA AGCAACTCGA CCTTGGCAGC ACGTTCTGGA ACCGTTGGGG GGTTATCTTC TCTTAGCACA ACGGATTTAT GAACAGCTAA TGACACCTTA TTGGCAGCAA GATCTAAGGG GTTTATATGG GGCGTTCAAT TTTGGACCGT CTTTGCGTTC TAATCGTACT GTTCGGGATT TAGTCGAGGG TATCTTATCC CATTGGTCGG GAACTTGGTT AAATCAATGC GTTTCCAACG CTGTCCATGA GGCGAAATTA CTCAATTTAG TGACAGACAA AGCCTTTCAT ACTTTACGAT GGCAGCCCAT TTGGGATTTT GAAGAAACTG TTCAAAAAAC GGTCACTTGG TATTACCAAG CGAGTCAGAT GGCGATCGCA GATAAGCAAG AATTTCGAGC TTTAACCCAG AAGCAAATTG AACAGTATCA AAACGATGCT CATCAACTGA CAAATTCCCC GGAGAAACTA ACTTTTCAAG AGGTTCAATA G
|
Protein sequence | MESMVKSSFW LDKKVFITGH TGFKGSWLSL WLLQLGAKIK GLSLEPNTHP ALFEQLGLAE QLSHHIGDIR DRELVSRLIH QWQPDVIFHL AAQPLVRRSY LESVETWNIN VMGTVYVLEA LKSLTTPCAS IFITTDKCYD NREWLYGYRE NDPLGGYDPY SSSKAGAELA IASWRNSFFK NSQTPIGIAS VRAGNVIGGG DWAENRIVPD AMRALMAKQA IPVRNPQATR PWQHVLEPLG GYLLLAQRIY EQLMTPYWQQ DLRGLYGAFN FGPSLRSNRT VRDLVEGILS HWSGTWLNQC VSNAVHEAKL LNLVTDKAFH TLRWQPIWDF EETVQKTVTW YYQASQMAIA DKQEFRALTQ KQIEQYQNDA HQLTNSPEKL TFQEVQ
|
| |