Gene PCC8801_3334 details


Gene Information

Locus tag: PCC8801_3334
Symbol:
ID: 7102995
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 8801
Kingdom: Bacteria
Replicon accession: NC_011726
Strand:
Start bp: 3480579
End bp: 3481739
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 43%
IMG OID: 643476350
Product: CDP-glucose 4,6-dehydratase
Protein accession: YP_002373460
Protein GI: 218248089
COG category: [G] Carbohydrate transport and metabolism; [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0451] Nucleoside-diphosphate-sugar epimerases
TIGRFAM ID: [TIGR02622] CDP-glucose 4,6-dehydratase
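
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and not part of the database record. It assumes that the start and end coordinates are inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 3480579
end_bp = 3481739

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, "bp")     # 1161 bp
print(protein_length_aa, "aa")  # 386 aa
```

Under these assumptions the computed values agree with the listed Gene Length (1161 bp) and Protein Length (386 aa).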


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAAAGTA TGGTAAAGTC ATCATTTTGG TTAGATAAAA AAGTTTTTAT CACGGGACAC 
ACAGGATTTA AAGGCTCTTG GTTGTCTTTG TGGTTACTAC AGCTAGGGGC TAAGATCAAA
GGACTTAGTT TAGAACCCAA TACTCATCCC GCTTTATTTG AACAGCTAGG ACTAGCTGAG
CAACTGTCTC ACCACATCGG CGATATTCGG GATAGAGAGT TGGTGAGTCG TTTAATTCAT
CAATGGCAAC CGGATGTTAT TTTTCATTTG GCTGCCCAAC CTTTAGTCAG ACGTTCCTAC
CTTGAATCGG TAGAAACATG GAATATTAAC GTTATGGGAA CAGTCTATGT TTTGGAAGCT
TTGAAGTCCC TCACAACTCC CTGTGCGTCT ATTTTTATTA CGACGGATAA ATGCTATGAC
AATCGAGAAT GGCTCTATGG TTACCGAGAA AATGACCCGT TAGGGGGTTA TGATCCCTAT
AGTTCCAGCA AAGCGGGTGC AGAGTTAGCA ATCGCTTCTT GGCGTAATTC TTTCTTTAAA
AATAGTCAAA CCCCGATTGG AATAGCCAGC GTGCGGGCTG GCAATGTTAT TGGAGGAGGA
GACTGGGCTG AAAATCGAAT TGTTCCGGAT GCTATGCGAG CTTTAATGGC TAAGCAGGCG
ATTCCTGTGC GTAATCCCCA AGCAACTCGA CCTTGGCAGC ACGTTCTGGA ACCGTTGGGG
GGTTATCTTC TCTTAGCACA ACGGATTTAT GAACAGCTAA TGACACCTTA TTGGCAGCAA
GATCTAAGGG GTTTATATGG GGCGTTCAAT TTTGGACCGT CTTTGCGTTC TAATCGTACT
GTTCGGGATT TAGTCGAGGG TATCTTATCC CATTGGTCGG GAACTTGGTT AAATCAATGC
GTTTCCAACG CTGTCCATGA GGCGAAATTA CTCAATTTAG TGACAGACAA AGCCTTTCAT
ACTTTACGAT GGCAGCCCAT TTGGGATTTT GAAGAAACTG TTCAAAAAAC GGTCACTTGG
TATTACCAAG CGAGTCAGAT GGCGATCGCA GATAAGCAAG AATTTCGAGC TTTAACCCAG
AAGCAAATTG AACAGTATCA AAACGATGCT CATCAACTGA CAAATTCCCC GGAGAAACTA
ACTTTTCAAG AGGTTCAATA G
 
Protein sequence
MESMVKSSFW LDKKVFITGH TGFKGSWLSL WLLQLGAKIK GLSLEPNTHP ALFEQLGLAE 
QLSHHIGDIR DRELVSRLIH QWQPDVIFHL AAQPLVRRSY LESVETWNIN VMGTVYVLEA
LKSLTTPCAS IFITTDKCYD NREWLYGYRE NDPLGGYDPY SSSKAGAELA IASWRNSFFK
NSQTPIGIAS VRAGNVIGGG DWAENRIVPD AMRALMAKQA IPVRNPQATR PWQHVLEPLG
GYLLLAQRIY EQLMTPYWQQ DLRGLYGAFN FGPSLRSNRT VRDLVEGILS HWSGTWLNQC
VSNAVHEAKL LNLVTDKAFH TLRWQPIWDF EETVQKTVTW YYQASQMAIA DKQEFRALTQ
KQIEQYQNDA HQLTNSPEKL TFQEVQ
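
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It is not the tool used to produce this record. The helper name `translate` is hypothetical, and the codon table is written out by hand in the conventional TCAG ordering. Translation table 11 assigns the same amino acids to codons as the standard table and differs only in which codons may act as start codons, so the standard assignments are used here.

```python
# Codon table in TCAG order (first base, then second, then third).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"   # first base T
    "LLLLPPPPHHQQRRRR"   # first base C
    "IIIMTTTTNNKKSSRR"   # first base A
    "VVVVAAAADDEEGGGG"   # first base G
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base spacing used above) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":  # stop codon: end of the protein
            break
        protein.append(residue)
    return "".join(protein)

# First 30 bases of the gene sequence listed above.
print(translate("ATGGAAAGTA TGGTAAAGTC ATCATTTTGG"))  # MESMVKSSFW
```

The first ten residues produced this way match the start of the protein sequence shown above. Passing the full gene sequence to the same function would be the natural way to compare against the complete 386-residue protein.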