Gene Cyan8802_2783 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCyan8802_2783 
Symbol 
ID8392110 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8802 
KingdomBacteria 
Replicon accessionNC_013161 
Strand
Start bp2811324 
End bp2812484 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content44% 
IMG OID644980737 
ProductCDP-glucose 4,6-dehydratase 
Protein accessionYP_003138472 
Protein GI257060584 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID[TIGR02622] CDP-glucose 4,6-dehydratase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.824232 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAAGTA TGGTAAAGTC ATCGTTTTGG TTAAATAAAA AAGTTTTTAT CACGGGACAC 
ACAGGATTTA AAGGCTCTTG GTTGTCTTTG TGGTTACTAC AGCTAGGGGC TAAGATCAAA
GGACTTAGTT TAGAGCCCAA TACTCATCCG GCTTTATTTG AACAGCTAGG ACTAGCTGAG
CAACTGTCTC ACCACATCGG CGATATTCGG GATAGAGAGT TGGTGAGTCG TTTAATTCGT
CAATGGCAAC CGGATATTAT TTTTCATTTG GCTGCCCAAC CGTTAGTCAG ACGTTCCTAC
CTTGAATCGG TAGAAACATG GAATATTAAC GTTATGGGAA CAGTCTATGT TTTGGAAGCT
TTGAAGTCCC TCACAACTCC CTGTGCGTCT ATTTTTATTA CGACGGATAA ATGCTATGAC
AATCGAGAAT GGCTCTATGG TTATCGAGAA AATGACCCGT TAGGGGGTTA TGATCCCTAT
AGTTCTAGTA AAGCGGGTGC AGAGTTAGCG ATCGCTTCTT GGCGTAATTC TTTCTTTAAA
AATAGTCAAA CCCCGATTGG GATAGCGAGC GTACGAGCCG GCAATGTTAT CGGAGGAGGA
GACTGGGCTG AGAATCGCAT TGTTCCGGAT GCTATGCGAG CTTTAATGGC TAAGCAGGCG
ATTCCTGTGC GTAATCCCCA AGCAACTCGA CCTTGGCAGC ACGTTCTTGA GCCATTGGGG
GGTTATCTTC TCTTAGCACA ACGGATTTAT GAACAGCTAA TGACACCTCA TTGGCAGCAA
GATTCAAGGG GTTTATATGG GGCGTTCAAT TTTGGACCGT CTTTGCGTTC TAATCGTACT
GTTCGGGATT TAGTCGAGGG TATCTTATCC CATTGGTCGG GAACTTGGTT AAATCAAGGC
GTTCCCAACG CTGTCCATGA GGCGAAATTG CTCAATTTAG TGACAGACAA AGCCTTTCAT
AGTTTACGAT GGCAGCCCAT TTGGGATTTT GAAGAAACTG TTCAAAAAAC GGTCACTTGG
TATTATCAAG CGAGTCAGAT GGCGATCGCA GATAAGCAAG AATTTCGAGC TTTAACCCAG
AAGCAAATTG AACAGTATCA AAACGATGCT TATCAACTGA CAAATTCCCT GGAGAAACTA
ACTTTTCAAG AGGTTCAATA G
 
Protein sequence
MESMVKSSFW LNKKVFITGH TGFKGSWLSL WLLQLGAKIK GLSLEPNTHP ALFEQLGLAE 
QLSHHIGDIR DRELVSRLIR QWQPDIIFHL AAQPLVRRSY LESVETWNIN VMGTVYVLEA
LKSLTTPCAS IFITTDKCYD NREWLYGYRE NDPLGGYDPY SSSKAGAELA IASWRNSFFK
NSQTPIGIAS VRAGNVIGGG DWAENRIVPD AMRALMAKQA IPVRNPQATR PWQHVLEPLG
GYLLLAQRIY EQLMTPHWQQ DSRGLYGAFN FGPSLRSNRT VRDLVEGILS HWSGTWLNQG
VPNAVHEAKL LNLVTDKAFH SLRWQPIWDF EETVQKTVTW YYQASQMAIA DKQEFRALTQ
KQIEQYQNDA YQLTNSLEKL TFQEVQ