Gene P9303_01401 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_01401 
SymbolrfbG 
ID4776573 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp153676 
End bp154773 
Gene Length1098 bp 
Protein Length365 aa 
Translation table11 
GC content54% 
IMG OID640085639 
ProductCDP-glucose 4,6-dehydratase 
Protein accessionYP_001016160 
Protein GI124021853 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID[TIGR02622] CDP-glucose 4,6-dehydratase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAAACA TGGTGATGGC TGAGAGCTGG AATCCAGATT TTTGGTGTGG CAGGCGCGTC 
TGGCTCAGTG GCCACACAGG CTTTAAAGGC AGCTGGTTGG CTCTTTGGTT GCTGCATTGG
GGAGCAGTTG TGGAAGGCTA TGCCCTTGAT CCAGACCAGG CTGGTGGACC CTCCTTATTC
GACAGCCTTG AGATCGCGGC CGATCTGTCA CGTGATGAGC GCGCTGATTT GGCGCAGGCT
GATCATCTAG TCACGCGGCT GCGCGCCTTT CAGCCCGAGG TGGTGTTTCA TCTTGCTGCC
CAGCCGCTTG TGACCAGAAG TTACCGAGAA CCTTTGCAGA CGTGGAATAC CAACGTGATT
GGTACCTGCC ATGTGCTCGA GGCTTTGCGA CAATTGGATC ATCCCTGTGT GTCGGTGATG
ATCACCACCG ACAAGGTCTA CGACAATCAA GAGTGGGAGT ATGGCTATCG GGAGAATGAT
CCCCTCGGAG GTCATGATCC TTACAGCAGT AGTAAGGCAG CAGCAGAACT TGCGATTGCG
AGTTGGCGAG CAAGTTTTTG TGGTCTCTTG CCCCATCAAG TGCCAAATTT GAAGCTGGTT
TCAGCGCGTG CAGGCAATGT GATTGGCGGT GGCGATTGGG CTGAAAATCG CATTGTTCCT
GATGCCATAC GTGCTCTGAT CAAGGCAGAG CCAATTGTCG TGCGTGCGCC TGAATCGACA
AGGCCGTGGC AGCATGTGCT TGAGCCTCTG GGTGGCTATC TCGCCTTGGC TGAGCAGCTG
CACGAGAATC CCAATCTTGC GACCGCCTTC AATTTTGGTC CACAACGAAG CGCCAACCGG
CCAGTGGCAG CCCTTGTCGA AGAAATCCTC AGCCATTGGC CTGGATCATG GCTTGATCAA
TCCGATCCCA AGGCCAAACA TGAAGCTGGT CGCCTAGATC TAACCACCGA TCGAGCATTC
CATCAGCTTG GTTGGAGCCC TCGTTGGAGT TTTGAGGTCA CGATTGCGGA AACGGTGAAT
TGGTATCGCC GCTTTCATGC TGGAGAAGAT GCGCGCCAAT TGTGCCTTGA ACAGATCGAT
CGCTACGTGA AGGGCTGA
 
Protein sequence
MENMVMAESW NPDFWCGRRV WLSGHTGFKG SWLALWLLHW GAVVEGYALD PDQAGGPSLF 
DSLEIAADLS RDERADLAQA DHLVTRLRAF QPEVVFHLAA QPLVTRSYRE PLQTWNTNVI
GTCHVLEALR QLDHPCVSVM ITTDKVYDNQ EWEYGYREND PLGGHDPYSS SKAAAELAIA
SWRASFCGLL PHQVPNLKLV SARAGNVIGG GDWAENRIVP DAIRALIKAE PIVVRAPEST
RPWQHVLEPL GGYLALAEQL HENPNLATAF NFGPQRSANR PVAALVEEIL SHWPGSWLDQ
SDPKAKHEAG RLDLTTDRAF HQLGWSPRWS FEVTIAETVN WYRRFHAGED ARQLCLEQID
RYVKG