Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_01401 |
Symbol | rfbG |
ID | 4776573 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 153676 |
End bp | 154773 |
Gene Length | 1098 bp |
Protein Length | 365 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640085639 |
Product | CDP-glucose 4,6-dehydratase |
Protein accession | YP_001016160 |
Protein GI | 124021853 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0451] Nucleoside-diphosphate-sugar epimerases |
TIGRFAM ID | [TIGR02622] CDP-glucose 4,6-dehydratase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAAACA TGGTGATGGC TGAGAGCTGG AATCCAGATT TTTGGTGTGG CAGGCGCGTC TGGCTCAGTG GCCACACAGG CTTTAAAGGC AGCTGGTTGG CTCTTTGGTT GCTGCATTGG GGAGCAGTTG TGGAAGGCTA TGCCCTTGAT CCAGACCAGG CTGGTGGACC CTCCTTATTC GACAGCCTTG AGATCGCGGC CGATCTGTCA CGTGATGAGC GCGCTGATTT GGCGCAGGCT GATCATCTAG TCACGCGGCT GCGCGCCTTT CAGCCCGAGG TGGTGTTTCA TCTTGCTGCC CAGCCGCTTG TGACCAGAAG TTACCGAGAA CCTTTGCAGA CGTGGAATAC CAACGTGATT GGTACCTGCC ATGTGCTCGA GGCTTTGCGA CAATTGGATC ATCCCTGTGT GTCGGTGATG ATCACCACCG ACAAGGTCTA CGACAATCAA GAGTGGGAGT ATGGCTATCG GGAGAATGAT CCCCTCGGAG GTCATGATCC TTACAGCAGT AGTAAGGCAG CAGCAGAACT TGCGATTGCG AGTTGGCGAG CAAGTTTTTG TGGTCTCTTG CCCCATCAAG TGCCAAATTT GAAGCTGGTT TCAGCGCGTG CAGGCAATGT GATTGGCGGT GGCGATTGGG CTGAAAATCG CATTGTTCCT GATGCCATAC GTGCTCTGAT CAAGGCAGAG CCAATTGTCG TGCGTGCGCC TGAATCGACA AGGCCGTGGC AGCATGTGCT TGAGCCTCTG GGTGGCTATC TCGCCTTGGC TGAGCAGCTG CACGAGAATC CCAATCTTGC GACCGCCTTC AATTTTGGTC CACAACGAAG CGCCAACCGG CCAGTGGCAG CCCTTGTCGA AGAAATCCTC AGCCATTGGC CTGGATCATG GCTTGATCAA TCCGATCCCA AGGCCAAACA TGAAGCTGGT CGCCTAGATC TAACCACCGA TCGAGCATTC CATCAGCTTG GTTGGAGCCC TCGTTGGAGT TTTGAGGTCA CGATTGCGGA AACGGTGAAT TGGTATCGCC GCTTTCATGC TGGAGAAGAT GCGCGCCAAT TGTGCCTTGA ACAGATCGAT CGCTACGTGA AGGGCTGA
|
Protein sequence | MENMVMAESW NPDFWCGRRV WLSGHTGFKG SWLALWLLHW GAVVEGYALD PDQAGGPSLF DSLEIAADLS RDERADLAQA DHLVTRLRAF QPEVVFHLAA QPLVTRSYRE PLQTWNTNVI GTCHVLEALR QLDHPCVSVM ITTDKVYDNQ EWEYGYREND PLGGHDPYSS SKAAAELAIA SWRASFCGLL PHQVPNLKLV SARAGNVIGG GDWAENRIVP DAIRALIKAE PIVVRAPEST RPWQHVLEPL GGYLALAEQL HENPNLATAF NFGPQRSANR PVAALVEEIL SHWPGSWLDQ SDPKAKHEAG RLDLTTDRAF HQLGWSPRWS FEVTIAETVN WYRRFHAGED ARQLCLEQID RYVKG
|
| |