Gene Information |
Locus tag | P9211_12911 |
Symbol | rfbG |
ID | 5731740 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | + |
Start bp | 1165705 |
End bp | 1166805 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 641285661 |
Product | CDP-glucose 4,6-dehydratase |
Protein accession | YP_001551176 |
Protein GI | 159903832 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0451] Nucleoside-diphosphate-sugar epimerases |
TIGRFAM ID | [TIGR02622] CDP-glucose 4,6-dehydratase |
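
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the source record: the function name `check_cds_lengths` is hypothetical, and it assumes the coordinates are 1-based and inclusive and that the unspliced CDS ends in a single stop codon that is not counted in the protein length.

```python
def check_cds_lengths(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Return True if coordinates, gene length, and protein length agree."""
    # Assumption: coordinates are 1-based and inclusive.
    span = end_bp - start_bp + 1
    # Assumption: the CDS is unspliced and ends with one stop codon
    # that does not contribute a residue to the protein.
    codons = span // 3
    return (
        span == gene_length_bp
        and span % 3 == 0
        and codons - 1 == protein_length_aa
    )


# Values copied from the record: Start bp, End bp, Gene Length, Protein Length.
print(check_cds_lengths(1165705, 1166805, 1101, 366))  # prints True
```

With the values in this record the span is 1101 bp, which is 367 codons, giving 366 residues plus the stop codon.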
| |
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.253167 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.00798919 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAATCTGA ACACAACATT CAAAAATAAA AGAGTAGTTG TAACCGGTCA TACTGGCTTT AAAGGTTCAT GGCTCTCAAT TTGGCTATTA TCCCTTGGAG CAAAAGTATA TGGAATAGCA CTAGATCCTC CTAGCTTGCC TTCTCTTTTT GATGAAGCAT TAATGAGTGA GAGAGTAGTG GATAATAGAA TTAATATAAA AGATACTAAT AAAGTTATTG AAATTATAGA CGAAGTTAAA CCTGACTTTT TATTTCATTT AGCTGCACAA CCTTTGGTAA GAGAGTCATA CCTTGACCCA GTTGAAACAT GGCACACAAA TGTGTTAGGT ACAGTTAATG TTCTAAATGC TCTTAGGGTA GTAAACCATA GATGTACAGC CATAATAATT ACAAGTGATA AGTGCTATGA CAACCAGGAA TGGTTATGGG GTTATAGAGA GTCTGATAAG TTAGGAGGAG GAGATCCTTA TAGTGCCTCT AAAGGATCTG CAGAGTTAGC ATTTAGTTCT TTTTATCGTT CTTATTTTCA TGACAATAAT AGTAAGGTAA CTATAGCTTC AGCAAGAGCA GGTAACGTTA TAGGAGGAGG AGATTGGGCC AACAATAGAA TAGTCCCAGA CTGTATAAGA GCTTGGACAC AAGAGAGAGA AGTTTCTATT AGAGCTCCAT ATTCAACAAG ACCATGGCAA CATGTGCTTG AGCCCCTAAG CGGATATCTT TGCCTTGCAC AACAATTAAC CTCAGAAAGT TTGCTTAATG GGGAGTCATT TAATTTTGGC CCAACATCTT CTCTGAACTA TTCAGTTCTA GATGTTGTGA ATGAACTAGC TAAAGGATGG TCATACAATT TATTGGATAT AGAAGAAAAG AGTGAATCAA AAGCTAATGA ATCAGCACTT CTAAAATTAA ATTGTGACAA AGCCTTGGCC CAATTAAGCT GGCAATCTAC ACTTAACTTC GAATCAACTC TACTGTATAC AAGCTCATGG TATTTAGATT ATTATTCAGA ATCAAAATCT ATAGACCCTT ATGAGCTTTG TCTTAAACAG ATTAATTCTT ACTCTGCCGA AGCCAAGTTA ACTGGTAACA AATGGATATA A
|
Protein sequence | MNLNTTFKNK RVVVTGHTGF KGSWLSIWLL SLGAKVYGIA LDPPSLPSLF DEALMSERVV DNRINIKDTN KVIEIIDEVK PDFLFHLAAQ PLVRESYLDP VETWHTNVLG TVNVLNALRV VNHRCTAIII TSDKCYDNQE WLWGYRESDK LGGGDPYSAS KGSAELAFSS FYRSYFHDNN SKVTIASARA GNVIGGGDWA NNRIVPDCIR AWTQEREVSI RAPYSTRPWQ HVLEPLSGYL CLAQQLTSES LLNGESFNFG PTSSLNYSVL DVVNELAKGW SYNLLDIEEK SESKANESAL LKLNCDKALA QLSWQSTLNF ESTLLYTSSW YLDYYSESKS IDPYELCLKQ INSYSAEAKL TGNKWI
|
| |
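
The GC content and protein sequence fields can in principle be recomputed from the gene sequence. The sketch below is illustrative only and is not the database's own method: the helper names (`clean`, `gc_percent`, `translate`) are hypothetical, and the sequence is assumed to contain only A, C, G, and T, written in space-separated groups as in the record. Translation table 11 assigns the same amino acids to codons as the standard code and differs in its permitted start codons, so the sketch uses the standard assignments and does not handle alternative start codons (this gene starts with ATG).

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the whitespace between 10-base groups and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three groups of the gene sequence from the record.
prefix = "ATGAATCTGA ACACAACATT CAAAAATAAA"
print(translate(prefix))  # prints MNLNTTFKNK
```

Passing the full gene sequence to `gc_percent` and `translate` gives values that can be compared with the GC content and Protein sequence fields.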