Gene Information |
Locus tag | P9303_25441 |
Symbol | galE |
ID | 4776228 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 2236629 |
End bp | 2237675 |
Gene Length | 1047 bp |
Protein Length | 348 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640088065 |
Product | UDP-glucose-4-epimerase |
Protein accession | YP_001018540 |
Protein GI | 124024233 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1087] UDP-glucose 4-epimerase |
TIGRFAM ID | [TIGR01179] UDP-glucose-4-epimerase |
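
The coordinate and length fields in this table can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes one stop codon; the function and variable names are illustrative and not part of the record.

```python
# Values copied from the record above; all names here are illustrative.
start_bp = 2236629
end_bp = 2237675


def gene_length(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp):
    """Residues encoded by a CDS, assuming its length includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1047 348
```

Under those assumptions the result agrees with the Gene Length (1047 bp) and Protein Length (348 aa) fields.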
| |
Plasmid Coverage Information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage Information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.8826 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGAGT TGTTGATCAC GGGTGGTGCC GGGTTCATTG GCAGTCACAC CTGCCTTGTG CTTCTGGAAG CCGGCCACAG GCTGGTGATA CTCGACAACT TTTCCAATAG CTCAGCTATT GCCTCAAAAA GGGTAGCCGA ACTTGCTGGA GTGGCCGCTC AAGAGCGAAT GCTCGTATTG GAGGGCGATA TCCGCAGTAG CAACGATCTA GATCGCGCTT TCAACTCAAT GGAAAACGGC ATCGCGGCTG TGGTGCACTT CGCCGGGCTC AAGGCAGTTC ATGAATCAGT TCAGTTACCA CTGAAATACT GGGATGTGAA TGTGGCTGGC AGCCGCTGCC TACTGGAAGC CATGCAACGG CATAACTGCC GCACGATTGT TTTCAGCAGC AGTGCAACGC TTTATGGCTA TCCCGAGCAA ATCCCAATCC CTGAAACGAC TAGGGTGCAA CCGATCAATC CATATGGTCA AAGCAAGGCA GCGGTGGAGC AGTTGCTAGA TGACCTGGCC TGCAGCGAAC CTGGTTGGCG CATTGCCCGA TTGCGCTATT TCAATCCAGT TGGAGCCCAC CCCAGCGGCT GCATCGGCGA AGATCCCAAG GGAACACCCA ACAACCTTTT CCCTTTTGTG AGCCAAGTAG CAGTAGGCCG GCGAGCGGAA CTCCAAGTAT TTGGAGCCGA TTGGCCTACA CCCGATGGGA GTGCTGTGCG CGACTACATC CATGTGATGG ACTTAGCTGA GGGACACCGA GCTGCACTGG AGGTCCTGCA ACGAGAGGAA CCGCAACTCC TCACTCTTAA TCTCGGCAGT GGCAAAGGCC ACTCGGTGTT GGAAGTTGTG CAGGCCTTTG AAAAGGCAAG CGGCCAACCC GTTCCATACA GCATTAACCA GCGCCGCGCT GGAGATGCCG CTTGTAGCGT CGCAGATCCA AGCCTGGCCG CCGAGCGGTT GGGATGGTCC ACGCAGCGCA GCCTGTCAGA CATGTGCCGC GACAGTTGGA ATTGGCAGAA GGCCAATCCA CAGGGCTATA GCCAGAAACA ACAATGA
Protein sequence | MAELLITGGA GFIGSHTCLV LLEAGHRLVI LDNFSNSSAI ASKRVAELAG VAAQERMLVL EGDIRSSNDL DRAFNSMENG IAAVVHFAGL KAVHESVQLP LKYWDVNVAG SRCLLEAMQR HNCRTIVFSS SATLYGYPEQ IPIPETTRVQ PINPYGQSKA AVEQLLDDLA CSEPGWRIAR LRYFNPVGAH PSGCIGEDPK GTPNNLFPFV SQVAVGRRAE LQVFGADWPT PDGSAVRDYI HVMDLAEGHR AALEVLQREE PQLLTLNLGS GKGHSVLEVV QAFEKASGQP VPYSINQRRA GDAACSVADP SLAAERLGWS TQRSLSDMCR DSWNWQKANP QGYSQKQQ
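
The relationship between the two sequences above can be illustrated with a minimal sketch that strips the 10-base grouping, computes the GC fraction, and translates codons up to the first stop. It is written under some assumptions: the codon assignments used are those of the standard genetic code, which translation table 11 shares (the two differ in which start codons are permitted), and the helper names `clean`, `gc_fraction` and `translate` are hypothetical, not taken from any database tool. Only the first 30 bases of the gene are used here to keep the example short.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 is assumed to share them.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the whitespace used to group bases and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate complete codons from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups of the gene sequence in the record.
prefix = "ATGGCCGAGT TGTTGATCAC GGGTGGTGCC"
print(translate(prefix))  # MAELLITGGA
print(gc_fraction(prefix))
```

The translated prefix matches the first ten residues of the protein sequence. Passing the full gene sequence to the same functions should allow the GC content and protein fields to be checked in the same way.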