Gene Information |
Locus tag | NATL1_08531 |
Symbol | galE |
ID | 4779697 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | + |
Start bp | 783035 |
End bp | 784081 |
Gene Length | 1047 bp |
Protein Length | 348 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 640084128 |
Product | UDP-glucose 4-epimerase |
Protein accession | YP_001014676 |
Protein GI | 124025560 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1087] UDP-glucose 4-epimerase |
TIGRFAM ID | [TIGR01179] UDP-glucose-4-epimerase |
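The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information table of this record.
start_bp = 783035
end_bp = 784081
reported_gene_length = 1047      # bp
reported_protein_length = 348    # aa


def gene_length(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length):
    """Residue count, assuming an unspliced CDS whose final codon is a stop."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


computed_gene_length = gene_length(start_bp, end_bp)
computed_protein_length = protein_length(computed_gene_length)

# Both comparisons should hold if the record is internally consistent.
print(computed_gene_length == reported_gene_length)
print(computed_protein_length == reported_protein_length)
```

Under those assumptions, the reported gene length and protein length agree with the start and end coordinates.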
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.264639 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00180619 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
Sequence |
Gene sequence | ATGCGGGTTC TTTTAACTGG TGGTGCAGGA TTTATAGGTT CCCATATAGC CTTGTTGTTG CTCGAAAGAG GGTACGATGT TTTGATATTA GATTCATTTG CTAATAGTTC ATCAAATGTT ATCGAGCGCA TTGAAAATTT TCTAGATAAT AAAGCCCTAA AATATAAACT AAGAGTTATA AATGGTGATA TTAGAGATAA GCAAATACTT GAGAGCATTT TCTCAAAATG TGTAAAGGAA AATAAACCGA TAGAAGTAGT TATACATCTT GCTGGTGTTA AGTCTGTATG CGAATCTTTG ACTAATCCTC TTTATTATTG GGATGTAAAT GTATCTGGAA CATTAAATTT ATTACTTACG ATGAAAGATT ATCAATGTTA CTCCCTTGTC TTTAGTAGTA GTGCAACTAT CTATGGTTTA TCTGATTATG TCCCTATCTT AGAAGAACAA AAGATTTCAC CAATTACACC CTATGGTCAA ACCAAAGTTG CTGTTGAAAA TTTGTTTTAT GATTTATATA AATCTAATGT GAATTTATGG AAAATTTGTT CCTTGCGTTA TTTTAATCCT GTTGGTGCGC ACCCTTCTGG CCTGATTGGC GAGGATCCAA GAGGAATTCC TAATAATCTG TTCCCTTTTA TAACTCAAGT AGCAATAGGA AGACAAAAGA TTTTAAACAT TTATGGAGAT GATTGGGAAA CAAAGGATGG TTCAGGGATT CGTGATTATG TTCATATTAT TGATTTAGCA GAAGGACATC TAGCCTCAAT AGATTATTTA AATACCAGTG AATCATGTTT AGAGTTTATC AATTTAGGCT CTGGTAAAGG GTATTCTGTA TTTCAGATTA TTCGGCAATT TGAGTTGTCT ACAGGGTGTA GTATTCCTTT TTCAATTGAA AGTCGAAGAG ATGGTGATGT GGCAGTCTCT TACGCAGATA TATCAAAAGC CAAGAGATTA TTAAGTTGGA CTCCTAAGAG ATCCTTAGAA CAGATTTGCC TAGATGGATG GAATTGGCAA ATTAAAAATC CAAATGGTTA TGGATAA
Protein sequence | MRVLLTGGAG FIGSHIALLL LERGYDVLIL DSFANSSSNV IERIENFLDN KALKYKLRVI NGDIRDKQIL ESIFSKCVKE NKPIEVVIHL AGVKSVCESL TNPLYYWDVN VSGTLNLLLT MKDYQCYSLV FSSSATIYGL SDYVPILEEQ KISPITPYGQ TKVAVENLFY DLYKSNVNLW KICSLRYFNP VGAHPSGLIG EDPRGIPNNL FPFITQVAIG RQKILNIYGD DWETKDGSGI RDYVHIIDLA EGHLASIDYL NTSESCLEFI NLGSGKGYSV FQIIRQFELS TGCSIPFSIE SRRDGDVAVS YADISKAKRL LSWTPKRSLE QICLDGWNWQ IKNPNGYG