Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | A9601_14131 |
Symbol | |
ID | 4718134 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. AS9601 |
Kingdom | Bacteria |
Replicon accession | NC_008816 |
Strand | - |
Start bp | 1184809 |
End bp | 1186371 |
Gene Length | 1563 bp |
Protein Length | 520 aa |
Translation table | 11 |
GC content | 29% |
IMG OID | 640079134 |
Product | nucleotide-diphosphate-sugar epimerase, membrane-associated protein |
Protein accession | YP_001009804 |
Protein GI | 123968946 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1086] Predicted nucleoside-diphosphate sugar epimerases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCAGCTT TAACAATATG GATATTGACT GCTGCATTAG TTTCAAGTTT TTCATTTATT GTTCGACTAT TTTTAAAAGA CGTAATATTT TTCCTTAAAA ATAAACTTAA TAATAAACAA AAAAATATTG TTATTTATGG AGCTGGTGAT GCTGGGAATC AACTTGCGAA TGCATTGTGC TTGAGTCAAA AATATAAAAT TATAAGTTTT ATAGACGATT CTTCTAATCT TCAAGGTAGG ACAATAGGAG GAATTCCAAT AAAAAGTCCA AATTATTTAA ATTTTCAAAA TTCAAAAGTT GATAAAGTTC TTTTGGCAAT ACCTTCTTTG ACAAAAGAGA GAAAAAAAAC TTTATTAGAA AATTTAGAAA AAAAATCTAT AGGTGTATTA CAAATCCCAT CTATTGATGA ATTAACTAGC GGCTCGGCAC AAATTGACAC ATTGCGACCA GTTTCTCCTG AGGATTTATT AAGCAGAGAT ATTGCAACTT ATGAAGATAA TAATCTAGAG GAATTAATAA AAAATAAAGT TGTTTGTATT TCGGGAGCAG GCGGATCTAT TGGATCTGAA TTATGTAGAC AGATCATTAA ACTTAAACCC AAAAAATTAA TACTTATTGA AATGAATGAG CATAGTCTTT ATAAAATTAA CTATGAATTG ACTCAAAAAG AAATTTACGA GATTGAAATT ATTCCAATAC TGGAAAATGC ATCAAACTAT AAATCTCTCA ATCTCCTATT TAAACAAATT AAAATTAATA TCTTATTTCA CGCAGCTGCT TATAAACACG TACCCTTAGT AGAAATGAAT CCAATGTCAG GTCTGGCTAA TAATTTTTTA TCAACAAGTA ATTTATGTAA ATTAGCATTA GAAAATTCAA TAGAGAGAAT AATTTTAATT TCATCTGATA AAGCAGTAAG GCCTACTAAC TTAATGGGCG TTTCAAAACG ATTGTCAGAA TTAATTTTTC AGGCATATTC CAAAATTGAT AATAAAAAAG ATGTTAATAA AAAGACTATT TTTGCGATGG TTAGGTTTGG TAATGTACTT GGTTCTTCGG GATCAGTAGT GCCATTATTT AATAAACAAA TTACTAAAGG AGGGCCTATA ACTTTAACTC ATCCAGATGT AATAAGATTT TTTATGACTA TTTCAGAATC AGTACAATTA GTTCTACAAG CCGCCTTATT AGCTAATGGG GGAGACTTAT TCATACTTGA TATGGGAAAA CCTGTAAAAA TATATGATCT TGCCATGAAA ATGATTAATT TAAGAGGATT AAAAATTAAA AATAAAGAGA ATCCTGATGG TGATATTGAG ATTATTTTTA CAGGTCTAAG GCCAGGGGAA AAACTTTTTG AGGAATTATT AATTGATGCC GATACTGAAT CGACTATAAA TCCATATATT CTTCGAGCAC AAGAAAAATT TATTATGCCA GAAAATCTTT TCCCTAGATT AGAAAAATTG GAATATCTTA TTGACTCAAG GGATTCAAAA GAAGTTTGGA ATCTCTTGAA TGAAATTGTT CCTGAATGGA TTAGAAGTAA AGAACTAAAT TAA
|
Protein sequence | MPALTIWILT AALVSSFSFI VRLFLKDVIF FLKNKLNNKQ KNIVIYGAGD AGNQLANALC LSQKYKIISF IDDSSNLQGR TIGGIPIKSP NYLNFQNSKV DKVLLAIPSL TKERKKTLLE NLEKKSIGVL QIPSIDELTS GSAQIDTLRP VSPEDLLSRD IATYEDNNLE ELIKNKVVCI SGAGGSIGSE LCRQIIKLKP KKLILIEMNE HSLYKINYEL TQKEIYEIEI IPILENASNY KSLNLLFKQI KINILFHAAA YKHVPLVEMN PMSGLANNFL STSNLCKLAL ENSIERIILI SSDKAVRPTN LMGVSKRLSE LIFQAYSKID NKKDVNKKTI FAMVRFGNVL GSSGSVVPLF NKQITKGGPI TLTHPDVIRF FMTISESVQL VLQAALLANG GDLFILDMGK PVKIYDLAMK MINLRGLKIK NKENPDGDIE IIFTGLRPGE KLFEELLIDA DTESTINPYI LRAQEKFIMP ENLFPRLEKL EYLIDSRDSK EVWNLLNEIV PEWIRSKELN
|
| |