Gene Information |
Locus tag | NATL1_08591 |
Symbol | |
ID | 4781273 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | - |
Start bp | 788809 |
End bp | 790689 |
Gene Length | 1881 bp |
Protein Length | 626 aa |
Translation table | 11 |
GC content | 30% |
IMG OID | 640084134 |
Product | nucleotide-diphosphate-sugar epimerase, membrane associated |
Protein accession | YP_001014682 |
Protein GI | 124025566 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1086] Predicted nucleoside-diphosphate sugar epimerases |
TIGRFAM ID | |
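The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the gene length counts the stop codon; both assumptions are consistent with the values in this record but are not stated in it. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length = gene_length(788809, 790689)
print(length, protein_length(length))  # 1881 626
```

Both results agree with the Gene Length (1881 bp) and Protein Length (626 aa) fields.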
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.000225321 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
Sequence |
Gene sequence | ATGCTTGCCA GCGCTGCTAA TAAAATTCTC TCTTTATCGT CTAAAAAGCG GCTTTCAATT CTTATATTTA TTGATATAGT CATAATTATA TTTTCGAGTA AGTTAGGACT ATATTTAACA AGTAGAGATT ACTTTAATGA TTCTTACTTT TCTATATTCA TAACACTTAT TACAATATTT ATAGGTATAA CCTGTTATAT AATAAGTGGA CAATATATTG GTATAACTAG ATATTTAAGC AGTAGAGATT TAAATCATCT TATAATTAGG AATTTTATAA TTACAGTATT AACAAGAATA ACTCTTATCT TCTTTCAAGT AGAATTACCA TCCTTAGGAT ATTTTATTTT ATTATGGATT TTATTATCAA CATCAAGTGT ATATATTAGA TTTTTTATGC GTGATTTTAT CCTAAAAATT AAGATCTCAA AAAATAAAAA TAGGAAAAAA GTTATTATTT ATGGGGCAGG TGAGGCAGGG GCTCAACTAG CATCATCCTT AATTCAGGAT GGGCGTTATT GCGTTGAGGG ATTTATCGAT GATGATTCAA GCCTTTGGCG GAGGAACATA AAAGGTATAC CCATATATCC TCCAAATAGA ATTTATGAAA ATAAAGCTCA TATTGATCAG GTTCTTTTAG CAATACCTTC CCTAAGAAAA AAAAAGCGAC TTGAAATTTT ACATACACTC TATAAAAAAG GGGTTTCGGT ACTTCAAATA CCTTCTATTG ATGAAATAAA GAGTGAAAAG AATCTTATTA CATCATTAAA ACCTGTAAAA GTTGAAGATA TTCTCGGACG AGAGCCAATT AATCCTGATA ATAATCTTTT AAACATAGCA GTGAAAGGGC AAACAGTTTG CATAACAGGT GCAGGTGGAT CTATAGGTAG TGAATTATCC AAACAAATAT ATAATTTAAA CCCCTATAAA ATGATATTAA TAGATCATAG TGAATCTCAT CTTTATAATA TAAATAAGCA AATTACTTCC TATCCTGATA ATGGTATAGA AGTTAAAGCA ATTCTAGGAA GTACAACAGA TTTACCATTT ATTAATAAAG TTTTTACTGA TAATAATGTA GATATAATTT TTCATGCTGC TGCATATAAA CATGTTCCTC TTGTTGAATC AAATCCCTTA AAAGGCTTAT TTAATAATGT TTTTTCTACT GAAATAGTTT GTAAAGCAGC ATTAGAAGCA GGAGCTAATA ATTTAGTTCT GATCTCAACA GATAAAGCTG TTCGACCCAC CAATGTAATG GGTGCCTCCA AGAGGCTTTC AGAATTAGTT GTTCAAGCGA TTGCAGAGAA ATCAAAAGAG AATTCTATTG CTAAAAAAAC ATGTTTTTCT ATGGTTCGAT TTGGGAATGT ACTTGGATCT TCTGGTTCAG TTTTACCACT TTTTCAAGAG CAAATTGATA ATGGTGGTCC AATAACTTTG ACCCATCCAA GAATAATTAG ATATTTTATG ACTATTTCGG AAGCTTCTCA ATTAGTAATT CAATCAAAGG TCCTTGCAGA GGGGGGGGAT GTATTTCATC TCGATATGGG AAAACCAGTG AGCATTAAAT CATTAGCAGA GCAATTAATA CTTTTAAATG GTTTATCTAT TAAAGATAAT AAAAATTTAG AAGGAGATAT AGAAATAAAA TTTACTGGTC TAAGACCAGG AGAAAAATTA TATGAAGAAT TGATCATAGA TGCAGAATCT AAGAAAACAA TTCATCCTCT TATCTATCGT GCAGATGAGA GATTTATTCC TTTGGATATA ATTATGCCAA CATTAGAAAT ACTACGAAGA 
TATTTAGATA ACGAGGATAA AATCAATAGT CTTTTGATTT TGAAAGAGCT TGTGCCTGAA TGGCAGACTA ATTTAATTTA A
Protein sequence | MLASAANKIL SLSSKKRLSI LIFIDIVIII FSSKLGLYLT SRDYFNDSYF SIFITLITIF IGITCYIISG QYIGITRYLS SRDLNHLIIR NFIITVLTRI TLIFFQVELP SLGYFILLWI LLSTSSVYIR FFMRDFILKI KISKNKNRKK VIIYGAGEAG AQLASSLIQD GRYCVEGFID DDSSLWRRNI KGIPIYPPNR IYENKAHIDQ VLLAIPSLRK KKRLEILHTL YKKGVSVLQI PSIDEIKSEK NLITSLKPVK VEDILGREPI NPDNNLLNIA VKGQTVCITG AGGSIGSELS KQIYNLNPYK MILIDHSESH LYNINKQITS YPDNGIEVKA ILGSTTDLPF INKVFTDNNV DIIFHAAAYK HVPLVESNPL KGLFNNVFST EIVCKAALEA GANNLVLIST DKAVRPTNVM GASKRLSELV VQAIAEKSKE NSIAKKTCFS MVRFGNVLGS SGSVLPLFQE QIDNGGPITL THPRIIRYFM TISEASQLVI QSKVLAEGGD VFHLDMGKPV SIKSLAEQLI LLNGLSIKDN KNLEGDIEIK FTGLRPGEKL YEELIIDAES KKTIHPLIYR ADERFIPLDI IMPTLEILRR YLDNEDKINS LLILKELVPE WQTNLI
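The gene and protein sequences can be checked against each other by translating the nucleotide sequence. This is a minimal sketch using only the standard library, not the method the database used. It assumes the gene sequence is listed in coding orientation (it begins with ATG and ends with TAA even though the gene lies on the minus strand) and relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code. The names `translate` and `gc_fraction` are illustrative.

```python
# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases; uppercase."""
    return "".join(seq.split()).upper()


def translate(seq: str, to_stop: bool = True) -> str:
    """Translate a coding-strand DNA sequence codon by codon.

    Raises ValueError if a codon contains a character other than A, C, G, T.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        amino_acid = AMINO_ACIDS[index]
        if amino_acid == "*" and to_stop:
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 for an empty sequence)."""
    seq = clean(seq)
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First and last blocks of the gene sequence listed above.
print(translate("ATGCTTGCCA GCGCTGCTAA TAAAATTCTC"))  # MLASAANKIL
print(translate("TGGCAGACTA ATTTAATTTA A"))           # WQTNLI
```

The two fragments reproduce the first ten and last six residues of the listed protein. Passing the full gene sequence to `translate` and `gc_fraction` would allow the complete protein sequence and the GC content field to be compared in the same way.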