Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_03941 |
Symbol | |
ID | 4780474 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | + |
Start bp | 364132 |
End bp | 365088 |
Gene Length | 957 bp |
Protein Length | 318 aa |
Translation table | 11 |
GC content | 31% |
IMG OID | 640083662 |
Product | nucleoside-diphosphate-sugar epimerases |
Protein accession | YP_001014223 |
Protein GI | 124025107 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0451] Nucleoside-diphosphate-sugar epimerases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.873071 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0436557 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCAAAT CTCCTGTAAA AAACTTGGTT ACTGGAGGGG CTGGCTTCGT TGGTTCTCAT TTGATTGATC GTTTAATGAA ATCTGGAGAA AAAGTTATAT GTTTGGATAA TTTTTTTACT GGGAGTAAAG AAAATATTGA ACACTGGATT GGACATCCAT CTTTTGAGCT TATAGATCAT GATGTTATAG AGCCAATCAA GCTTGATGTG GATAGGATTT GGCATTTAGC TTGTCCAGCA TCTCCAATTC ATTATCAATT TAACCCTATT AAAACAGCGA AAACGAGTTT TTTGGGGACT TATAATATGC TTGGATTAGC TAGGAAAGTT GGAGCTCGAA TATTATTAGC AAGTACTAGT GAAGTTTATG GAAATCCCGA AATTCATCCT CAGCCTGAAA AATATAACGG CAATGTAAAT CCTATAGGAA TTCGTAGTTG CTACGATGAG GGTAAACGTG TTGCGGAATC ATTGTGTTAT GACTATATGA GAATGCATGG TTTAGAAATA AGAATTGCTA GAATATTTAA TACCTATGGT CCTAGAATGT TATTAAATGA TGGAAGACTT ATTAGCAACT TATTAGTTCA ATCAATACAT GGAAATGACT TGACTATTTA TGGCAATGGT AAGCAAACTA GAAGCTTTTG TTTTGTTGAT GACTTAATAG ATGGTTTAAC TTTATTCATG AATTCTTTAA ATGTAGGACC TATGAATTTA GGCAATCCTG AAGAATTATC TATTCTTCAA ATAACTAACT TCATAAGAAA TATCTCAATT GAAAAAGTAA ATCTGAAATT TTTAAAAGCA CTAGATGATG ATCCTTTAAG AAGAAAGCCT GATATTTATC TTGCAAAAAA AGAATTAAAT TGGGAGCCTA AAATAATGTT TAAAGAAGGA TTAGCAATTA CAAGAAAGTA TTTTGAAAAG AAATTAATCT TTGAAAAAAG TAAATAA
|
Protein sequence | MPKSPVKNLV TGGAGFVGSH LIDRLMKSGE KVICLDNFFT GSKENIEHWI GHPSFELIDH DVIEPIKLDV DRIWHLACPA SPIHYQFNPI KTAKTSFLGT YNMLGLARKV GARILLASTS EVYGNPEIHP QPEKYNGNVN PIGIRSCYDE GKRVAESLCY DYMRMHGLEI RIARIFNTYG PRMLLNDGRL ISNLLVQSIH GNDLTIYGNG KQTRSFCFVD DLIDGLTLFM NSLNVGPMNL GNPEELSILQ ITNFIRNISI EKVNLKFLKA LDDDPLRRKP DIYLAKKELN WEPKIMFKEG LAITRKYFEK KLIFEKSK
|
| |