Gene Information |
Locus tag | NATL1_08731 |
Symbol | |
ID | 4779752 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | + |
Start bp | 810945 |
End bp | 812066 |
Gene Length | 1122 bp |
Protein Length | 373 aa |
Translation table | 11 |
GC content | 31% |
IMG OID | 640084148 |
Product | UDP-N-acetylglucosamine 2-epimerase |
Protein accession | YP_001014696 |
Protein GI | 124025580 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0381] UDP-N-acetylglucosamine 2-epimerase |
TIGRFAM ID | [TIGR03568] UDP-N-acetyl-D-glucosamine 2-epimerase, UDP-hydrolysing |
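The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only a sanity check, not part of the record. It assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; the variable names are illustrative.

```python
# Values copied from the Gene Information fields above.
start_bp = 810945
end_bp = 812066
gene_length_bp = 1122
protein_length_aa = 373

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the gene length includes the stop codon, so a CDS of N codons
# encodes N - 1 amino acids.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_aa = codons - 1

coordinates_consistent = span_bp == gene_length_bp
in_frame = remainder == 0
protein_consistent = expected_protein_aa == protein_length_aa

print(coordinates_consistent, in_frame, protein_consistent)
```

Under these assumptions all three checks hold for this record: the span is 1122 bp, which is 374 codons, giving 373 residues once the stop codon is excluded.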
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.572404 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 1 |
Fosmid unclonability p-value | 0.00000175685 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGATAAAA AAGTTTTATT ATTTGTGACT GGGACGAGAG CTGATTTTGG TAAGATGGAG CCATTGGCAC GAGAAGCATT CAATAATGGA TTCAAAGTTA TTTTCTTCGT AACAGGTATG CATATGATGA GAGAATATGG TCTAACAAAA GAAGAAGTTC ATAAAAATAA AGATATACAA ATTTTCGAAT TCAGCAATCA AAAATATGGT GATAAATTAG ACACTATATT ATCTAATACT GTGAGAGGGT TTTCTAATTA TGTTAAAGAA ATTAATCCTG ATATAGTTAT CATTCATGGA GACAGAATAG AGGCAATAGC ATGCAGCTTA GTATGTTCAA CTAATAATAT AATAAGTGCA CATATTGAGG GTGGAGAAGT ATCAGGAACC ATTGATGAGG TTTTTAGACA TTGCAATACC AAGCTTTGTA CCTTTCACTT GGTTAGCTCT AATGAAGCAA AAAAGAGAGT AAGGCAAATG GGCGAGCCAG AAAAAAATAT TTTTGTTATT GGTTCACCAG AATTAGATAT CCACGGAAGG AAGTCTGGAG TAGATTTATT GCAAGTAAAA GAGAGATATA AAATTGATTT TAAGGAATAC GGAATATGCA TCTTTCATCC TGTAACAACT GAAGAAAATC AAATAAAAAT ACAAGCAGAG AATCTATTCA AATCATTAAG TATTAGTAAT AGAAACTTTG TAATTATTTT ACCCAATAAT GATCCAGGAT CAATTTATAT ATGTAATGAG ATTGATAAAC TAAATAGCAA CAACTTTCGA ATAATACCAT CAATGAGATT CAATTATTTT TCAGAGTTAA TGAAAAACTC ATCCCTAATA ATAGGTAACT CGAGTTTAGG TGTAAGAGAA GCTCCATTTC TTGGAATCAT GTCGATAAAC ATAGGAACTA GGCAAAATAA AAGAGCTTTA ACGCAATCGA TATATAATTG CAGTGGTCAA TCAATCCCTG AAATTGTCGA CGCCATAGGA AAATTTTGGA ATAAAAAAAC CACAAGTCAT AAAGGCTTTG GGAGTGGCAA CTCGAGAAAA AAGTTTTTAA AATTCATCAA TTCTGATAAG ATATGGAATC AAAGTACTCA AAAAAGCTTT GAAGAGCTTT AA
|
Protein sequence | MDKKVLLFVT GTRADFGKME PLAREAFNNG FKVIFFVTGM HMMREYGLTK EEVHKNKDIQ IFEFSNQKYG DKLDTILSNT VRGFSNYVKE INPDIVIIHG DRIEAIACSL VCSTNNIISA HIEGGEVSGT IDEVFRHCNT KLCTFHLVSS NEAKKRVRQM GEPEKNIFVI GSPELDIHGR KSGVDLLQVK ERYKIDFKEY GICIFHPVTT EENQIKIQAE NLFKSLSISN RNFVIILPNN DPGSIYICNE IDKLNSNNFR IIPSMRFNYF SELMKNSSLI IGNSSLGVRE APFLGIMSIN IGTRQNKRAL TQSIYNCSGQ SIPEIVDAIG KFWNKKTTSH KGFGSGNSRK KFLKFINSDK IWNQSTQKSF EEL