Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_09151 |
Symbol | |
ID | 4780514 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | + |
Start bp | 846969 |
End bp | 847769 |
Gene Length | 801 bp |
Protein Length | 266 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 640084191 |
Product | mannosyl-3-phosphoglycerate phosphatase |
Protein accession | YP_001014738 |
Protein GI | 124025622 |
COG category | [R] General function prediction only |
COG ID | [COG3769] Predicted hydrolase (HAD superfamily) |
TIGRFAM ID | [TIGR01484] HAD-superfamily hydrolase, subfamily IIB [TIGR01486] mannosyl-3-phosphoglycerate phosphatase family [TIGR02463] mannosyl-3-phosphoglycerate phosphatase-related protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.00568947 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAAAATA ATTCTAAATA TTGGATAGTT ACAGATCTTG ATGGCACTTT AATGGACGAA GATTACGATA TAACTCCTGC CAAAGAAACC TTAAAAATAT TGGCAGAATT GAATATTCCT GTAATACCTT GTACTAGTAA AACTGCTTCT GAAGTTAGAT ATTTTAGAAA AGAGAATGCA TTGACAGATC CTTTTATTGT CGAAAATGGA GCGGCTGTTT ATGGGTGTTA TGAACAAAAT TCGTCGGAAT GGGAATTGAT CTTAGGAAAA AGTTATTCTG AATTAAAAAC TATATTATTT AATATCTCTA AAAAAGTTAA CTTCCACTTA ACCCCATTAA ATGATTTAAG CAAAAATCAA ATATTTGAAC TTACGGGATT ATCTGATCAA GGTATTACTA GAGCTCTTGA TAGGTGCTGG AGTGTTCCAT TCTTAAATCC TCCTGATGAA ATTTTTGAGA ATGTAAAGTT TATTTGTGAT TTTTATAATG TGCATGTTTT TAAAGGAAAT AGAATGAGTC ATTTACTCTC CAGTGAAAGT CATAAAGGTA AAGCTGTTAA TAAATTAAAG GTTTATCTAC AGAACAATGA TGTAAAGATT ATTGCACTTG GGGATTCGCA AAATGATCTT CCTTTACTTG AATATGCCGA TATATCTATC GTTATTCCTG GCAAAAATGG CCCAAATAAG TACTTACAGA ATGGTATTGA TAATGGCTCT TTTAGATTAG CAAATGCCCC GCACGCAGAG GGGTGGGCAA ATAGTGTTCA GGATATTATT AAAGATTTTA TGGATTTATG A
|
Protein sequence | MKNNSKYWIV TDLDGTLMDE DYDITPAKET LKILAELNIP VIPCTSKTAS EVRYFRKENA LTDPFIVENG AAVYGCYEQN SSEWELILGK SYSELKTILF NISKKVNFHL TPLNDLSKNQ IFELTGLSDQ GITRALDRCW SVPFLNPPDE IFENVKFICD FYNVHVFKGN RMSHLLSSES HKGKAVNKLK VYLQNNDVKI IALGDSQNDL PLLEYADISI VIPGKNGPNK YLQNGIDNGS FRLANAPHAE GWANSVQDII KDFMDL
|
| |