Gene Information |
Locus tag | NATL1_04151 |
Symbol | |
ID | 4780065 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | - |
Start bp | 379882 |
End bp | 381141 |
Gene Length | 1260 bp |
Protein Length | 419 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 640083685 |
Product | HD superfamily phosphohydrolase |
Protein accession | YP_001014244 |
Protein GI | 124025128 |
COG category | [R] General function prediction only |
COG ID | [COG1078] HD superfamily phosphohydrolases |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.761915 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.477621 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGTCAA GAACTTATTA CGACCCTCTT CATCAATCCA TTACATTAAA TAGTTCGATT CCAGAGGAAA AAATGGTAAT GGAATTAATA GATTCCTCCC CATTTCAAAG GTTAAGAAGA ATCAAACAGT TAGGGCCTGC TTACTTGACA TTTCATGGTG CAGAATCAAG CAGGTTCACT CATTCTCTTG GTGTATTTCA TCTGGCAAGA AGAGCAATCA ATCATTTATT AAATGTTGAT TCTGGATTAA AAGAGCATAA ATTCACTATT TATGGTGCTG CGCTTTTACA TGATCTAGGT CATGGACCAC TCAGTCACAC AAGTGAAGAA ATTTTCAAGA TAAAACATGA ATCTTGGACC GCTAAATTAA TAAATTCAAG TAAAGAAATA ACTACGATAC TTAATAAATA CGGTAAAGGA AATGCAAAAG CAATTTCAGA TTTAATTCAA TCGAGAAAAG CAGAAAAAAA ATCAATAATT TCGTTAATTA GTAGTCAGCT AGACTGTGAT CGACTTGATT ATTTAATGAG AGATAGTTAT ACAACTGGGG CAAGATATGG TCAGTTAGAT ATAGACCGGA TAATTTCAGC AATGACAATT TCACCTGATG GAGATTTAGC AATACACCCA AAAGGATTAA TGGCAGTTGA GCATTATTTA GTTATAAGGA ATCTAATGTA TAGAAGCGTA TACAATCATC GATTAAATGA AGTTTGCAAT TGGTTATTAG AGCAAATTAT AAAAACAGCA AGAAAAATTG GCCCACAAAG CTTATGGGCA GATAAAAGCA TGTCTGAATG GCTTTGGAAT CATGAAAAAA TGAGCCTAGA AAGTTTTTTA TCTAATGATG ACATAGTAAC TGGATATCAT ATCCATAGAT GGCAAGACTG TAGTTCCAAC AATCTATCTA ATCTCTGTAA GCGCTTTATA CATAGAAATT TATTAAAAGC ACTAAATCTA TCCTCCTTTA CTTTAGAGAC GAGGCTAGAA TCTCTAGCAA AAGCTAGAAA ACTATCTGAA AAATATTGCC TTGAGCCAGA TATATCCTGT GGATTAAGAG AACAAGTAGT AAAAAGTTAC CATCCTTATA AACATGGACT TCGTTTATGG GACGGAGATA AGTTACAAGC TCTAGAGAGA GTTTCACCAT TAGTTGATAG ATTAATTCAG CCTAATCAAT CTTCATGGTT GATTTATCCA AAAGAAATAG AAATTGAATT AAAAATAGAA ATTGAGAAAC TAAAAATGAA ATATAATTAG
|
Protein sequence | MSSRTYYDPL HQSITLNSSI PEEKMVMELI DSSPFQRLRR IKQLGPAYLT FHGAESSRFT HSLGVFHLAR RAINHLLNVD SGLKEHKFTI YGAALLHDLG HGPLSHTSEE IFKIKHESWT AKLINSSKEI TTILNKYGKG NAKAISDLIQ SRKAEKKSII SLISSQLDCD RLDYLMRDSY TTGARYGQLD IDRIISAMTI SPDGDLAIHP KGLMAVEHYL VIRNLMYRSV YNHRLNEVCN WLLEQIIKTA RKIGPQSLWA DKSMSEWLWN HEKMSLESFL SNDDIVTGYH IHRWQDCSSN NLSNLCKRFI HRNLLKALNL SSFTLETRLE SLAKARKLSE KYCLEPDISC GLREQVVKSY HPYKHGLRLW DGDKLQALER VSPLVDRLIQ PNQSSWLIYP KEIEIELKIE IEKLKMKYN
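The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and not part of the database; the function names are hypothetical. It assumes that Start bp and End bp are inclusive coordinates and that Gene Length includes the stop codon, which is consistent with the values shown above (379882, 381141, 1260 bp, 419 aa). The GC helper accepts sequence text grouped in blocks of 10 bases, as displayed in the Sequence table.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Amino acid count for an unspliced CDS, assuming the gene length
    includes one terminal stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(grouped_seq: str) -> float:
    """GC percentage of a nucleotide string; whitespace between
    10-base blocks is ignored."""
    seq = "".join(grouped_seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


if __name__ == "__main__":
    length = gene_length(379882, 381141)
    print(length)                           # 1260
    print(expected_protein_length(length))  # 419
    # First two 10-base blocks of the gene sequence, as a small demo only.
    print(gc_percent("ATGTCGTCAA GAACTTATTA"))
```

Passing the full Gene sequence field to `gc_percent` gives a value that can be compared with the GC content field in the record.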