Gene NATL1_04151 details

Gene Information

Locus tag: NATL1_04151
Symbol:
ID: 4780065
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 379882
End bp: 381141
Gene Length: 1260 bp
Protein Length: 419 aa
Translation table: 11
GC content: 33%
IMG OID: 640083685
Product: HD superfamily phosphohydrolase
Protein accession: YP_001014244
Protein GI: 124025128
COG category: [R] General function prediction only
COG ID: [COG1078] HD superfamily phosphohydrolases
TIGRFAM ID:
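
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length counts the terminating stop codon; neither convention is stated in the record itself. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Start bp and End bp fields of this record.
length = gene_length(379882, 381141)
print(length)                  # 1260
print(protein_length(length))  # 419
```

Both results agree with the Gene Length (1260 bp) and Protein Length (419 aa) fields.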


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.761915
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.477621
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGTCAA GAACTTATTA CGACCCTCTT CATCAATCCA TTACATTAAA TAGTTCGATT 
CCAGAGGAAA AAATGGTAAT GGAATTAATA GATTCCTCCC CATTTCAAAG GTTAAGAAGA
ATCAAACAGT TAGGGCCTGC TTACTTGACA TTTCATGGTG CAGAATCAAG CAGGTTCACT
CATTCTCTTG GTGTATTTCA TCTGGCAAGA AGAGCAATCA ATCATTTATT AAATGTTGAT
TCTGGATTAA AAGAGCATAA ATTCACTATT TATGGTGCTG CGCTTTTACA TGATCTAGGT
CATGGACCAC TCAGTCACAC AAGTGAAGAA ATTTTCAAGA TAAAACATGA ATCTTGGACC
GCTAAATTAA TAAATTCAAG TAAAGAAATA ACTACGATAC TTAATAAATA CGGTAAAGGA
AATGCAAAAG CAATTTCAGA TTTAATTCAA TCGAGAAAAG CAGAAAAAAA ATCAATAATT
TCGTTAATTA GTAGTCAGCT AGACTGTGAT CGACTTGATT ATTTAATGAG AGATAGTTAT
ACAACTGGGG CAAGATATGG TCAGTTAGAT ATAGACCGGA TAATTTCAGC AATGACAATT
TCACCTGATG GAGATTTAGC AATACACCCA AAAGGATTAA TGGCAGTTGA GCATTATTTA
GTTATAAGGA ATCTAATGTA TAGAAGCGTA TACAATCATC GATTAAATGA AGTTTGCAAT
TGGTTATTAG AGCAAATTAT AAAAACAGCA AGAAAAATTG GCCCACAAAG CTTATGGGCA
GATAAAAGCA TGTCTGAATG GCTTTGGAAT CATGAAAAAA TGAGCCTAGA AAGTTTTTTA
TCTAATGATG ACATAGTAAC TGGATATCAT ATCCATAGAT GGCAAGACTG TAGTTCCAAC
AATCTATCTA ATCTCTGTAA GCGCTTTATA CATAGAAATT TATTAAAAGC ACTAAATCTA
TCCTCCTTTA CTTTAGAGAC GAGGCTAGAA TCTCTAGCAA AAGCTAGAAA ACTATCTGAA
AAATATTGCC TTGAGCCAGA TATATCCTGT GGATTAAGAG AACAAGTAGT AAAAAGTTAC
CATCCTTATA AACATGGACT TCGTTTATGG GACGGAGATA AGTTACAAGC TCTAGAGAGA
GTTTCACCAT TAGTTGATAG ATTAATTCAG CCTAATCAAT CTTCATGGTT GATTTATCCA
AAAGAAATAG AAATTGAATT AAAAATAGAA ATTGAGAAAC TAAAAATGAA ATATAATTAG
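
The gene sequence is laid out in space-separated blocks of 10 bases, so the spacing has to be removed before any computation. The following is a minimal sketch of how the GC content field could be recomputed from such a listing. The helper names are hypothetical, and the rounding used by the database for its reported 33% is not known.

```python
def clean_sequence(raw: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a sequence given in block layout."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence as listed above: six blocks of 10 bases.
first_row = "ATGTCGTCAA GAACTTATTA CGACCCTCTT CATCAATCCA TTACATTAAA TAGTTCGATT"
print(len(clean_sequence(first_row)))  # 60
```

Passing the full listing (all rows joined with newlines) to `gc_percent` gives the value to compare against the GC content field.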
 
Protein sequence
MSSRTYYDPL HQSITLNSSI PEEKMVMELI DSSPFQRLRR IKQLGPAYLT FHGAESSRFT 
HSLGVFHLAR RAINHLLNVD SGLKEHKFTI YGAALLHDLG HGPLSHTSEE IFKIKHESWT
AKLINSSKEI TTILNKYGKG NAKAISDLIQ SRKAEKKSII SLISSQLDCD RLDYLMRDSY
TTGARYGQLD IDRIISAMTI SPDGDLAIHP KGLMAVEHYL VIRNLMYRSV YNHRLNEVCN
WLLEQIIKTA RKIGPQSLWA DKSMSEWLWN HEKMSLESFL SNDDIVTGYH IHRWQDCSSN
NLSNLCKRFI HRNLLKALNL SSFTLETRLE SLAKARKLSE KYCLEPDISC GLREQVVKSY
HPYKHGLRLW DGDKLQALER VSPLVDRLIQ PNQSSWLIYP KEIEIELKIE IEKLKMKYN
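
The protein sequence corresponds to the gene sequence translated with translation table 11. That table assigns the same amino acids to codons as the standard genetic code and differs from it only in the start codons it permits, so a standard codon table is enough for a sketch. The code below is an illustrative, dependency-free translation. The function name is hypothetical, and this is not necessarily how the database produced its protein sequence.

```python
from itertools import product

# Standard codon-to-amino-acid assignments, listed in T, C, A, G order
# for each codon position; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence listed above.
print(translate("ATGTCGTCAA GAACTTATTA CGACCCTCTT"))  # MSSRTYYDPL
```

The output matches the first block of the protein sequence. The final codons of the gene (`ATG AAA TAT AAT TAG`) translate to `MKYN` followed by a stop, which matches the protein's C-terminus.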