Gene NATL1_01331 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_01331 
Symboldap2 
ID4780955 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp128762 
End bp130696 
Gene Length1935 bp 
Protein Length644 aa 
Translation table11 
GC content35% 
IMG OID640083397 
Productesterase/lipase/thioesterase family protein 
Protein accessionYP_001013962 
Protein GI124024846 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.180596 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAACA AAACAATGAG AATCTTGGAT GCTGAAAAAG TTTATGGAGA AGCCCCCATA 
TTTAAAGAGC CTCGTATAAT AGGTGATTGG ATTTTATGGT TAGAACAAAG ACCAAACGAA
AAGGGAAGAA CTACAGCTTT AATCAGACCT TGGGGACAAA AAGACGTATT ACCTCAGGAG
TTAACACCTT ATCCAAGTGA TTTAAGGACA AAAATTCATG GATATGGTGG CGCTCCGCTA
ACAGCTACTC TCGATGGATC TGATCTTATA TTGACTTGGG TTGACAATAA AGACAACCGC
TTATGGATGA GAACTTGGTT TTACGAGGAA GAAAAAGAAA AATCTTTTTC TTTTAAATTC
ATACCTAAAA TAGAATCAAT TTGTCTTACA AAAAAACATA GCTATTTTCT TGCAGGTGGC
GTGATTGATC TTGAAAAAAA TATTTGGATT GGTTTGATGG AGGATGAAGA AGGGGATCAT
ATAGTTTCTT ACTCTCTAAA CAAATCTGAA CAATATCCAA AAATTATTTA TTCATCTCAG
GGATTATTAG GTTATCTTGC TCTCAATTCT AAAGATAGAA AATTAGCATG GGTCGAATGG
AAAAATACTT CAATGCCTTG GGATTTAAAT GAATTAAAAC TTGCTAAATT AGGTGAGAAA
GAAAATATAA TTAATGTAGT AACTGTGAAT AATGAATATT TAAAATGCAC AGAAAAAATA
TCATTTTTTA ATCCTATTTG GTCCGATACA GGTGATCTTT TTGTCGCTGA AGATAGTAGT
GGCTGGTGGA ATATAACGCA GATAAAAACT GACTTAAATA ATAATTCAAT TACTATTTTC
CAGAATCAAT GGACTATTAA GGCTGAAATT GCTTTCCCAC AATGGGTCCT CGGGATGTCG
AGCTTTTCAT GTGTGGGGGA TAATGTCGTT GGGGCTTTTG CTCAGGAAGG AATTTGGACT
TTAGCTCTAT TTCAAAAAGA TGGATCTATC AAGACTTTTG ATCAGACTTT TATTGAATTC
TCAGGTATTC ATTCGCATCA AAATCGACTT GTTGCAATTG CCAGTAGTGC AGAAATTACT
GAAGGGATTT TTGAAATAGA TTTATTGAAT CAAAGTTGGG AACATACTCC TGCCTCTTCA
TTTAGCTTGG ATCCAAAGGA AATAAGTATT GGCGAATCTT TTTGGTTTAT TGGATCGAAT
GAAGAGAAAG TACATGCTTG GTATTACCCT CCTCTGAATA AACAAATATT GTTACCTCCT
TTGTTGGTGA AAAGTCATAG CGGACCTACT GGTATGGCTC GTTGTGGATT GGATCTTGAG
GTGCAATTTT GGACATCAAG AGGTTGGGCG GTCGTAGACG TTAATTATGG AGGCTCTTCT
GGTTTTGGTA GGGAATATAG AGATCGATTA AGAGGTAATT GGGGAGTAAT CGATGTTATG
GATTGCACTA AGGCAGCTCA GTCTTTGATT GCATCTGGTA AGGCTGACAA GGACCGTATA
GCAATTATGG GGAGCAGCGC ATCGGGTTTT ACAGCTTTAG GTTGTTTGAT ATCTTCTGAC
ATTTTTAATA TTGGTGCATG TAAATATGCT GTGACTGATT TGATTGGTAT GGCTAATTCA
ACGCATAGGT TTGAGGAATT TTATTTAGAT TATTTAATAG GAAACATAGA AACTGATTAT
GAGAAATATC TGAAAAGATC GCCAATTGAA AATGTCAATT TTATGAATAT GCCATTGATT
TTGTTTCATG GTTTAAAAGA TAAAGTTATA CCCTCTGATC AATCTATTGC GATTAAAGAT
GAATTGTTAA AGCGTGAAAT TCCTGTGCAA ATCAATTTAT TTGAGAACGA AGGTCATGGA
TTTAAAGACG GTAAAATCAA AGTTGATGTA TTAAACAAAA CAGAGGCTTT TTTTAGACAA
TATCTAAATA TTTAA
 
Protein sequence
MKNKTMRILD AEKVYGEAPI FKEPRIIGDW ILWLEQRPNE KGRTTALIRP WGQKDVLPQE 
LTPYPSDLRT KIHGYGGAPL TATLDGSDLI LTWVDNKDNR LWMRTWFYEE EKEKSFSFKF
IPKIESICLT KKHSYFLAGG VIDLEKNIWI GLMEDEEGDH IVSYSLNKSE QYPKIIYSSQ
GLLGYLALNS KDRKLAWVEW KNTSMPWDLN ELKLAKLGEK ENIINVVTVN NEYLKCTEKI
SFFNPIWSDT GDLFVAEDSS GWWNITQIKT DLNNNSITIF QNQWTIKAEI AFPQWVLGMS
SFSCVGDNVV GAFAQEGIWT LALFQKDGSI KTFDQTFIEF SGIHSHQNRL VAIASSAEIT
EGIFEIDLLN QSWEHTPASS FSLDPKEISI GESFWFIGSN EEKVHAWYYP PLNKQILLPP
LLVKSHSGPT GMARCGLDLE VQFWTSRGWA VVDVNYGGSS GFGREYRDRL RGNWGVIDVM
DCTKAAQSLI ASGKADKDRI AIMGSSASGF TALGCLISSD IFNIGACKYA VTDLIGMANS
THRFEEFYLD YLIGNIETDY EKYLKRSPIE NVNFMNMPLI LFHGLKDKVI PSDQSIAIKD
ELLKREIPVQ INLFENEGHG FKDGKIKVDV LNKTEAFFRQ YLNI