Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_01331 |
Symbol | dap2 |
ID | 4780955 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | + |
Start bp | 128762 |
End bp | 130696 |
Gene Length | 1935 bp |
Protein Length | 644 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 640083397 |
Product | esterase/lipase/thioesterase family protein |
Protein accession | YP_001013962 |
Protein GI | 124024846 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.180596 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGAACA AAACAATGAG AATCTTGGAT GCTGAAAAAG TTTATGGAGA AGCCCCCATA TTTAAAGAGC CTCGTATAAT AGGTGATTGG ATTTTATGGT TAGAACAAAG ACCAAACGAA AAGGGAAGAA CTACAGCTTT AATCAGACCT TGGGGACAAA AAGACGTATT ACCTCAGGAG TTAACACCTT ATCCAAGTGA TTTAAGGACA AAAATTCATG GATATGGTGG CGCTCCGCTA ACAGCTACTC TCGATGGATC TGATCTTATA TTGACTTGGG TTGACAATAA AGACAACCGC TTATGGATGA GAACTTGGTT TTACGAGGAA GAAAAAGAAA AATCTTTTTC TTTTAAATTC ATACCTAAAA TAGAATCAAT TTGTCTTACA AAAAAACATA GCTATTTTCT TGCAGGTGGC GTGATTGATC TTGAAAAAAA TATTTGGATT GGTTTGATGG AGGATGAAGA AGGGGATCAT ATAGTTTCTT ACTCTCTAAA CAAATCTGAA CAATATCCAA AAATTATTTA TTCATCTCAG GGATTATTAG GTTATCTTGC TCTCAATTCT AAAGATAGAA AATTAGCATG GGTCGAATGG AAAAATACTT CAATGCCTTG GGATTTAAAT GAATTAAAAC TTGCTAAATT AGGTGAGAAA GAAAATATAA TTAATGTAGT AACTGTGAAT AATGAATATT TAAAATGCAC AGAAAAAATA TCATTTTTTA ATCCTATTTG GTCCGATACA GGTGATCTTT TTGTCGCTGA AGATAGTAGT GGCTGGTGGA ATATAACGCA GATAAAAACT GACTTAAATA ATAATTCAAT TACTATTTTC CAGAATCAAT GGACTATTAA GGCTGAAATT GCTTTCCCAC AATGGGTCCT CGGGATGTCG AGCTTTTCAT GTGTGGGGGA TAATGTCGTT GGGGCTTTTG CTCAGGAAGG AATTTGGACT TTAGCTCTAT TTCAAAAAGA TGGATCTATC AAGACTTTTG ATCAGACTTT TATTGAATTC TCAGGTATTC ATTCGCATCA AAATCGACTT GTTGCAATTG CCAGTAGTGC AGAAATTACT GAAGGGATTT TTGAAATAGA TTTATTGAAT CAAAGTTGGG AACATACTCC TGCCTCTTCA TTTAGCTTGG ATCCAAAGGA AATAAGTATT GGCGAATCTT TTTGGTTTAT TGGATCGAAT GAAGAGAAAG TACATGCTTG GTATTACCCT CCTCTGAATA AACAAATATT GTTACCTCCT TTGTTGGTGA AAAGTCATAG CGGACCTACT GGTATGGCTC GTTGTGGATT GGATCTTGAG GTGCAATTTT GGACATCAAG AGGTTGGGCG GTCGTAGACG TTAATTATGG AGGCTCTTCT GGTTTTGGTA GGGAATATAG AGATCGATTA AGAGGTAATT GGGGAGTAAT CGATGTTATG GATTGCACTA AGGCAGCTCA GTCTTTGATT GCATCTGGTA AGGCTGACAA GGACCGTATA GCAATTATGG GGAGCAGCGC ATCGGGTTTT ACAGCTTTAG GTTGTTTGAT ATCTTCTGAC ATTTTTAATA TTGGTGCATG TAAATATGCT GTGACTGATT TGATTGGTAT GGCTAATTCA ACGCATAGGT TTGAGGAATT TTATTTAGAT TATTTAATAG GAAACATAGA AACTGATTAT GAGAAATATC TGAAAAGATC GCCAATTGAA AATGTCAATT TTATGAATAT GCCATTGATT TTGTTTCATG GTTTAAAAGA TAAAGTTATA CCCTCTGATC AATCTATTGC GATTAAAGAT GAATTGTTAA AGCGTGAAAT TCCTGTGCAA ATCAATTTAT TTGAGAACGA AGGTCATGGA TTTAAAGACG GTAAAATCAA AGTTGATGTA TTAAACAAAA CAGAGGCTTT TTTTAGACAA TATCTAAATA TTTAA
|
Protein sequence | MKNKTMRILD AEKVYGEAPI FKEPRIIGDW ILWLEQRPNE KGRTTALIRP WGQKDVLPQE LTPYPSDLRT KIHGYGGAPL TATLDGSDLI LTWVDNKDNR LWMRTWFYEE EKEKSFSFKF IPKIESICLT KKHSYFLAGG VIDLEKNIWI GLMEDEEGDH IVSYSLNKSE QYPKIIYSSQ GLLGYLALNS KDRKLAWVEW KNTSMPWDLN ELKLAKLGEK ENIINVVTVN NEYLKCTEKI SFFNPIWSDT GDLFVAEDSS GWWNITQIKT DLNNNSITIF QNQWTIKAEI AFPQWVLGMS SFSCVGDNVV GAFAQEGIWT LALFQKDGSI KTFDQTFIEF SGIHSHQNRL VAIASSAEIT EGIFEIDLLN QSWEHTPASS FSLDPKEISI GESFWFIGSN EEKVHAWYYP PLNKQILLPP LLVKSHSGPT GMARCGLDLE VQFWTSRGWA VVDVNYGGSS GFGREYRDRL RGNWGVIDVM DCTKAAQSLI ASGKADKDRI AIMGSSASGF TALGCLISSD IFNIGACKYA VTDLIGMANS THRFEEFYLD YLIGNIETDY EKYLKRSPIE NVNFMNMPLI LFHGLKDKVI PSDQSIAIKD ELLKREIPVQ INLFENEGHG FKDGKIKVDV LNKTEAFFRQ YLNI
|
| |