Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | A9601_00811 |
Symbol | dap2 |
ID | 4716764 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. AS9601 |
Kingdom | Bacteria |
Replicon accession | NC_008816 |
Strand | + |
Start bp | 83680 |
End bp | 85605 |
Gene Length | 1926 bp |
Protein Length | 641 aa |
Translation table | 11 |
GC content | 27% |
IMG OID | 640077779 |
Product | esterase/lipase/thioesterase family protein |
Protein accession | YP_001008476 |
Protein GI | 123967618 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTAATG ATGATCAGTT AAAAGTTAGG CAAACTGTAT CTAAAAAGAA ATCTTTTAAA GAATTAACTA TTATTAGGGA TATCATTTTT TGGATTGATC TTGTTGGTGA AGGTCAAAAT GAAAATGCCA TTTTTGCAAG ACCATTTAAT AAAAAAGAGG CGTTTCCTCA GAAATTAACA AGTAAAAAAT ATAATATTAA AAATAACTTT CATGGATATG GTGGTAAATC TTATAAATGT ATATATTTAA AAAATAATTT TTATTTGATA TGGATTGATC AGATTACCAA CGCAGTTTGG TTTCAAATTT TTAAAGAGGT TGCATCAAAT TATAGAAGTC AAAAAAGATA TCTCGATTCA GTTCAAGAAC CAAGACAACT ATCTAAATCA ATTGATGGAA ATTTTGATTC TTCGTTTGTT ATTTCTCAAA AAAATTTTTT ATATGGTATT TGTGAAATAA ATAATAGAGA TTACTTATTT TCTTTAAACC TAAAAAAAAC TAAACAAGAT ATTTACCGAA TAAAAAAATT TAAAAATTTC GCTGGAGAAT TATCTTGTAA TTCTTCTGTT AGCTTACTTT CTTGGGTCGA GTGGGGTTCT CCATATATGC CTTGGGAGAA AAATGATCTT TTTTTTGCTC AAATTGACTT AGATGGAGAG ATAACAAAAA TAAAAAAATT CTCAGATAAG CTGATTAATG CCAAAAAAAA CGTTTCTTTT TTTCATCCTT ATTGGATAAG TGAAACTCTT TTAGTATGTT CTGAAGATAG TTCTGGATGG TGGAACTTAT TGTTTTTAGA TGCTAGTAAA ATTGAGAATA TTTTTATTAA AAAAAGAGTA GAGAGAAATT TTGTTGAATA TGGAGTACCT CAGTGGGTCT CAGGAATAAC ATTTTTTTCA GGGGATATAA AAGATTTATT TTGTTTAGCA AAAAAAGAAA ATAATTTAGT AGTTGAACAA TATAAAGATC TTCAATGCGT TAAAGAATTT TCTACTCCTT TTACCTCAAT AAGTGATTTC AGTGTTTTTG AGAAGAAAGT AGTTTTGAAA GGTCATGGAT CTGATTTTCT TGGAAATTTA CTTGAAATTG ATTTTAAAAA GGAAGTTTTA TCAAATGTTT TTGAGGAAAT AAATGCTGAA TATATAAAAG TTTGTTCAAA ACCTGAAACA TTTTGGTTTA AAGGTTTTGA AGCTCAATCT ACTCATTCTT TTCTTTATAG GCCGCTCGTA GAAAATTTTA GAAAGCCACC GCTCCTTGTT AGAGCACATA GCGGACCAAC TTCATGTTTT GATGGATCAT ATAATTCTGA GGTTCAATAT TGGACTTCGA AGGGATTTTT TGTTGCTGAA GTTAATTATG GAGGATCATC AGGATTTGGC AAAGCATATA TAGAGAGGTT GAATTGTAAA TGGGGTATTG TTGATTCTTA TGATTGCAAA GCACTAGCTC TTGAATTGAT TAAATCAAAT CAAGTTGATA GTGAAAAAGT AGTAATTTTT GGGAATAGTG CTGGTGGGTT AACTGCCCTG AATTGTTTAT TATATGGGTC TATTTTTACA GCAGCAATTT GTAAATATCC TGTTATTGAT TTGAAGGATA TGCATTACAA CACTCATAGG TTTGAAAAAG ATTATTTAAA TTCTTTGGTA GGAATTTATG CACAAAATCA TGATGATTAT ATAAATAGAT CACCGATAAA TCATATTAAC AAAATAAAAA AACCTATCTT ATTGTTTCAT GGAAAAAAAG ATAAAGTAAT TTCTTATAAA CAAACTTTTA AAATCCAGGA AATTTTGATT CAGAATAATA AATATTCAGA AGTTATTTTT TTTGATAATG AGGGGCACGG TTTTAGAAAT ATTGAAAATA AAGAAATAGT AATGCAAAAA TCTATGGAAT TTTTAAAAAA TGCTTTGAAT ATTTAA
|
Protein sequence | MSNDDQLKVR QTVSKKKSFK ELTIIRDIIF WIDLVGEGQN ENAIFARPFN KKEAFPQKLT SKKYNIKNNF HGYGGKSYKC IYLKNNFYLI WIDQITNAVW FQIFKEVASN YRSQKRYLDS VQEPRQLSKS IDGNFDSSFV ISQKNFLYGI CEINNRDYLF SLNLKKTKQD IYRIKKFKNF AGELSCNSSV SLLSWVEWGS PYMPWEKNDL FFAQIDLDGE ITKIKKFSDK LINAKKNVSF FHPYWISETL LVCSEDSSGW WNLLFLDASK IENIFIKKRV ERNFVEYGVP QWVSGITFFS GDIKDLFCLA KKENNLVVEQ YKDLQCVKEF STPFTSISDF SVFEKKVVLK GHGSDFLGNL LEIDFKKEVL SNVFEEINAE YIKVCSKPET FWFKGFEAQS THSFLYRPLV ENFRKPPLLV RAHSGPTSCF DGSYNSEVQY WTSKGFFVAE VNYGGSSGFG KAYIERLNCK WGIVDSYDCK ALALELIKSN QVDSEKVVIF GNSAGGLTAL NCLLYGSIFT AAICKYPVID LKDMHYNTHR FEKDYLNSLV GIYAQNHDDY INRSPINHIN KIKKPILLFH GKKDKVISYK QTFKIQEILI QNNKYSEVIF FDNEGHGFRN IENKEIVMQK SMEFLKNALN I
|
| |