Gene Information |
Locus tag | P9301_00801 |
Symbol | dap2 |
ID | 4911268 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9301 |
Kingdom | Bacteria |
Replicon accession | NC_009091 |
Strand | + |
Start bp | 82492 |
End bp | 84417 |
Gene Length | 1926 bp |
Protein Length | 641 aa |
Translation table | 11 |
GC content | 27% |
IMG OID | 640159645 |
Product | esterase/lipase/thioesterase family protein |
Protein accession | YP_001090304 |
Protein GI | 126695418 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases |
TIGRFAM ID | |
|
|
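The coordinate and length fields above are mutually consistent, which the following minimal sketch checks. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon; the function names are hypothetical and not part of the source database.

```python
# Values copied from the Gene Information table above.
START_BP = 82492
END_BP = 84417


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length: int) -> int:
    """Number of amino acids, assuming one terminal stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length)                  # 1926, matching "Gene Length"
    print(protein_length(length))  # 641, matching "Protein Length"
```

Under these assumptions, 84417 - 82492 + 1 gives 1926 bp, and 1926 / 3 - 1 gives 641 aa, in agreement with the table.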
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTAATG ATGATCAATT AAAAGTTAGG CAAAGTGTAT CTAAAAAAAA ATCTTTTAAG GAATTAACTA TTGTTAGGGA TATCATTTTT TGGATTGATC TTGTTGGCGA AGGTCAAAAT GAAAATGCGA TTTTTGCAAG ACCATTTAAT GAAAAAAGGG CGTTTCCTCA GAAATTAACA AGTAAAAAAT ATAATATTAA AAATAACTTT CATGGATATG GTGGTAAATC TTATAAATGT ATATATTTAA AAAATAATTT TTATTTGATA TGGATAGATC AGATTACCAA CGCAGTATGG TTTCAAATTT TTAAAGAGGT AGCATCAAAT TATAAAAGTC AAAAAAGATA TCTCGATTCA GTTCAAGAAC CTAGACAACT ATCTAAATCA ATTGATGGAA ATTTCGATTC TTCATTTGTT ATTTCTCAAA AAAATTTTTT GTATGGTATT TGCGAAATTA ATAATAGAGA TTACTTATTT TCTTTAAACT TAAAAAAAAC TAAACAAGAT ATTTACCGAA TTAAAAAATT TAAAAATTTC GCTGGAGAAT TATCTTCCAA TACTTCTATT AACTTACTTT CTTGGGTCGA GTGGGATTCT CCATACATGC CCTGGGAGAA AAATGATCTT TGTTTTGCTC AAATTGGCTT AGATGGAGAG ATAACAAAAA TAAAAAAATT CTCAGATAAG CTGATTAATG CTAAAAAAAA AGTTTCTTTT TTTCAACCTT ATTGGATAAG TGAAACTCTT TTAGTATGTT CTGAAGATAG TTCTGGATGG TGGAACTTAT TGTTTTTAGA TGCTAGTAAA ATTGAGAATA TTTTTATTAA AAAAAGAGTA GAGAGAAATT TTGTTGAATA TGGAGTACCT CAGTGGGTCT CAGGAATAAC ATTTTTTTCA GGGGATATAG AAGATTTATT TTGTTTAGCA AAAAAAGAAA ATAATTTAGT AGTTGAACAA TATAAAGATC TTCAATTCGT TAAAGAATTT TCTACTCCTT TTACTTCAAT AAGTGATTTT AGTGTTTTTG AAAAGAAAGT AGTTTTGAAA GGTTATGGAT CTGATTTTCT TGGAAATTTA CTTGAAATTG ATTGTAAAAA GGAAGTTTTA TCAAATGTTT TTGAGGAAAT AAATTCTGAA TATATAAAAG ATTGTTCAAA ACCCGAATCT TTTTGGTTTA AAGGTTTTGA AGATAAATCT ACTCATTCTT TTTTATATCG GCCGCTTGTT GAAAAATTTA GAAAACCACC GCTTTTTGTT AGAGCTCATA GTGGACCAAC TTCATTTTTT GATGGATCAT ATAATTCTGA AGTTCAATAT TGGACGTCGA AGGGTTTTTT TGTTGCTGAA GTCAATTATG GAGGATCATC AGGATTTGGC AAAGCATATA GAGAGAGGTT GAACTATAAA TGGGGTATTG TTGATTCTTA TGATTGCAAA GCACTAGCTC TTGAATTGAT TAAATCAAAT CAAGTAGATA GTGAAAAAGT GGTAATTTTT GGGAATAGTG CCGGTGGGTT AACTGCCCTG AATTGTTTAT TATATGGGTC TATTTTTAAA GCAGCAATTT GTAAATATCC TGTTATTGAT TTGAAGGATA TGCATTACAA CACTCATAGG TTTGAAAAAG ATTATTTAAA TTCTTTGGTA GGAAATTTTG AAAAAAATCA TAATGACTAT TTAAATAGAT CACCGATAAA TCATATTAAC AAAATAAAAA AACCTATCTT ATTGTTTCAT GGAAAAAAAG ATATAGTTAT TTCTTATAAA CAAACTTTAA AAATTCAAGA AATTTTGATT 
CAGAATAATA AATATTCAGA AGTTATTTTT TTTGATAATG AAGGGCATGG GTTTAGAAAT ATTGAAAATA AAGAAGTAGT AATGCAAAAA TCTCAGGAAT TTTTAAGAAA TGCTTTGAAT ATTTAA
|
Protein sequence | MSNDDQLKVR QSVSKKKSFK ELTIVRDIIF WIDLVGEGQN ENAIFARPFN EKRAFPQKLT SKKYNIKNNF HGYGGKSYKC IYLKNNFYLI WIDQITNAVW FQIFKEVASN YKSQKRYLDS VQEPRQLSKS IDGNFDSSFV ISQKNFLYGI CEINNRDYLF SLNLKKTKQD IYRIKKFKNF AGELSSNTSI NLLSWVEWDS PYMPWEKNDL CFAQIGLDGE ITKIKKFSDK LINAKKKVSF FQPYWISETL LVCSEDSSGW WNLLFLDASK IENIFIKKRV ERNFVEYGVP QWVSGITFFS GDIEDLFCLA KKENNLVVEQ YKDLQFVKEF STPFTSISDF SVFEKKVVLK GYGSDFLGNL LEIDCKKEVL SNVFEEINSE YIKDCSKPES FWFKGFEDKS THSFLYRPLV EKFRKPPLFV RAHSGPTSFF DGSYNSEVQY WTSKGFFVAE VNYGGSSGFG KAYRERLNYK WGIVDSYDCK ALALELIKSN QVDSEKVVIF GNSAGGLTAL NCLLYGSIFK AAICKYPVID LKDMHYNTHR FEKDYLNSLV GNFEKNHNDY LNRSPINHIN KIKKPILLFH GKKDIVISYK QTLKIQEILI QNNKYSEVIF FDNEGHGFRN IENKEVVMQK SQEFLRNALN I
|
| |
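The sequences above are printed in blocks of 10 characters, so the whitespace has to be stripped before any analysis. The sketch below shows one way to do that, compute a GC fraction, and translate a CDS. The helper names are hypothetical. The codon table is written out by hand from the standard amino acid assignments, which translation table 11 shares with table 1 (the two differ only in permitted start codons, which this sketch does not model).

```python
# Amino acid assignments for codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE.get(seq[pos:pos + 3], "X")  # X = unknown codon
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence listed above.
    first_blocks = "ATGAGTAATG ATGATCAATT AAAAGTTAGG"
    print(translate(first_blocks))  # MSNDDQLKVR
```

The translated prefix matches the first block of the protein sequence above. Passing the full gene sequence to `gc_content` should give a value comparable to the GC content field, though the exact rounding used by the source database is not stated.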