Gene Information |
Locus tag | P9211_00771 |
Symbol | dap2 |
ID | 5730227 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | + |
Start bp | 81970 |
End bp | 83907 |
Gene Length | 1938 bp |
Protein Length | 645 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 641284420 |
Product | esterase/lipase/thioesterase family protein |
Protein accession | YP_001549962 |
Protein GI | 159902618 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases |
TIGRFAM ID | |
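The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; neither assumption is stated in the record, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming it ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length_bp = gene_length(81970, 83907)
print(length_bp)                  # 1938
print(protein_length(length_bp))  # 645
```

Under these assumptions the results agree with the Gene Length (1938 bp) and Protein Length (645 aa) fields.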
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.248963 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.78651 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGTTTCAC CAAAGAATCA AGTCAGTTCA AAATTCTTAG ATGCAGAGGT TGTTTTGGGT GAATTCCCCA AAATCAAATC CCCTAGAATT CTTGGTGATT GGGTCTTCTG GCTAGAACAG CGTCCTTATG AGAATGGGAG AACCACACTC CTCACTCGTC CTTGGGGGAG GTTTGATTGC CTTCCACAAG AGTTAACACC TTTTCCAGTA AATCTTAGAA CTCGTTTGCA TGGTTATGGA GGTTCGCCAC TTGCCTTGGT TAAGCAAGCT GATTGCTTTG TAATGACATG GATTGATGAC CAGCAAGGGG GTTTATGGCA TCAAAAATGG ATTATTGCTG ATCAAACAAA ACCAACAATT TTAAAATCTC TGTCTAGCCC AATTTGTTTA TCTTTGAAGG ACAAGTATTG TCTTGCTGAT GGTTTGATTG ACTTACAATT TAATAGATGG ATTGGTGTGA TGGAGGAAGA TAATAAAGAT TATTTGGTTT CATTTGCGCT CGATAAAGAA TTAAAAGCAC CAATGATTCT TCATCAAGCA ATTGATTTTC TTGGGTATCC AACATTAAGT ATAAAGTCTG ATCAATTGGC ATGGGTTGAG TGGCAAAAGC CATACATGCC ATGGGATCAA AGCCAAATCT TTCACTCTTT TATTAATGAC ATAGGTAAAC TTAGCTCAGT TTCGATGTTG TCTGGATCAG ATAAATCTTC CCAAAAAAGT TCTGCTTTTC AGCCTCAATG GTTGCCTAAT GGTCAATTAA TTGTAGCTGA AGATAGTAGT GGATGGTGGA ATCTTAAGAT TGCAGGGCCA GATTTTTCTT CTAATTTAAC TAATCAATTT AGTAATCTTT GGCATATAAA AGCTGAAGCT GCCTGTCCCC AGTGGATTCA TGGGATGTCT ACCATTGCTT CTTCTGGGAA AAAAGAAATT GTTGCTCTTA GTTGTCAAGA AGGTAGTTGG TCCATGAGTG TTGTAAACAA GAGTGGTTCA GTCACAAAGT TGCAACTACC TTTTGAACAT TTTGAAGATG TATCTTCTGA GGAAGGAAAG GCAGTTGCAA TAGCAGCTAA TTCTTTCCTA GATTCTGGTC TGCTTGAAGT GAATTTAAAA AATGGTAGTT GGATTCATAA TTCCTTTAGA GAGTCAATAG TTAAACCACA AGAAATCAGT ATTGCTGAAT CATTTTGGTT TAAGGGTTTT GGAGGTGAGA TGAGTCATGC TTGGTATTAC CCCCCGATTC AGGGTCGATT GAACTATTCA CCTCTTTTAG TGAAAGCTCA TAGTGGTCCT ACTTCTATGG CAAAAAGAGG TTTGAATTTA GAAATTCAGT TCTGGACTTC TCGAGGATGG GGTGTTTTAG ATGTTAATTA CGCAGGATCA ACAGGCTTTG GTCGAGCTTA TAGAGATCGC TTAAAACATT CTTGGGGAGA GGCAGATGTT TTTGATTGCT CTCAAGCTGC CATGGAATTA ATTAATAATG GGAAAGCCGA TAAAAATTTA GTTGCTATTG AAGGATCTAG CGCAGGAGGT TTCACGAGTT TATGTTGCCT ATGCTTTAGA AATATTTTTA GGGTCGCTTC TTGTAAATAC CCAGTAATTG ATCTTCTTGA TATGGCAAAC TCAACCCATC GCTTTGAAGA GTATTACTTA GATTTCCTGA TAGGTAAATT TAACAATAAC AAGCATTTGT ATATGAGCAG ATCTCCTATC AATAATTTAG ATAAGATTAC TTGCCCTGTA ATCTTATTTC AAGGATTAAA AGATAAGGTT GTTTCTCCTG AGAAAACTAA AGATTTGTTT 
ACAGCTTTGA AAAATAAGAA AATACCTACT GAATTACATG TTTTTGATAA TGAAGGTCAT GGCTTTAATC ATCGGTCTAC AAAAATTAAA GTTTTGCGAG AAACAGAATC ATTTTTTAGA GAGCATTTAG GTATCTAA
Protein sequence | MVSPKNQVSS KFLDAEVVLG EFPKIKSPRI LGDWVFWLEQ RPYENGRTTL LTRPWGRFDC LPQELTPFPV NLRTRLHGYG GSPLALVKQA DCFVMTWIDD QQGGLWHQKW IIADQTKPTI LKSLSSPICL SLKDKYCLAD GLIDLQFNRW IGVMEEDNKD YLVSFALDKE LKAPMILHQA IDFLGYPTLS IKSDQLAWVE WQKPYMPWDQ SQIFHSFIND IGKLSSVSML SGSDKSSQKS SAFQPQWLPN GQLIVAEDSS GWWNLKIAGP DFSSNLTNQF SNLWHIKAEA ACPQWIHGMS TIASSGKKEI VALSCQEGSW SMSVVNKSGS VTKLQLPFEH FEDVSSEEGK AVAIAANSFL DSGLLEVNLK NGSWIHNSFR ESIVKPQEIS IAESFWFKGF GGEMSHAWYY PPIQGRLNYS PLLVKAHSGP TSMAKRGLNL EIQFWTSRGW GVLDVNYAGS TGFGRAYRDR LKHSWGEADV FDCSQAAMEL INNGKADKNL VAIEGSSAGG FTSLCCLCFR NIFRVASCKY PVIDLLDMAN STHRFEEYYL DFLIGKFNNN KHLYMSRSPI NNLDKITCPV ILFQGLKDKV VSPEKTKDLF TALKNKKIPT ELHVFDNEGH GFNHRSTKIK VLRETESFFR EHLGI
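The relationship between the two sequence fields can be illustrated with a short translation routine. This is a sketch only, not the database's own pipeline: it assumes the sequences are given in whitespace-separated blocks as shown above, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons). The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; table 11 shares these
# codon-to-amino-acid assignments with the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block-separating whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence listed above.
first_blocks = "ATGGTTTCAC CAAAGAATCA AGTCAGTTCA"
print(translate(first_blocks))  # MVSPKNQVSS
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `gc_percent` would give a value to compare against the GC content field.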