Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_03061 |
Symbol | dap2 |
ID | 4778861 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 321324 |
End bp | 323282 |
Gene Length | 1959 bp |
Protein Length | 652 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640085808 |
Product | esterase/lipase/thioesterase family protein |
Protein accession | YP_001016324 |
Protein GI | 124022017 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAAGCA CCAGCCATCA CCAGCGAAAC CCGAAGCAAA CCACTCAACC TCTTTCTGCC AGACGCGCTC TCGGACAAAC ACCCACCCTC AAAGAACCGC GCCTGATTGA TGACTGGGTG CTTTGGCTGG AGCAACGTCC TCAGGAACAT GGTCGCACCA CTGCCCTTAT CCGTCCCTGG GGTCAATCTG ACCACCCCCC CCAAGAGCTG ACACCTGCCC CAGCCAACCT GCGTAGTCGT ATTCACGATT ACGGAGGAGG GGTTCTAGCA ACAGCCTGCC AAGACAACCA GCTACTGATG GCCTGGATCG ATGATGCCGA TGGCTGCCTC TGGTTCCAGC GCTGGCAGGG CCTCAACCAG GCCACAAAAG GTAAGAAGGC ATTATCTCCG CTCAAGCCGC CGCTTCGCCT CTCAAAACCA AACGATGCCC AACTTGCTGA TGGTCTGATT GACCTCCCAC GACAGCGCTG GCTCGGAATC ATGGAAGCAG ACAAGCGGGA CTGGCTGGTG ACCTTCTCCC TCAACCATGA GAACCAGGCT GCCACGGTGT TGCATCGCCC TGCTGATTTT GCTGGTTACG CGATCCTCAG CCCGAATGGA GATCAACTGG CCTGGGTGGA ATGGCAACAA CCGGCCATGC CCTGGGAGGC AAGCCAACTC TGGTGGGCCA GCCTCGACCC TGCGGGTTTG ATCCAAAGCT CGGCCTGTCT AGCTGGTAGC AAACCACTTG ATCACAAACA AACGTCCGTT TTCCAGCCCC TTTGGCTACC CAATGGAGAG CTGGTTGTCA GCGAAGACAG CAGCGGCTGG TGGAATCTGA TGGTGGCAAA GCTGACGACT GACCCCACTG TCCAACCCAC TTGGCGACGC CCCTGGCCAC TTTCAGCCGA AACCGGCATG CCGCAGTGGG TTTATGGCAT GAGCAGCAGC GCATGGGATG GAGAACAAAT TCTGACCGCC GTCTGTGAAC AAGGTTCTTG GAGGCTGAGC CGCTTGGCCG ATGATGGACA GATCAGCACC ATCAACCAAC CTTTTGATGA TCTAAATGGT CTGCAGGCAC AGGAAGGTCG AGCCGTAGCC ATCGCTAGCA ATGCCACCAC GAGCCCTGGG CTACTAGAGC TCAACCTCAA CTGTGGCAGC TGGAAGCACA CCCCAGCCAA TGAGCCTTTA CTGAATGCTG ATGCAATCAG CGTTGCGGAA CCTATCTGGT TTGAAGGCTG CCATGGCCAG GCAACCCATG CCTGGTATTA CCCGCCAATC AATGGCAGCA AAGGCCCTGC GCCACTACTT GTCAAAAGCC ATAGCGGTCC TACCAGCATG GCCAACCACG GTCTAAGCCT CAGCATTCAG TTCTGGACAT GCAGAGGCTG GGGAGTGGTG GATGTGAACT ATGGCGGCTC CACTGGATTT GGCCGTGCAT ACCGCGAACG CCTACGGGGA GGCTGGGGTG AGACAGACGT AACGGATTGC GCACAAGCAG CACTTGCACT AGTGAAATGC AACAAGGCAA ACCCAACACA AATCGCCATT GAAGGAGGCA GTGCCGGTGG ATTTACCACC CTGGCCTGCC TTTGTTTCAC AGATGTCTTT CGCGCTGCTG CCTGCCGTTA TGCAGTGAGT GATCTCACCG CCATGGCAGA AGACACCCAT CGATTTGAAG CGCGATACCT CGATCACCTA GTAGGCCGTT GGCCCGACCA AAGACAACTT TACGAAAACC GCTCACCTCT CCTGCATGCC AACAAGATCC AATGCCCAGT GATCTTCTTT CAGGGACTTC AAGACAAAGT GGTTCCTCCA GATCAAACAG AACGGATGGC CAATGCCTTA AAAGAAAACG GCATACCAGT TGAACTACAC ATTTTTGAGC AGGAAGGCCA CGGCTTTCGC GACAGTGCTG TCAAGATCAA AGTCTTAGAA GCAACTGAGC AATTCTTCCG CCGCCACCTA AAGCTCTAG
|
Protein sequence | MESTSHHQRN PKQTTQPLSA RRALGQTPTL KEPRLIDDWV LWLEQRPQEH GRTTALIRPW GQSDHPPQEL TPAPANLRSR IHDYGGGVLA TACQDNQLLM AWIDDADGCL WFQRWQGLNQ ATKGKKALSP LKPPLRLSKP NDAQLADGLI DLPRQRWLGI MEADKRDWLV TFSLNHENQA ATVLHRPADF AGYAILSPNG DQLAWVEWQQ PAMPWEASQL WWASLDPAGL IQSSACLAGS KPLDHKQTSV FQPLWLPNGE LVVSEDSSGW WNLMVAKLTT DPTVQPTWRR PWPLSAETGM PQWVYGMSSS AWDGEQILTA VCEQGSWRLS RLADDGQIST INQPFDDLNG LQAQEGRAVA IASNATTSPG LLELNLNCGS WKHTPANEPL LNADAISVAE PIWFEGCHGQ ATHAWYYPPI NGSKGPAPLL VKSHSGPTSM ANHGLSLSIQ FWTCRGWGVV DVNYGGSTGF GRAYRERLRG GWGETDVTDC AQAALALVKC NKANPTQIAI EGGSAGGFTT LACLCFTDVF RAAACRYAVS DLTAMAEDTH RFEARYLDHL VGRWPDQRQL YENRSPLLHA NKIQCPVIFF QGLQDKVVPP DQTERMANAL KENGIPVELH IFEQEGHGFR DSAVKIKVLE ATEQFFRRHL KL
|
| |