Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_01951 |
Symbol | |
ID | 4777239 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 213618 |
End bp | 214814 |
Gene Length | 1197 bp |
Protein Length | 398 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640085694 |
Product | Zinc metallopeptidase M20/M25/M40 family protein |
Protein accession | YP_001016215 |
Protein GI | 124021908 |
COG category | [R] General function prediction only |
COG ID | [COG1473] Metal-dependent amidase/aminoacylase/carboxypeptidase |
TIGRFAM ID | [TIGR01891] amidohydrolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTCTTC TTCTTCATTG GAGCGAGCGA CTTGATCAAG CTTTGCCCAA GCTGATTGAG TTGCGGCGCC ATCTGCATGC TCATCCAGAA CTGAGTGGTG AGGAGCATCA GACGGCAGTC CTTGTAGCAG GTCAGTTGCG GGGGGATGGT TGGCGGGTGT CTGAGGGGGT GGGGCGCACA GGTGTGTTGG CGGAGTTGGG GCCAATCGGT GGTCCGTTTG TGGGCCTGCG GGTGGATATG GATGCTCTGC CAGTGGAAGA GCGCACAGGG CTTGTTTATG CCTCACGGCG AGAGGGGGTG ATGCATGCCT GTGGCCATGA TCTGCATACC TGCATTGGTC TTGGGGTTGC AAGGGTTTTG GCCAAAGAGG AGTCGATGCC AATAGGAGTA AGGCTGTTGT TTCAACCGGC AGAAGAATTG TGCGAAGGGG CTCGTTGGAT GCGTATGGAC GGGGCGACCG ATGGCCTCGA GGCCCTGTTT GGGGTTCATG TTTGCCCGGA ATTGCCGACT GGAAGTATTG GTGTACGCAG TGGGTGCTTG ACGGCTGCGG CGGGAGAGCT GGATATCGAA GTGATCGGTG AAGGAGGCCA CGGTGCAAGG CCTCATCAGG CTATGGATGC CATTTGGCTT GCCGCTCGAG TTGTTTGTGG ACTGCAGGAG GCGATCAGTC GTCGCTTGGA TGCCTTGCAT CCTGTTGTGG TGAGTTTCGG GAAGATCGAA GGTGGTCAGG CTTTCAATGT GATTGCAGAT CGAGTTCGGC TGCTGGGTAC GGTCCGCTGT TTGGACGGGG CTGTTTTTGA CAAGCTCCCA GCTTGGATCG AGCAGATTGT TCAGGCCATC TGTGGAAGTT TTGGGGCAGA GGCGATCGTG CGCTATCGCA GCATTACGCC GCCTGTTTAC AACGATCCTG AACTCACAGA CCTATTGGAA AGTTGTGCTA TCTCTCAGAT AGGAAAAGAG CGTGTGTTGC GGCTTGAACA ACCCTCGCTT GGTGCTGAAG ACTTTGCAGA ACTACTGCAA AAGGTCCGTG GAACAATGTT CCGATTAGGG GTTAGCGGCC CTAATGGGTG CGCGCCTTTA CACAATGGTC AGTTCTGCCT AGAGGAGAGC AGTCTTGGAG TTGGTATTCG GGTTCTGACC GCGACCCTAT TGGCCTGGAT GGATGAGCGT TCAAGGCTGG CTTTGGAGAG GACATGA
|
Protein sequence | MTLLLHWSER LDQALPKLIE LRRHLHAHPE LSGEEHQTAV LVAGQLRGDG WRVSEGVGRT GVLAELGPIG GPFVGLRVDM DALPVEERTG LVYASRREGV MHACGHDLHT CIGLGVARVL AKEESMPIGV RLLFQPAEEL CEGARWMRMD GATDGLEALF GVHVCPELPT GSIGVRSGCL TAAAGELDIE VIGEGGHGAR PHQAMDAIWL AARVVCGLQE AISRRLDALH PVVVSFGKIE GGQAFNVIAD RVRLLGTVRC LDGAVFDKLP AWIEQIVQAI CGSFGAEAIV RYRSITPPVY NDPELTDLLE SCAISQIGKE RVLRLEQPSL GAEDFAELLQ KVRGTMFRLG VSGPNGCAPL HNGQFCLEES SLGVGIRVLT ATLLAWMDER SRLALERT
|
| |