Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9301_05201 |
Symbol | |
ID | 4912351 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9301 |
Kingdom | Bacteria |
Replicon accession | NC_009091 |
Strand | + |
Start bp | 451498 |
End bp | 453003 |
Gene Length | 1506 bp |
Protein Length | 501 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 640160100 |
Product | carboxypeptidase Taq (M32) metallopeptidase |
Protein accession | YP_001090744 |
Protein GI | 126695858 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2317] Zn-dependent carboxypeptidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.322027 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGGCTGAAA CTCATTGGAA AAAGCTGGGT GCTTACCTTA AAGAAACACA AATATTAGGT TCAATCCAAA ATACACTTTA TTGGGATCAG AATACTGGAA TGCCAAAAAA AGGGGCTTAT TGGAGGTCTG AACAACTTAC TTATATTGCA AAAGTATTGC ATGAAAGAAA TTCTTCCGAG GAATTTTCTA ATCTGATACA ATCTGCAAAA AATGAACTAG CAGATATTGA AAGAAATTCC GATAATCAAC TTTTCATAAA AGATAAAGAA AGAAATATTA GTCTTTTATT GAAGGAATTT AATAGAGAAA GAAATTTAGA TCCTAAATTA GTTGAGTCTT TAGCAAAGGC AAAATCTAAA GGATATGAAA GCTGGCAAGA AGCTAAGGAA AAATCAGATT TTAAAATTTT TCTTCCTTTC TTTGAAGAAT TAGTTAAATT GCGGATTGAA GAGGCAAAGC AAATATCTAT TAAATGTTCA CCTTGGGAGA CATTAGCCCA ACCCTTTGAG CCTGAATTAA ATTTGAAATG GTTGAACAAA ATTTTTCAAC CTTTGAAAGA AACCATCCCA GGCTTGATTA GAGGACTTAA CAAGTCCCAA AAAAATCAAT GGGATTTAAG TCCAGAATCT CAAAAAAAAT TATGTTCTAA ATTACTTGAC GAGTTTGGAA GAGATAGAGA TCTCGTAGTT GTTGGACAAT CTCCCCATCC TTTTTCGATT ACATTAGGGC CAAATGATTT TAGGATCACT ACAAGAATTG TTGAAGGTGA ACCATTATCA AGTTTTTTAG CAACCGCGCA TGAGTGGGGG CATTCTATTT ATGAGCAGGG TTTGCCATCA CAAAGTCATC AATGGTTTGC TTGGCCTTTA GGTCAAGCAA CATCTATGGG TATTCATGAA AGTCAATCTT TATTTTGGGA AAATAGAATA GTTAAATCCA AATCTTTTTC AAAAAGATTT TTTAAAAAAT TTGTTTCGGC TGGATGTTCT CTTAATAATT ATTTAGAACT ATGGAAATCT ATTAATCATT TGGAAGCAGG ATTAAATAGG GTGGAAGCGG ATGAATTGAC TTATGGCTTA CACATATTAA TAAGAACCGA ACTTGAAATA GATTTAATTG AAAGAGGGTT ACCTGCTGAA GATATTCCAA CAGAATGGAA TAAAAGATAT GGTGAACTCC TAGGAATTAA ACCATCTAAT GATTCAGAAG GTTGTCTTCA AGATGTTCAT TGGAGTGAAG GGGCGTTTGG ATATTTCCCC TCATATTTGT TAGGACATGT TATAAGTGCG CAAATATCTT CTCAAATGGA AAGAGAAATA GGTTTGATTG ACAACTTAAT TGAAAATGGT GAATATCAAA AGATCATCTT TTGGTTAAAA AATAATATAC ATAAATATGG CAGATCTGTT AATTGTATGG AGTTGGTAAG AGCTGTAACT AATGAAGAAC TATCGCCAAA CTATTTTATT AATCATTTAA GGTCTAAAAT AAATGATTTT TGCTGA
|
Protein sequence | MAETHWKKLG AYLKETQILG SIQNTLYWDQ NTGMPKKGAY WRSEQLTYIA KVLHERNSSE EFSNLIQSAK NELADIERNS DNQLFIKDKE RNISLLLKEF NRERNLDPKL VESLAKAKSK GYESWQEAKE KSDFKIFLPF FEELVKLRIE EAKQISIKCS PWETLAQPFE PELNLKWLNK IFQPLKETIP GLIRGLNKSQ KNQWDLSPES QKKLCSKLLD EFGRDRDLVV VGQSPHPFSI TLGPNDFRIT TRIVEGEPLS SFLATAHEWG HSIYEQGLPS QSHQWFAWPL GQATSMGIHE SQSLFWENRI VKSKSFSKRF FKKFVSAGCS LNNYLELWKS INHLEAGLNR VEADELTYGL HILIRTELEI DLIERGLPAE DIPTEWNKRY GELLGIKPSN DSEGCLQDVH WSEGAFGYFP SYLLGHVISA QISSQMEREI GLIDNLIENG EYQKIIFWLK NNIHKYGRSV NCMELVRAVT NEELSPNYFI NHLRSKINDF C
|
| |