Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9211_04931 |
Symbol | |
ID | 5730349 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | + |
Start bp | 461541 |
End bp | 463091 |
Gene Length | 1551 bp |
Protein Length | 516 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 641284852 |
Product | carboxypeptidase Taq (M32) metallopeptidase |
Protein accession | YP_001550378 |
Protein GI | 159903034 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2317] Zn-dependent carboxypeptidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCCTGATA TCGCTTGGCA GAAACTTGGG GACTACCTTC AAGAAACCCA GGTTCTGGGC TCAATTCATA GCACTCTTTA CTGGGATCAA AACACTTCGA TGCCTATTAA TGGTGCTAAG TGGAGAGGCG ACCAATTAAG TTTTTTGGCC AGATCGCTGC ATGCAAGGCA AAGCAGCGAA CGTTTTGAGG AACTGCTTCA TGAAGCTAAG GATGAATTTC AACGAAATTA TGGTCTGAAA CCCTTAGTTT CCAAAGAATT TAATGAGAAG AAAAGGAATT TGGAATTATT AACCCAGGAG CTTCATCGTC AAAAGAGGCT TGATCCAGAC TTAATTACTC AGCTTGCAAC TGCCCAAACC AATGGGTATT CACTGTGGCA ACAAGCAAGG CGGGAGAATG ATTTTCAATG TTTTGCTCCT GCTCTTCGCT ATCTGATTTC ACTAAGGCAG GAACAGGCAA AACAGCTTGA TGAGCCCCGA AGTTGCTGGG AAACACTTGC ACAGCCATTT GAACCCGATT TAACAATTCA ACGCCTCAAT GAATTGTTTA CTCCTTTGAG GAAGCGATTG CCAGAATTAA TATCTAAATA CTGTCTTGCC AAAGATATTA ATCAACAAAA ATGGGACCTT GAGGAAAAAT CACAACAAGA TCTTTGTGAG CGATTACTTC AAGAATGGGG TAGGAATCCC CAAGTTACGT CTATAGCAAG ATCTCCTCAT CCATTTTCAA TTACTCTTGG ACCTCAAGAC TTTCGTCTGA CTACTCGTGT TGTACGAGGT CAACCTCTTT CGTGCTTCCT TGCAACAGCA CATGAATGGG GACACTCTCT TTATGAGCAA GGATTACCTT CTCAGTCTCA TCAGTGGTTT GCATGGCCTT TAGGTCAAGC AACTTCTATG GCTGTACATG AAAGCCAGTC ACTTTTTTGG GAAAATCGAG TAGCTCGAAG TCGAGCATTT TCATATCGTT TTTGGAAGTA TTTTGCTGAA GCAGGGGCTC CGTTGACTTG TGGTCATGAC TTGTGGAGAG CAATGAATCC ATTGACTCCT GGCCTGAATA GAGTTGAGGC TGATGAACTT AGTTATGGAC TTCATATCCT AATTCGGACT GAATTGGAAA TTGCTTTTCT TGAAGAGGGA TTGGAAGTTA ATGATATTCC TTCTGAATGG AACAAAAAAT ACAAAGAATT GCTTGGAGTT GTTCCTAGCA ATGACTCAGA AGGATGTCTT CAGGATGTTC ACTGGAGCGA AGGCTCATTT GGTTACTTTC CTTCTTATTT GATTGGTCAT TTAATCAGTG CTCAACTCTC AGAAGCAATG ATTGAAGGGC TTGCTAATGA TGGGGTTCAA GGAGAAGATC CCATTGGTGA ATGTATTACG AATGCTTCTG AATCTAAGCT TCTATCTTGG TTAAGAAGAG AAGTCCATCA TTACGGACGC CAACTAAATG CTGAACAACT AGTTGAAAAA GTTACGAAAA AGCCTCTTTC TAGCAGAGCA TTCTTAACTT ATTTAGAAAA TAAGCTAGAG CAAATGACCA GCACCCCGTA G
|
Protein sequence | MPDIAWQKLG DYLQETQVLG SIHSTLYWDQ NTSMPINGAK WRGDQLSFLA RSLHARQSSE RFEELLHEAK DEFQRNYGLK PLVSKEFNEK KRNLELLTQE LHRQKRLDPD LITQLATAQT NGYSLWQQAR RENDFQCFAP ALRYLISLRQ EQAKQLDEPR SCWETLAQPF EPDLTIQRLN ELFTPLRKRL PELISKYCLA KDINQQKWDL EEKSQQDLCE RLLQEWGRNP QVTSIARSPH PFSITLGPQD FRLTTRVVRG QPLSCFLATA HEWGHSLYEQ GLPSQSHQWF AWPLGQATSM AVHESQSLFW ENRVARSRAF SYRFWKYFAE AGAPLTCGHD LWRAMNPLTP GLNRVEADEL SYGLHILIRT ELEIAFLEEG LEVNDIPSEW NKKYKELLGV VPSNDSEGCL QDVHWSEGSF GYFPSYLIGH LISAQLSEAM IEGLANDGVQ GEDPIGECIT NASESKLLSW LRREVHHYGR QLNAEQLVEK VTKKPLSSRA FLTYLENKLE QMTSTP
|
| |