Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpha266_2025 |
Symbol | |
ID | 4569145 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | + |
Start bp | 2339043 |
End bp | 2341988 |
Gene Length | 2946 bp |
Protein Length | 981 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 639766606 |
Product | peptidase M16 domain-containing protein |
Protein accession | YP_912461 |
Protein GI | 119357817 |
COG category | [R] General function prediction only |
COG ID | [COG0612] Predicted Zn-dependent peptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0310906 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGCCAA TTTGTCCAAC ACCAGCAAAA ACAGGTATTC CTGCTCTTAT TTTTGCGATG CTCCTCCTGA ACCTTATTTC ATGCACACCA ATGATAACCG ACGTAAAAAA TTATCCTTAC ACAACCGTTC CTGGCGACTC TCTTCACACA CGAATCTATA CACTGAAAAA CGGGCTGACG GTTTACATGA GTCCCTATCA TGACGAACCA CGCATCTACA CCTCTATTGC GGTCAGAGCC GGCAGTAAAA ACGATCCCGC CGAAACAACC GGTCTTGCCC ACTATCTCGA ACACATGCTG TTCAAGGGAA CCGACTCGAT CGGATCGCTG AATTATGAAA AAGAACATGC CGAACTCGAA AAAATCATCG CCCTGTACGA GGAGTACAGA AAATCAACGG ATCCGGCAAA AAGAGCCGCT ATCTACCGGG ATATCGACAC GCTCTCCAAT GCCGCAGCAC AATATACCGT ACCCAATGAA TACGACAAGC TGCTGAATTC CATCGGCGCA CAGGGAACCA ACGCTTACAC CTGGGTTGAA CAGACCGTCT ACATCAACGA CATTCCTGCA AACAAGCTCA ATCAGTGGCT TACCATAGAG GCTGAACGGT TCCGCAATCC GGTGATGCGG TTATTTCATA CCGAACTTGA AACCGTCTAT GAAGAAAAAA ATATGACGAT GGACAGCGAC AGCCGCAAGA TATGGGAGAG CCTCTTTGCC GGACTCTTCA AAACACACAC TTACGGAACC CAGACAACGA TTGGCAAAGC CGAGCATCTT AAAAATCCTT CCATAAAAAA CGTTCTGGAA TATTATCGCA CCTACTACGT GCCAAACAAC ATGGCGCTCT GTATTGCCGG TGATTTTGAC CCCGATGAAA CCATCAAGCT CATCGATAAC AAGTTCTCTC TTCTTGAACC GAAAGCGATC CCGGTATTTA CGCCGCCGGT CGAACCCCCG ATAAGCAAGC CGATCATCGA AAAAGTCAAA GGCCCCGAAG CTGAGGAGCT GGTAATAGGA TTTCGTTTCA ACGGAGTCAA CAGCAATGAT ACAGATTATA TTACTCTGAT CGACAAAATT CTCTTCAATC AGACCGCCGG CATCATCGAC CTGAATCTGA ACCAGCAGCA AAAAGTACTT GACGCAGGCT CCATGCTGGT CATGATGAAA GATTATTCCG CGCATATCCT GAGCGCCAAA CCCAGAGAGG GGCAAAGCCT CGACCAGGTT AAAACACTGC TGCTTGAGCA ACTCGAACAA CTCAAAAAAG GCAATTTCCC CGACTGGCTG CTCGTTGCCG CAATTAATGA CCTTAAAATC GAAGAACTTA AACTCTATGA AAGCAATCGG GGAAGAGTCG AGGCCTATGT CGACGCCTTT ATCATGGGAA CTGAATGGAA TAACTACATC AGCCAGATTG AACGACTTGA AAAAATCACT AAAGAACAGC TTGTGGCTTA TGCGAAAATA CACTACAACG ACAACTATGT AGCGATCTAC AAGCAGCATG GCAAGGAAAA AAGCGAAGCG AAAATCCAGA AGCCACCGAT CACGCCAATC AAGGTCAATC GGGACAGCTC CTCAACGTTC GCCGAAAACC TGCTTGCGCA GCATTCGGAA AAAACGGAAC CCCGGTTCCT CGACTATGCA AGCGATATCG GATTTTTCGA TGTAAACCAA AACGTCAGGC TCCACTACCT GCAAAACCGG GAAAACGAGC TCTATTCGCT CTACTACGTC TTCGATATGG GAAAAAGCCG AAACAGAAAA ATCGATCTTG CGCTCGATTA CCTCTCTTAC CTCGGCTCGT CCGGTTACAC TCCTGCGGAA TTCAGTCAGG AACTCTATAA AATCGGAGCA AGCTTTTCGG CATTTACCTC TGACGATTTC GTCTATCTTC AACTCTCGGG ACTGCAGAAA AACTTTCCCG CCGCGATCAG GCTGCTTGAA AAACTGCTCA CCGACGCTCG CCCCGACGAA AGCGCGCTGC AAAAACTCAA GGCAGGCATC CTCAAGGAAA GAGCCGACGA CAAGCTTTCA AAAAAGAAAA TCCTGTTTGA AGCCATGACG AACTACGGCA AATACGGAGC CTCGTCACCC TTTACCAACG TACTTTCAAA CAGCGAACTC AACTCGATAA CCTCAGGCGA CCTGCTCAGC GAAATACAAA ACCTGATGGA ACACGGTCAT CGGGTGCTCT ACTACGGACC GGCAACTTCC GGCGAAATCC TTTCGGAACT GCACGCAGTC CGACACTACC CCGAAACGTT CAAGCCCTAT CCCGCTGCCG ATCCCTACCC TGAACTCGAG CAGCAGAATA ATCTCGTCTA TATCGTTGAT TACGACATGA CGCAGGCTGA GGTGATAATT CTGTCGCGCG ATGATCTCTA CAGTCCCGAC ATGGTACCGG ACATCAGCCT GTTCAACGAA TACTACGGCG GAGGAATGTC GTCGGTTGTC TTTCAGGAAC TTCGTGAAGC AAAAGCCCTT GCCTATTCGG TATTCTCTGT TTACCGGACC CCGAAACTCA ACAACAGGCA CAACTATATC TTCAGCTATA TCGGCACCCA GTCCGACAAA CTGCCTGAAG CGCTTGAAGG CATCCGGCAC CTTATGCACG AGCTGCCGAA ATCCCCTGAT CTCTTTGCCT CTGCAAAAAA CGGTATTCTT CAAAAAATAT CCACCGAAAG ACTGACAAGA ACCGAAGTTC TCTTCAACTA CGAAGAGGCG TGCCGTCTTG GAATTGACTA TGACATCCGG AAAAACATCT ATGAACATGC CGCAACCATG ACCCTGGAAG ACATTGAAAA ATTCCACCAG AAGCATTTCA GAGATAAAAA ACATGTCATG CTCGTTCTTG GCAAAAAAGA AAACCTCGAT ATGAACACCC TCAAAAAGTA TGGCGGGGTT AAACAGCTAT CCCTTGAAGA AATTTTCGGA TATTGA
|
Protein sequence | MPPICPTPAK TGIPALIFAM LLLNLISCTP MITDVKNYPY TTVPGDSLHT RIYTLKNGLT VYMSPYHDEP RIYTSIAVRA GSKNDPAETT GLAHYLEHML FKGTDSIGSL NYEKEHAELE KIIALYEEYR KSTDPAKRAA IYRDIDTLSN AAAQYTVPNE YDKLLNSIGA QGTNAYTWVE QTVYINDIPA NKLNQWLTIE AERFRNPVMR LFHTELETVY EEKNMTMDSD SRKIWESLFA GLFKTHTYGT QTTIGKAEHL KNPSIKNVLE YYRTYYVPNN MALCIAGDFD PDETIKLIDN KFSLLEPKAI PVFTPPVEPP ISKPIIEKVK GPEAEELVIG FRFNGVNSND TDYITLIDKI LFNQTAGIID LNLNQQQKVL DAGSMLVMMK DYSAHILSAK PREGQSLDQV KTLLLEQLEQ LKKGNFPDWL LVAAINDLKI EELKLYESNR GRVEAYVDAF IMGTEWNNYI SQIERLEKIT KEQLVAYAKI HYNDNYVAIY KQHGKEKSEA KIQKPPITPI KVNRDSSSTF AENLLAQHSE KTEPRFLDYA SDIGFFDVNQ NVRLHYLQNR ENELYSLYYV FDMGKSRNRK IDLALDYLSY LGSSGYTPAE FSQELYKIGA SFSAFTSDDF VYLQLSGLQK NFPAAIRLLE KLLTDARPDE SALQKLKAGI LKERADDKLS KKKILFEAMT NYGKYGASSP FTNVLSNSEL NSITSGDLLS EIQNLMEHGH RVLYYGPATS GEILSELHAV RHYPETFKPY PAADPYPELE QQNNLVYIVD YDMTQAEVII LSRDDLYSPD MVPDISLFNE YYGGGMSSVV FQELREAKAL AYSVFSVYRT PKLNNRHNYI FSYIGTQSDK LPEALEGIRH LMHELPKSPD LFASAKNGIL QKISTERLTR TEVLFNYEEA CRLGIDYDIR KNIYEHAATM TLEDIEKFHQ KHFRDKKHVM LVLGKKENLD MNTLKKYGGV KQLSLEEIFG Y
|
| |