Gene Cpha266_2025 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpha266_2025 
Symbol 
ID4569145 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides DSM 266 
KingdomBacteria 
Replicon accessionNC_008639 
Strand
Start bp2339043 
End bp2341988 
Gene Length2946 bp 
Protein Length981 aa 
Translation table11 
GC content48% 
IMG OID639766606 
Productpeptidase M16 domain-containing protein 
Protein accessionYP_912461 
Protein GI119357817 
COG category[R] General function prediction only 
COG ID[COG0612] Predicted Zn-dependent peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0310906 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCGCCAA TTTGTCCAAC ACCAGCAAAA ACAGGTATTC CTGCTCTTAT TTTTGCGATG 
CTCCTCCTGA ACCTTATTTC ATGCACACCA ATGATAACCG ACGTAAAAAA TTATCCTTAC
ACAACCGTTC CTGGCGACTC TCTTCACACA CGAATCTATA CACTGAAAAA CGGGCTGACG
GTTTACATGA GTCCCTATCA TGACGAACCA CGCATCTACA CCTCTATTGC GGTCAGAGCC
GGCAGTAAAA ACGATCCCGC CGAAACAACC GGTCTTGCCC ACTATCTCGA ACACATGCTG
TTCAAGGGAA CCGACTCGAT CGGATCGCTG AATTATGAAA AAGAACATGC CGAACTCGAA
AAAATCATCG CCCTGTACGA GGAGTACAGA AAATCAACGG ATCCGGCAAA AAGAGCCGCT
ATCTACCGGG ATATCGACAC GCTCTCCAAT GCCGCAGCAC AATATACCGT ACCCAATGAA
TACGACAAGC TGCTGAATTC CATCGGCGCA CAGGGAACCA ACGCTTACAC CTGGGTTGAA
CAGACCGTCT ACATCAACGA CATTCCTGCA AACAAGCTCA ATCAGTGGCT TACCATAGAG
GCTGAACGGT TCCGCAATCC GGTGATGCGG TTATTTCATA CCGAACTTGA AACCGTCTAT
GAAGAAAAAA ATATGACGAT GGACAGCGAC AGCCGCAAGA TATGGGAGAG CCTCTTTGCC
GGACTCTTCA AAACACACAC TTACGGAACC CAGACAACGA TTGGCAAAGC CGAGCATCTT
AAAAATCCTT CCATAAAAAA CGTTCTGGAA TATTATCGCA CCTACTACGT GCCAAACAAC
ATGGCGCTCT GTATTGCCGG TGATTTTGAC CCCGATGAAA CCATCAAGCT CATCGATAAC
AAGTTCTCTC TTCTTGAACC GAAAGCGATC CCGGTATTTA CGCCGCCGGT CGAACCCCCG
ATAAGCAAGC CGATCATCGA AAAAGTCAAA GGCCCCGAAG CTGAGGAGCT GGTAATAGGA
TTTCGTTTCA ACGGAGTCAA CAGCAATGAT ACAGATTATA TTACTCTGAT CGACAAAATT
CTCTTCAATC AGACCGCCGG CATCATCGAC CTGAATCTGA ACCAGCAGCA AAAAGTACTT
GACGCAGGCT CCATGCTGGT CATGATGAAA GATTATTCCG CGCATATCCT GAGCGCCAAA
CCCAGAGAGG GGCAAAGCCT CGACCAGGTT AAAACACTGC TGCTTGAGCA ACTCGAACAA
CTCAAAAAAG GCAATTTCCC CGACTGGCTG CTCGTTGCCG CAATTAATGA CCTTAAAATC
GAAGAACTTA AACTCTATGA AAGCAATCGG GGAAGAGTCG AGGCCTATGT CGACGCCTTT
ATCATGGGAA CTGAATGGAA TAACTACATC AGCCAGATTG AACGACTTGA AAAAATCACT
AAAGAACAGC TTGTGGCTTA TGCGAAAATA CACTACAACG ACAACTATGT AGCGATCTAC
AAGCAGCATG GCAAGGAAAA AAGCGAAGCG AAAATCCAGA AGCCACCGAT CACGCCAATC
AAGGTCAATC GGGACAGCTC CTCAACGTTC GCCGAAAACC TGCTTGCGCA GCATTCGGAA
AAAACGGAAC CCCGGTTCCT CGACTATGCA AGCGATATCG GATTTTTCGA TGTAAACCAA
AACGTCAGGC TCCACTACCT GCAAAACCGG GAAAACGAGC TCTATTCGCT CTACTACGTC
TTCGATATGG GAAAAAGCCG AAACAGAAAA ATCGATCTTG CGCTCGATTA CCTCTCTTAC
CTCGGCTCGT CCGGTTACAC TCCTGCGGAA TTCAGTCAGG AACTCTATAA AATCGGAGCA
AGCTTTTCGG CATTTACCTC TGACGATTTC GTCTATCTTC AACTCTCGGG ACTGCAGAAA
AACTTTCCCG CCGCGATCAG GCTGCTTGAA AAACTGCTCA CCGACGCTCG CCCCGACGAA
AGCGCGCTGC AAAAACTCAA GGCAGGCATC CTCAAGGAAA GAGCCGACGA CAAGCTTTCA
AAAAAGAAAA TCCTGTTTGA AGCCATGACG AACTACGGCA AATACGGAGC CTCGTCACCC
TTTACCAACG TACTTTCAAA CAGCGAACTC AACTCGATAA CCTCAGGCGA CCTGCTCAGC
GAAATACAAA ACCTGATGGA ACACGGTCAT CGGGTGCTCT ACTACGGACC GGCAACTTCC
GGCGAAATCC TTTCGGAACT GCACGCAGTC CGACACTACC CCGAAACGTT CAAGCCCTAT
CCCGCTGCCG ATCCCTACCC TGAACTCGAG CAGCAGAATA ATCTCGTCTA TATCGTTGAT
TACGACATGA CGCAGGCTGA GGTGATAATT CTGTCGCGCG ATGATCTCTA CAGTCCCGAC
ATGGTACCGG ACATCAGCCT GTTCAACGAA TACTACGGCG GAGGAATGTC GTCGGTTGTC
TTTCAGGAAC TTCGTGAAGC AAAAGCCCTT GCCTATTCGG TATTCTCTGT TTACCGGACC
CCGAAACTCA ACAACAGGCA CAACTATATC TTCAGCTATA TCGGCACCCA GTCCGACAAA
CTGCCTGAAG CGCTTGAAGG CATCCGGCAC CTTATGCACG AGCTGCCGAA ATCCCCTGAT
CTCTTTGCCT CTGCAAAAAA CGGTATTCTT CAAAAAATAT CCACCGAAAG ACTGACAAGA
ACCGAAGTTC TCTTCAACTA CGAAGAGGCG TGCCGTCTTG GAATTGACTA TGACATCCGG
AAAAACATCT ATGAACATGC CGCAACCATG ACCCTGGAAG ACATTGAAAA ATTCCACCAG
AAGCATTTCA GAGATAAAAA ACATGTCATG CTCGTTCTTG GCAAAAAAGA AAACCTCGAT
ATGAACACCC TCAAAAAGTA TGGCGGGGTT AAACAGCTAT CCCTTGAAGA AATTTTCGGA
TATTGA
 
Protein sequence
MPPICPTPAK TGIPALIFAM LLLNLISCTP MITDVKNYPY TTVPGDSLHT RIYTLKNGLT 
VYMSPYHDEP RIYTSIAVRA GSKNDPAETT GLAHYLEHML FKGTDSIGSL NYEKEHAELE
KIIALYEEYR KSTDPAKRAA IYRDIDTLSN AAAQYTVPNE YDKLLNSIGA QGTNAYTWVE
QTVYINDIPA NKLNQWLTIE AERFRNPVMR LFHTELETVY EEKNMTMDSD SRKIWESLFA
GLFKTHTYGT QTTIGKAEHL KNPSIKNVLE YYRTYYVPNN MALCIAGDFD PDETIKLIDN
KFSLLEPKAI PVFTPPVEPP ISKPIIEKVK GPEAEELVIG FRFNGVNSND TDYITLIDKI
LFNQTAGIID LNLNQQQKVL DAGSMLVMMK DYSAHILSAK PREGQSLDQV KTLLLEQLEQ
LKKGNFPDWL LVAAINDLKI EELKLYESNR GRVEAYVDAF IMGTEWNNYI SQIERLEKIT
KEQLVAYAKI HYNDNYVAIY KQHGKEKSEA KIQKPPITPI KVNRDSSSTF AENLLAQHSE
KTEPRFLDYA SDIGFFDVNQ NVRLHYLQNR ENELYSLYYV FDMGKSRNRK IDLALDYLSY
LGSSGYTPAE FSQELYKIGA SFSAFTSDDF VYLQLSGLQK NFPAAIRLLE KLLTDARPDE
SALQKLKAGI LKERADDKLS KKKILFEAMT NYGKYGASSP FTNVLSNSEL NSITSGDLLS
EIQNLMEHGH RVLYYGPATS GEILSELHAV RHYPETFKPY PAADPYPELE QQNNLVYIVD
YDMTQAEVII LSRDDLYSPD MVPDISLFNE YYGGGMSSVV FQELREAKAL AYSVFSVYRT
PKLNNRHNYI FSYIGTQSDK LPEALEGIRH LMHELPKSPD LFASAKNGIL QKISTERLTR
TEVLFNYEEA CRLGIDYDIR KNIYEHAATM TLEDIEKFHQ KHFRDKKHVM LVLGKKENLD
MNTLKKYGGV KQLSLEEIFG Y