Gene Cphamn1_2031 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphamn1_2031 
Symbol 
ID6375724 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides BS1 
KingdomBacteria 
Replicon accessionNC_010831 
Strand
Start bp2186242 
End bp2189199 
Gene Length2958 bp 
Protein Length985 aa 
Translation table11 
GC content49% 
IMG OID642684522 
Productpeptidase M16 domain protein 
Protein accessionYP_001960422 
Protein GI189500952 
COG category[R] General function prediction only 
COG ID[COG0612] Predicted Zn-dependent peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.79996 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTCTGT TTCTTTCCAG CAGTACATAC ACCCCGCTTC TTAACCGCCT CGTCGCCATC 
TGTACATCCT GTATTGTCTT CTTTCATTTA CTATCCTGCA CTGCCATGAG CCCAGAAAAC
AAGTATTCAT ACGTTTCCAT TCCCGACGAT CCGCTGCATA CGAGAATCTA CACACTGGAA
AACGGTCTGA CGGTCTATAT GAGCCCGAAA AAGGATGAGC CGAGAATCTA TACCTCTATC
GCCGTCAGGG CAGGGAGCAA AAACGATCCG GCGGAAACAA CCGGTCTTGC GCATTACCTC
GAACATATGC TGTTCAAAGG TACAGACTCA ATCGGCGCCC TCGACTATGA TCAGGAAAAA
ATCGAACTGC AGAAAATCAT CGACCTCTAT GAAGAGTACC GCTCTACGGA TGACCCTGAT
AAAAGGGCCG ACATCTACCG TCAGATAGAC AGCACCTCCA ATTTCGCGGC AAAGCTCACC
ATTCCCAACG AATACGACAA GCTGCTCAGT TCGATAGGAG CCAGAGGGAC AAACGCCTAT
ACCTGGGTAG AACAGACTGT CTACCTGAAT GACATTCCGT CGAACCAGCT TGAAAAGTGG
CTTTCGATAG AAGCCGAACG GTTCCGCAGC CCGGTAATGC GGCTTTTCCA TACGGAACTT
GAAACTGTCT ATGAAGAGAA AAACATGACA ATGGACAGTG ACAGCCGGAA AATCTGGGAA
AATCTTTTTG CAGGACTTTT CAGAAATCAT ACCTACGGCA CGCAGACAAC CATCGGTGAA
GCTGAACACC TGAAAAACCC TTCGATTAAA AATGTCATAG AATACTACCG CGCCTGGTAT
GTGCCGAACA ATATGGCGAT CTGCTTGGCC GGTGACTTTG ATCCGGATGA AACCATCAAA
CTGATCGATG AAAAATTTTC CGCCCTTGTA CCGGGAGAGA TACCGGCATT CACCCCGGCG
GGGGAAGATC CGATCATCAA GCCGGAAATA ACCCGGGTCA AAGGGCCTGA AGCCGAAGAA
CTTGTGATGG GATTCAGGTT TGGCGGTTCA GGTTCCAGAG ATATGGATAT CCTGACGCTC
ATTGATAAAA TCCTCTATAA CCATACTGCC GGACTTATCG ATCTCAGCCT CAATCAGGAA
CAGAGAGTAC TTGACGCCGG ATCGATGGTG GTCGAGATGA AGGATTACTC GGCACATATC
CTGAGTGCCA AACCCCGTGA AGGACAGAGC CTCGACGAGG TCAGGGATCT GCTTCTTGAA
CAGATTGAAA AAGTAAGAAC GGGAGATTTC CCGGACTGGC TGCTTGAAGC GGTCATCAAC
GATCTGAAGC TTGAAGAGCT CAAAACCTAT GAATCCAACA AAGGCCGCGC AGAGAGCTTT
GTCGACGCAT TTGTTCTCGG CCTTGACTGG AGCAGCGTGG AAGGCAGGAT CGACAGGCTG
AACCGTATCA CGAAAAAGGA AATCGTCGAA TTCGCACAGG AACGTTACGG AGAAAACTAT
ATCGCGATCT ACAAGGAACA CGGAAAGGAT GCCCTGGCAC CCAAGATCGA GAAACCGCCG
ATAACCCCCC TCACGGTCAA TCGGGACAGA ACGTCAACAT TCGCTGAAAA CATACTCTCA
CAAAAAACCG GAGAGATTGA CCCTGTCTTT GTTGATTTCG ACAAGGACAT ACAATCACTT
GAGATAACCT CCGGGACACC ACTTTTTACG GTCACAAACA CCGACAACAC GCTCTTTTCG
CTCTACTACG TTTTCGATAC CGGTACCAAT CATTCGAAAA CGATCGATCT GGCTCTGGAC
TATCTGGGTT ATCTCGGAAC ATCGGGACAC AGCCCGGCTG AATTCAGCCA GGAGATGTAC
AAAATCGGCG CAAGTTTCTC GGCATTCACT TCTGACGACC ACCTCTACCT GAAACTATCC
GGTCTGCAGG AAAACTTCGA TGCCGCGCTC GATATGCTCG AAGAGCTTCT GACAGATGCG
CAGCCCAACA CCGAAGCGCT TGAAAAACTC AAGGCAGGCG TGCTCAAGGA ACGCGCGGAC
GACAAGCTCT CCAAGCGAAA AATTCTGTTT GAGGCCATGT ATAACTTCGG AAGATACGGC
TCATCATCAC CCTTCACCAA CGTCCTTGAC AACAAGGAAC TGCAACAGAT ATCTTCAGAG
GAACTTCTCG AGGAAATCGA TACGCTGATT CACTACCGCC ACAGAGTCTT GTACTATGGC
CCTGAAAAGC CGGAAAACAT TGCCGGAAAA CTTAGCGGTC TACCGCACCT TAAAGAGAAA
CTCAACCCTC TCCCTGCCTC TGAACCCTTC AGGGAAATCG GGCAGGAAGA AAGCCGGGTC
TACGTTGTAG ACTATGATAT GACACAGGCC GAACTGTTGA TGCTGTCAAG AGACAGGCTC
TACGATGCCC AGGAGGTTCC GCTGATCACG CTTTTCAACG AGTATTACGG CGGCGGCATG
TCATCTGTCG TGTTTCAGGA ACTTCGTGAA GCCAAAGCGC TGGCTTATTC AGTATTTTCC
GTATACCGCA TCCCCAGAGA CAAGGATGAG CATCATTACA TCTTCAGCTA TATAGGCACC
CAGGCAGACA AACTCCCTGA AGCCCTTGAC GGCATCACGG AACTGCTTGA AAACCTTCCT
GAATCTCCGG ACCTGCTGGC TACGGCAAAA GAGGCAATAC GCGGCAAAAT CCGCACGGAT
CGCATCACGA AATCAAAAAT ACTCTTTACC AGAGAGGAAG CGGAAAAACT CGGCCTGAAC
CATGATATCC GCAAAGATAT CTTTGAAAAA GTCGACCGGT TCGGCTTTAA CGATATTGCC
GCTTTCCACA AAGACCGGTT TGCAGATAAA CGGTATACGC TGCTGGTCCT GGGCAAAAAA
GAAAACCTTG ACATGGAAAC CCTCGGCCGG TTCGGCACAG TGACCAGCCT GACCCTTGAG
GACGTTTTCG GTTACTGA
 
Protein sequence
MILFLSSSTY TPLLNRLVAI CTSCIVFFHL LSCTAMSPEN KYSYVSIPDD PLHTRIYTLE 
NGLTVYMSPK KDEPRIYTSI AVRAGSKNDP AETTGLAHYL EHMLFKGTDS IGALDYDQEK
IELQKIIDLY EEYRSTDDPD KRADIYRQID STSNFAAKLT IPNEYDKLLS SIGARGTNAY
TWVEQTVYLN DIPSNQLEKW LSIEAERFRS PVMRLFHTEL ETVYEEKNMT MDSDSRKIWE
NLFAGLFRNH TYGTQTTIGE AEHLKNPSIK NVIEYYRAWY VPNNMAICLA GDFDPDETIK
LIDEKFSALV PGEIPAFTPA GEDPIIKPEI TRVKGPEAEE LVMGFRFGGS GSRDMDILTL
IDKILYNHTA GLIDLSLNQE QRVLDAGSMV VEMKDYSAHI LSAKPREGQS LDEVRDLLLE
QIEKVRTGDF PDWLLEAVIN DLKLEELKTY ESNKGRAESF VDAFVLGLDW SSVEGRIDRL
NRITKKEIVE FAQERYGENY IAIYKEHGKD ALAPKIEKPP ITPLTVNRDR TSTFAENILS
QKTGEIDPVF VDFDKDIQSL EITSGTPLFT VTNTDNTLFS LYYVFDTGTN HSKTIDLALD
YLGYLGTSGH SPAEFSQEMY KIGASFSAFT SDDHLYLKLS GLQENFDAAL DMLEELLTDA
QPNTEALEKL KAGVLKERAD DKLSKRKILF EAMYNFGRYG SSSPFTNVLD NKELQQISSE
ELLEEIDTLI HYRHRVLYYG PEKPENIAGK LSGLPHLKEK LNPLPASEPF REIGQEESRV
YVVDYDMTQA ELLMLSRDRL YDAQEVPLIT LFNEYYGGGM SSVVFQELRE AKALAYSVFS
VYRIPRDKDE HHYIFSYIGT QADKLPEALD GITELLENLP ESPDLLATAK EAIRGKIRTD
RITKSKILFT REEAEKLGLN HDIRKDIFEK VDRFGFNDIA AFHKDRFADK RYTLLVLGKK
ENLDMETLGR FGTVTSLTLE DVFGY