Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphamn1_2031 |
Symbol | |
ID | 6375724 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | + |
Start bp | 2186242 |
End bp | 2189199 |
Gene Length | 2958 bp |
Protein Length | 985 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 642684522 |
Product | peptidase M16 domain protein |
Protein accession | YP_001960422 |
Protein GI | 189500952 |
COG category | [R] General function prediction only |
COG ID | [COG0612] Predicted Zn-dependent peptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.79996 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTCTGT TTCTTTCCAG CAGTACATAC ACCCCGCTTC TTAACCGCCT CGTCGCCATC TGTACATCCT GTATTGTCTT CTTTCATTTA CTATCCTGCA CTGCCATGAG CCCAGAAAAC AAGTATTCAT ACGTTTCCAT TCCCGACGAT CCGCTGCATA CGAGAATCTA CACACTGGAA AACGGTCTGA CGGTCTATAT GAGCCCGAAA AAGGATGAGC CGAGAATCTA TACCTCTATC GCCGTCAGGG CAGGGAGCAA AAACGATCCG GCGGAAACAA CCGGTCTTGC GCATTACCTC GAACATATGC TGTTCAAAGG TACAGACTCA ATCGGCGCCC TCGACTATGA TCAGGAAAAA ATCGAACTGC AGAAAATCAT CGACCTCTAT GAAGAGTACC GCTCTACGGA TGACCCTGAT AAAAGGGCCG ACATCTACCG TCAGATAGAC AGCACCTCCA ATTTCGCGGC AAAGCTCACC ATTCCCAACG AATACGACAA GCTGCTCAGT TCGATAGGAG CCAGAGGGAC AAACGCCTAT ACCTGGGTAG AACAGACTGT CTACCTGAAT GACATTCCGT CGAACCAGCT TGAAAAGTGG CTTTCGATAG AAGCCGAACG GTTCCGCAGC CCGGTAATGC GGCTTTTCCA TACGGAACTT GAAACTGTCT ATGAAGAGAA AAACATGACA ATGGACAGTG ACAGCCGGAA AATCTGGGAA AATCTTTTTG CAGGACTTTT CAGAAATCAT ACCTACGGCA CGCAGACAAC CATCGGTGAA GCTGAACACC TGAAAAACCC TTCGATTAAA AATGTCATAG AATACTACCG CGCCTGGTAT GTGCCGAACA ATATGGCGAT CTGCTTGGCC GGTGACTTTG ATCCGGATGA AACCATCAAA CTGATCGATG AAAAATTTTC CGCCCTTGTA CCGGGAGAGA TACCGGCATT CACCCCGGCG GGGGAAGATC CGATCATCAA GCCGGAAATA ACCCGGGTCA AAGGGCCTGA AGCCGAAGAA CTTGTGATGG GATTCAGGTT TGGCGGTTCA GGTTCCAGAG ATATGGATAT CCTGACGCTC ATTGATAAAA TCCTCTATAA CCATACTGCC GGACTTATCG ATCTCAGCCT CAATCAGGAA CAGAGAGTAC TTGACGCCGG ATCGATGGTG GTCGAGATGA AGGATTACTC GGCACATATC CTGAGTGCCA AACCCCGTGA AGGACAGAGC CTCGACGAGG TCAGGGATCT GCTTCTTGAA CAGATTGAAA AAGTAAGAAC GGGAGATTTC CCGGACTGGC TGCTTGAAGC GGTCATCAAC GATCTGAAGC TTGAAGAGCT CAAAACCTAT GAATCCAACA AAGGCCGCGC AGAGAGCTTT GTCGACGCAT TTGTTCTCGG CCTTGACTGG AGCAGCGTGG AAGGCAGGAT CGACAGGCTG AACCGTATCA CGAAAAAGGA AATCGTCGAA TTCGCACAGG AACGTTACGG AGAAAACTAT ATCGCGATCT ACAAGGAACA CGGAAAGGAT GCCCTGGCAC CCAAGATCGA GAAACCGCCG ATAACCCCCC TCACGGTCAA TCGGGACAGA ACGTCAACAT TCGCTGAAAA CATACTCTCA CAAAAAACCG GAGAGATTGA CCCTGTCTTT GTTGATTTCG ACAAGGACAT ACAATCACTT GAGATAACCT CCGGGACACC ACTTTTTACG GTCACAAACA CCGACAACAC GCTCTTTTCG CTCTACTACG TTTTCGATAC CGGTACCAAT CATTCGAAAA CGATCGATCT GGCTCTGGAC TATCTGGGTT ATCTCGGAAC ATCGGGACAC AGCCCGGCTG AATTCAGCCA GGAGATGTAC AAAATCGGCG CAAGTTTCTC GGCATTCACT TCTGACGACC ACCTCTACCT GAAACTATCC GGTCTGCAGG AAAACTTCGA TGCCGCGCTC GATATGCTCG AAGAGCTTCT GACAGATGCG CAGCCCAACA CCGAAGCGCT TGAAAAACTC AAGGCAGGCG TGCTCAAGGA ACGCGCGGAC GACAAGCTCT CCAAGCGAAA AATTCTGTTT GAGGCCATGT ATAACTTCGG AAGATACGGC TCATCATCAC CCTTCACCAA CGTCCTTGAC AACAAGGAAC TGCAACAGAT ATCTTCAGAG GAACTTCTCG AGGAAATCGA TACGCTGATT CACTACCGCC ACAGAGTCTT GTACTATGGC CCTGAAAAGC CGGAAAACAT TGCCGGAAAA CTTAGCGGTC TACCGCACCT TAAAGAGAAA CTCAACCCTC TCCCTGCCTC TGAACCCTTC AGGGAAATCG GGCAGGAAGA AAGCCGGGTC TACGTTGTAG ACTATGATAT GACACAGGCC GAACTGTTGA TGCTGTCAAG AGACAGGCTC TACGATGCCC AGGAGGTTCC GCTGATCACG CTTTTCAACG AGTATTACGG CGGCGGCATG TCATCTGTCG TGTTTCAGGA ACTTCGTGAA GCCAAAGCGC TGGCTTATTC AGTATTTTCC GTATACCGCA TCCCCAGAGA CAAGGATGAG CATCATTACA TCTTCAGCTA TATAGGCACC CAGGCAGACA AACTCCCTGA AGCCCTTGAC GGCATCACGG AACTGCTTGA AAACCTTCCT GAATCTCCGG ACCTGCTGGC TACGGCAAAA GAGGCAATAC GCGGCAAAAT CCGCACGGAT CGCATCACGA AATCAAAAAT ACTCTTTACC AGAGAGGAAG CGGAAAAACT CGGCCTGAAC CATGATATCC GCAAAGATAT CTTTGAAAAA GTCGACCGGT TCGGCTTTAA CGATATTGCC GCTTTCCACA AAGACCGGTT TGCAGATAAA CGGTATACGC TGCTGGTCCT GGGCAAAAAA GAAAACCTTG ACATGGAAAC CCTCGGCCGG TTCGGCACAG TGACCAGCCT GACCCTTGAG GACGTTTTCG GTTACTGA
|
Protein sequence | MILFLSSSTY TPLLNRLVAI CTSCIVFFHL LSCTAMSPEN KYSYVSIPDD PLHTRIYTLE NGLTVYMSPK KDEPRIYTSI AVRAGSKNDP AETTGLAHYL EHMLFKGTDS IGALDYDQEK IELQKIIDLY EEYRSTDDPD KRADIYRQID STSNFAAKLT IPNEYDKLLS SIGARGTNAY TWVEQTVYLN DIPSNQLEKW LSIEAERFRS PVMRLFHTEL ETVYEEKNMT MDSDSRKIWE NLFAGLFRNH TYGTQTTIGE AEHLKNPSIK NVIEYYRAWY VPNNMAICLA GDFDPDETIK LIDEKFSALV PGEIPAFTPA GEDPIIKPEI TRVKGPEAEE LVMGFRFGGS GSRDMDILTL IDKILYNHTA GLIDLSLNQE QRVLDAGSMV VEMKDYSAHI LSAKPREGQS LDEVRDLLLE QIEKVRTGDF PDWLLEAVIN DLKLEELKTY ESNKGRAESF VDAFVLGLDW SSVEGRIDRL NRITKKEIVE FAQERYGENY IAIYKEHGKD ALAPKIEKPP ITPLTVNRDR TSTFAENILS QKTGEIDPVF VDFDKDIQSL EITSGTPLFT VTNTDNTLFS LYYVFDTGTN HSKTIDLALD YLGYLGTSGH SPAEFSQEMY KIGASFSAFT SDDHLYLKLS GLQENFDAAL DMLEELLTDA QPNTEALEKL KAGVLKERAD DKLSKRKILF EAMYNFGRYG SSSPFTNVLD NKELQQISSE ELLEEIDTLI HYRHRVLYYG PEKPENIAGK LSGLPHLKEK LNPLPASEPF REIGQEESRV YVVDYDMTQA ELLMLSRDRL YDAQEVPLIT LFNEYYGGGM SSVVFQELRE AKALAYSVFS VYRIPRDKDE HHYIFSYIGT QADKLPEALD GITELLENLP ESPDLLATAK EAIRGKIRTD RITKSKILFT REEAEKLGLN HDIRKDIFEK VDRFGFNDIA AFHKDRFADK RYTLLVLGKK ENLDMETLGR FGTVTSLTLE DVFGY
|
| |