Gene Cagg_1247 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_1247 
Symbol 
ID7266233 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp1525808 
End bp1528183 
Gene Length2376 bp 
Protein Length791 aa 
Translation table11 
GC content56% 
IMG OID643566089 
ProductPeptidase M23 
Protein accessionYP_002462591 
Protein GI219848158 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0739] Membrane proteins related to metalloendopeptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACCGAC TTGCCCGCTG GCTCATGATC TTGTGCTTAC TCGTCGCAGC TCCACCGGTA 
CACGCGCAGC CAGCGGGTAG GTGGAGTGTT GCCGAGTTTT TAGCCGATCA ACCGGGACGG
TTACGCGATC TCACCTTTGA TGGGCGTAGT GCAGCTCAGA TTATTGAGGA GCAGAGTAGC
TATTTTGGGA TCAGTCCGTT TTTGATCCTT GCGCTGCTCG AAGCTACTGC CGGTCTCCTG
AGTGATCCGT CACCGCCGGC AGACGCTATT TCGTACCCTT TCGGACGACA CGGCCCTGAG
GGTTTCGTCG CGCAAATTGA ATGGGTCAGT CGTGAATTGC GTGCCGGATT GGGGCCGTAC
CAACAATCGC CTACGTTGCG CCTAAGCGAT GGCTTAACGC TAACCCTGTC GCTCGATGAA
CCGGCAGAAT GGATTGCGAT CAAGCGTTTC TTGGCCCACG GTCGCGATTC GACGGCATGG
TTAGCAGCGG TAAAGGCTAC ACATACCGCA TTACGTACCT ATTTTGACGG GCAACTGGCT
CCTCCGGCAA CGGTTGCCTC GTCGAACACA GGTTGGTTGC GTGCGCCATG GCCTCTGGGT
ACACGTGTCG TCCATCTGGC CTATTTCGAT CACATGTACC CTATGGTCGA TCTTGGCAGT
GATGGCAATA GTGAGATGGT TGACTATCTC GGTCGGCGTA ATCTCCAATA CAACAGCCAT
GACGGTCACG ATTATGTCTT TCCAGATGCG CCGTTTACCA CACCTATTCT CGCCACCGCT
GCCGGAACAG CCTACGCTTT TAGCGAGGCG CGCGGACTTG GAGTAGTGAT TATTCATCCG
AACGGCTATG AAACCGTCTA CTGGCACTTG AGTGCGCTCG ATCCTATCTT CAACACCGGA
AACGGTGTGC CGGTCGTGGC CGGTCAGCCA ATTGGGGTCA GTGGTGCTAG TGGGGTAAGT
GGTACACCCC ATCTCCACTT TGAAGTCCGT CGTTGGGAGG GTGGGATACG TAAACAAGTC
GATCCGTATG GTTGGTATGG GACGGGAGTC GACCCTTGCC CTATGTACGC CGGTTGTGCG
ATAAGTACGT GGCTATGGCA TCCCGACCTC AGCGGCCAGT ATGACTTTAC CCCGCCCAAT
TACCCACCAC CACCTGGCGA CACGACGCCG CCGATTGGTA CAATGCGGGT AGCACCACCG
TCTGATCTAC TATTGGCCGC CACCTTTGAT GGGCATCCTC TCCAAACGGT TGGGCAGGGG
TTACCGCAGA TCAGCGGTGT ACTCAGCTTT GAAGCAGGGC GTTTTGGGCA AGCGTGGAAT
AGTGAGCGTG GTTCCTTAGC CTTCCCCACA ACCGGTAACC TCGATCTCGC GCAGGGGACA
ATTAGTTTGT GGGTAGACAT TCCGACCGAC TATCCGGTCA ATAGTCGCAA CCGTCATTAC
CTCTTCGCGA CGAGTGCTGA CCCAACGGGA GCACCGGTGT ATACGGGTAC CCTAGCACTG
CGACGAGACC GGTTAGGACC GCATGGAAGC GCCCAATGGA CATTCTGGAC GGTGCAGGAT
GCCGATAGGG GTGAGGATCA ATTGAGTGTA CCGGACACCT TATCGCCGGG ATGGCACCAT
TTTGCGGTGA GTTGGGAAGC AACGAGTGGC ACGAAAGCGC TTTATATCGA TGGTGTGCTG
GTTGCAGAGC GGAGCAATAC TGCATTCCCG ACAATTGCCG GCGCGCTCTT GCACCTCGGT
CGTTTCAGTA GTGATAGTCC AGGCGCCGGG GTGCGGGTTG ATGAACTGGC CGTCTACAAC
CGTCCGTTAA CAGCAGAGGA GATTGCCGAT TTAGCCAAAA AGCCGCCGCT TAGCACTGAG
CCGATACCGA TTACCGACCA ATTGATCCGG ATTGATACCA ACGCCCTCGA CGATAATGGT
GGCATTGCAG CGGTCATTCT TGGGATCAAC GATGAACTGA GCGATCCACT GCCGTACTAC
GACAGTTACC GATGGAGCTT GCCGGCGGTG AAGGGTGAGC ATATGGTGTC TGTCCAGTAT
ATTGATCGGG CGGGTAATAC AACCGTGGTT ACTCAGACGG TGCGAGTTAA TCTACCACCA
CACGTTGATG TGGATAGTGA ATGGCTCGAT CAAGTGACGG TGCGCTTGCG TTTTAACGTG
CGCGATGCTG AATTACCGGT CGAGATGCAA TTCAGCACAC AGCCCGACTT TGCCGCTACA
CCATGGTTGC CGCTGGTACC GGAGGTGCGC TGGCGATGGG AGGAAACCGA GCGACCACAC
CTCTTCGTGC GTTTTCGTGA TGCAGCCGGT CTGGTGAGTG TCCCGATAGA GATTACCCTA
CGCCATCAGG TGTTTATCCC GCTGGTGGGA CATTAG
 
Protein sequence
MHRLARWLMI LCLLVAAPPV HAQPAGRWSV AEFLADQPGR LRDLTFDGRS AAQIIEEQSS 
YFGISPFLIL ALLEATAGLL SDPSPPADAI SYPFGRHGPE GFVAQIEWVS RELRAGLGPY
QQSPTLRLSD GLTLTLSLDE PAEWIAIKRF LAHGRDSTAW LAAVKATHTA LRTYFDGQLA
PPATVASSNT GWLRAPWPLG TRVVHLAYFD HMYPMVDLGS DGNSEMVDYL GRRNLQYNSH
DGHDYVFPDA PFTTPILATA AGTAYAFSEA RGLGVVIIHP NGYETVYWHL SALDPIFNTG
NGVPVVAGQP IGVSGASGVS GTPHLHFEVR RWEGGIRKQV DPYGWYGTGV DPCPMYAGCA
ISTWLWHPDL SGQYDFTPPN YPPPPGDTTP PIGTMRVAPP SDLLLAATFD GHPLQTVGQG
LPQISGVLSF EAGRFGQAWN SERGSLAFPT TGNLDLAQGT ISLWVDIPTD YPVNSRNRHY
LFATSADPTG APVYTGTLAL RRDRLGPHGS AQWTFWTVQD ADRGEDQLSV PDTLSPGWHH
FAVSWEATSG TKALYIDGVL VAERSNTAFP TIAGALLHLG RFSSDSPGAG VRVDELAVYN
RPLTAEEIAD LAKKPPLSTE PIPITDQLIR IDTNALDDNG GIAAVILGIN DELSDPLPYY
DSYRWSLPAV KGEHMVSVQY IDRAGNTTVV TQTVRVNLPP HVDVDSEWLD QVTVRLRFNV
RDAELPVEMQ FSTQPDFAAT PWLPLVPEVR WRWEETERPH LFVRFRDAAG LVSVPIEITL
RHQVFIPLVG H