Gene Cagg_3100 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_3100 
Symbol 
ID7269517 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp3761912 
End bp3764776 
Gene Length2865 bp 
Protein Length954 aa 
Translation table11 
GC content56% 
IMG OID643567920 
ProductPeptidase M23 
Protein accessionYP_002464394 
Protein GI219849961 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0739] Membrane proteins related to metalloendopeptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAATCGA GCAAGCCAGA ACCTTTTCTG AACCGGTTAC TGTTCGGAAC GGTCGTGTTG 
GCAGTGACGA GTATCCTCTT CGTCCAGTTC TTCTTCACGC TCCGCAGTGA GACCGCCAGC
GCTTCACCTG CACTTAATCC TGCGTTAGAA CAGGCAGTGA GCGCGGCAGT CAGTGCGCAA
CACCCGAATG CGGCAACAGC GCAGATCATT GTCTCACCAA TTTCACAAGT CAAAGCATGG
GTCTTCGGTT CCGTTGCAAT TATTCCCCAA CCCGCCTCGG ATAACGCACC AGATGAAGAG
ACGGCTCCAC GTATCCTGCT CTTCTTTGCC GAGCAAAGCA ACGGCACCTG GCAAGTAGCG
CTCGAACACT CTGCTGCCTT TGAGCAAGCG CTCACGCAAA TCCCCAGAAA CCTCTTGACG
CCGGAACAGT GGTCAACAAT GCCAACGACG GCGCAATTGA GCGGTGATGG GTCAATGGAC
CTCAACCTGC CATGGGCGGT TGGCGAGACG TGGTGGTTAA CCAGTGGCCC ACACGGTATT
TCAAGGGCAG CTCTCGATTT TGCTGTCATT GGTGGTCGTG TACGAGCCGC ACGAGAGGGA
GTAGCCTATA CTCCATGCGG TCAAAGTTCG GATCTCGTGC GGATCGACCA CCCCGGCGGT
TTTAGCACCA ATTATTACCA CCTTTCGGGG ATTGCTGTCG CTAACGGGAC TCCGGTCGAT
CGGTACACCT ATATCGGTAT GATTGGCACC GGTACTAGAT GTCTCGGTAG TTATGTGACC
GGTGCGCACG TTCATTTTTG GATCTCGCGT GGCGGGACGC GGATACCAAT TGACGGCATT
GACATCGGTG GCTGGACTGT CAGTGCGCTT CCCGGCGATT TCAACGGTTG TATGACCCGA
ATTCGCGACG GGAATCGTCA ATGTGTTCGC AATGACAGTA ACCTCAATGT GATTGAAGGA
CCCGGACTGA TCACCAATGA AGGCGCCACT GGAAGTGGTA ATATTAAACC CAACCCGCCC
CGTCTCGTTA GCCCGGAAGA TGGCGCAATC ATCAGAACGA ATACCGTTAC TTTACACATG
GAAGATGCCG GCGACCCCGA TAACGGACCG CGTTCTTGGC GTGATTACTT CTTCATCATT
ACTAAATCTG ACAACTCCTG GCGTGTCGAA TCAGGCTGGA TCACAAGCAC CAGTTGGACC
GTCGAACTGC CGGGCGAGGG GAGCTATACT TGGACGGTCC ACGCCGGGGA CGGAGCTGCG
ATGAGTGATC CTGCTCCCCT GCGCACATTT ACCTACATCC CCAACCGCCC ACCCAACCCG
CCACACCTCA TTAGCCCAGA AAACGGTGCA ACCATCAGAA CGGCCAGTGT CACCCTCCAA
GTGGCCGATG CCGGCGACCC CGACAACTGG CCTGCTCCCC AGCGTAATTA CAGCTTCCAC
ATCGCTAAAT CGGATAACTC GTGGCGAACT GACTCCGGCT GGATCACAAG CACCAGTTGG
ACCGTCCAAC TACCGGGGGA GGGCAGCTAT ACTTGGACAG TCTACGCCGG GGACGGAGCT
GCGATGAGTG ATCCTGCTCC CCTGCGCACA TTTACCTACA CCCCCAACCG CCCACCCAAC
CCGCCTAATC TGATTAGTCC AGATCACAAC GCGACCGTCA ATACGGCTAC CGTTACCTTG
CAAGCGTCCG ACGGTGGCGA TCCGGACAAT TGGCCGAACT CAAGCCGTAA ATTCCGCTTC
ATCATCACCT CATCTGACAA CTCATGGCAA GCTAACTCCG GCTGGATTGA TAGCCCAAGC
TGGACAGTGC AATTGCCAAA AATAGGGGTG TATTCGTGGT ACGCACAAAC CAACGATGGC
GCGGCTGACA GTGCCAGCAG CAGCCGGCGG ACGTTATATT ACGCTGTAGT CGCTCCGATA
CCAACGCCAC CACCACCACC AACTCCAGCG CCGAACGTCT GGCGAGTACC CTATTACTCG
CAGTACGATC CGCGATGGAG TGGGCAAATG ATGAACCGGC CCTCGTATGC TACGGACTGT
AATTACACTA TCGGCCGAAT CGGGTGCGCC TTGACATCGC TCACTATGGT CGCGCGTTAC
TATGGAGTTG ACCACAACCC AGGAACGATG AACTCTTGTT TGGGAGCGCA TGCATGTCCG
TTGTACTGGT CAAGCCCGCA GGTGCGTACT TGCACCAACA ATAAGCTGCG CTGGGTGAAA
TGGCCAGCAT TCAGCTATCC CAATCTGGAG GCGGAACTAA GAAAAGGTCC GGTGATTCTC
GAATTGCAAA AAAGCGACGG TAATATGCAC TTCATCGTGG TTCTGGGCGG TAGCGGCTCG
AATCCGGCCA ATTACACCGT CCACGATCCG GGATTACGCA ATGGTGCATA CACGACGCTG
GCCAACGCAC TGAGCTACTG GAGAAACTAT CGGCCAAGCA GTATGCGTAT CTACACCGGT
ACGCCGGCGC CGCTGCCAGC CAGCCTTGCC GAAGAGGCGA CCGATTCTGC GAGCCAAGCC
GCAGAGGTTG ACCCAATCGA ACCGCTCGCC AGTCCGCCTT TGCTCAATGG CACCACTATT
ACTGGTGCGA TTGCCCCATA TCGGAATACC AGCACAGAGA TGGTGCTTGA ACTCGCCGCT
CAAAGTGCAG CCGGCACCGT GACAGAGATG CGAGTTTGGA CGAATGCCGA ACAGAGTAAC
ATCTGGCAAC CGTTCAGCCG ATATGTCACA GCACCACTGG CCGGCACGTA TTATGTTCAG
TTCCGCGACT CGGCTGGGAA CACTTCGGTG GTATTCTCAA CCGGTTTGCC GCAAGCGCGC
CGACTGAACA TCTACACCAC GTTCATCCCG ATGACTGCAC GCTAG
 
Protein sequence
MQSSKPEPFL NRLLFGTVVL AVTSILFVQF FFTLRSETAS ASPALNPALE QAVSAAVSAQ 
HPNAATAQII VSPISQVKAW VFGSVAIIPQ PASDNAPDEE TAPRILLFFA EQSNGTWQVA
LEHSAAFEQA LTQIPRNLLT PEQWSTMPTT AQLSGDGSMD LNLPWAVGET WWLTSGPHGI
SRAALDFAVI GGRVRAAREG VAYTPCGQSS DLVRIDHPGG FSTNYYHLSG IAVANGTPVD
RYTYIGMIGT GTRCLGSYVT GAHVHFWISR GGTRIPIDGI DIGGWTVSAL PGDFNGCMTR
IRDGNRQCVR NDSNLNVIEG PGLITNEGAT GSGNIKPNPP RLVSPEDGAI IRTNTVTLHM
EDAGDPDNGP RSWRDYFFII TKSDNSWRVE SGWITSTSWT VELPGEGSYT WTVHAGDGAA
MSDPAPLRTF TYIPNRPPNP PHLISPENGA TIRTASVTLQ VADAGDPDNW PAPQRNYSFH
IAKSDNSWRT DSGWITSTSW TVQLPGEGSY TWTVYAGDGA AMSDPAPLRT FTYTPNRPPN
PPNLISPDHN ATVNTATVTL QASDGGDPDN WPNSSRKFRF IITSSDNSWQ ANSGWIDSPS
WTVQLPKIGV YSWYAQTNDG AADSASSSRR TLYYAVVAPI PTPPPPPTPA PNVWRVPYYS
QYDPRWSGQM MNRPSYATDC NYTIGRIGCA LTSLTMVARY YGVDHNPGTM NSCLGAHACP
LYWSSPQVRT CTNNKLRWVK WPAFSYPNLE AELRKGPVIL ELQKSDGNMH FIVVLGGSGS
NPANYTVHDP GLRNGAYTTL ANALSYWRNY RPSSMRIYTG TPAPLPASLA EEATDSASQA
AEVDPIEPLA SPPLLNGTTI TGAIAPYRNT STEMVLELAA QSAAGTVTEM RVWTNAEQSN
IWQPFSRYVT APLAGTYYVQ FRDSAGNTSV VFSTGLPQAR RLNIYTTFIP MTAR