Gene Cagg_0786 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_0786 
Symbol 
ID7268105 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp973107 
End bp975521 
Gene Length2415 bp 
Protein Length804 aa 
Translation table11 
GC content55% 
IMG OID643565637 
Productpeptidase S45 penicillin amidase 
Protein accessionYP_002462146 
Protein GI219847713 
COG category[R] General function prediction only 
COG ID[COG2366] Protein related to penicillin acylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGTTGC TGAAACGGAT TGCCGGCTGG ATAGCGGTAG ATATCGCATT GGTATTGGTA 
TTAGGGAGTA GTGGCGGGTA TCTCTGGTTA CGTCAATCGC TCGCGCAGAC ACGTGGGGAG
ATTCGCGTAG CCGGGATAAG CGGATCGGTC ACAATTGTTC GTGACCAGAA CGGGGTAGCA
CACATTACCG GTACGACCGA AACCGATGCG ATATTCGGGC TAGGTTTTGT TCATGCCCAA
GAACGGCTAT GGCAGATGGA GGTGCGACGT CGGATCGGAC ACGCTCGACT ATCGGAAATC
TTGGGGGAAG CAACTCTGCA AACCGACAAA TTTCTCCGCA CGCTCGGTGT AGCGCGAGCG
GCACAACATG CATTAGCGAA GCTAGATCGC GAAACAATTA CCATCTTAGA GGCATATGCT
GCCGGTGTAA ATGCTTTTTT GGCAATTAAT CCGGTACTAC CGCCAGAGTT TCTCATTCTC
GGTGTGCAAC CAGAGCCATG GCAACCCATT GATTCGTTGG TCTGGGCCAA AATGGTAGCT
TGGGATCTCG GTGGGAATTG GAATGATGAG ATTAGACGTT CGCTGCTCAT TGCCCAAGTC
GGTGCCGAAG ATGCCGATTT TCTGATGCCG GCTTTCACGC CCGATGGACC GGTCATTTTA
CCGGAAGGCA TGCCCACCAG CGCACCGTCG CCAATCAGTG GTCAGAGTAC GCCGTTGCAA
CCGACTACTG CGCGTGCAAT GTTCGACTTG TGGGCAACGG TCTATCAAAC AACCGGCTTG
GGCGACCAGC TTGCCGGCTC GAACAATTGG GTCATCGGCG GTTCACGGAC GGCTAGCGGG
AAGCCGTTGC TGGTCAACGA TCCACACCTC AGCCTACGCA TACCTTCCAT CTGGTATCTG
GCCCACATCA CAGGAGACGC CATCAACGCT ATCGGCGCTA CCTTTCCCGG TTTGCCGGCA
GTCGTGGTCG GCCACAATGA GCGGATTGCG TGGGGTGTTA CCAATACCAG CCCCGATGTG
CAGGACTTGT TTATCGAGCG GATTGATGCA CAAAATTATG TCGAGTATAA CAACACGCGC
GAACCGGTGA CCGTGATTAA CGAGATCATC AAGGTCAAAG GGGCTGAGCC GATCACGTTG
ACCGTGCGGG TCACCCGTCA CGGGCCGATC ATCAGCGACG TGCGCGACGA TATCAACGAA
ACCCTCGCCT TCCGCTGGAC GGCGCTCGAC GACGACGATG CTACCTTACG TGCCTTTTTG
AACATCAATC GGGCGCGCAA CTGGGACGAG TTTGTGGCAG CGCTGCGCGA TTATAAAGCG
CCGATGCAAA ACTTCGTGTA TGCCGATGTT GATGGCAATA TCGGTTACTA CGCGGCGGGG
GCAGTACCGA TCCGGCGGAA CGGCGATGGC CGCTTACCGG TACCAGGATG GACAGATGCG
TATGAATGGA TTGGATACGT CCCCTTTGAG GAGCTTCCGC ACATCGCCAA CCCGTCAAAA
GATTATGTGG TAACGGCCAA TCATCGCGTT ATCGGTGACG ATTATCCTTA CCTGCTGGGC
ACATCGTGGG CTGCACCATT CAGGGCACAG CGGATCATCG AGCTGATCGA GCAGGGTGAT
AAGCTGACGG TCGATGATAT GCGCGCAATG CTCAGCGACG TTGTTTCAAT TCAAGCACGT
GAACTTTTAC CAATACTCCG CAACGTGACC CCAACCGAAC CACGGGAGAC GGTGGCACTC
GAACTGCTGC GCAGTTGGGA TGGTGCAATG AATGGTGAGA GTGCAGCCGC TGCCGTCTAC
CAAAGCTACT ACTACGCCGC ACTGGAAGCA GTCTTCGCCG ACGAACTGCG TGGGTTTTTT
ACCGATGTTT ACCGTCTTCA GAACAATTTT CCGGCGTTGG CTCTACGGAG CGTATTGCTT
GGCGGTCACG ATGAGTGGTG TGATAACGTC ACAACGCCGG ACATTGTCGA GGATTGCCCC
ACGACGTTAG CGCAAGCGTT ACGCAAGGGT CTCGCAACAA TGGCAACGTT ACAGGGCGAG
AACGATCCAA CCCGCTGGCG GTGGGACAGG GTGCATCAGG CGATCTTCCC ACACAACCCC
TTTAGCCAGG TTGAAGCGCT GCGCGGCTTT TTCGAGCGGC GCGTACCAAC CGGGGGCGAT
ACTTTTACCA TCAACGTTGG GCCGGTACGG ATCCGCGAGC CGTATCTGCA ATACAATGGC
CCGTCGTACC GCCAAATTAT CGACTTGAGC AATCTCGCCA ACTCGCGGTT TATGCATACG
ACCGGTCAAT CGGGGAACGT GCTAAGTAGC CGGTACAGCG ATTTTCTAGG ATTGTGGCAG
CGCGGAGAAG ATATTCCGAT GCGCTTCGAT GCCGAGGTTG ATGGTGAGCG TTTGGTATTG
CGACCGACGC CGTGA
 
Protein sequence
MRLLKRIAGW IAVDIALVLV LGSSGGYLWL RQSLAQTRGE IRVAGISGSV TIVRDQNGVA 
HITGTTETDA IFGLGFVHAQ ERLWQMEVRR RIGHARLSEI LGEATLQTDK FLRTLGVARA
AQHALAKLDR ETITILEAYA AGVNAFLAIN PVLPPEFLIL GVQPEPWQPI DSLVWAKMVA
WDLGGNWNDE IRRSLLIAQV GAEDADFLMP AFTPDGPVIL PEGMPTSAPS PISGQSTPLQ
PTTARAMFDL WATVYQTTGL GDQLAGSNNW VIGGSRTASG KPLLVNDPHL SLRIPSIWYL
AHITGDAINA IGATFPGLPA VVVGHNERIA WGVTNTSPDV QDLFIERIDA QNYVEYNNTR
EPVTVINEII KVKGAEPITL TVRVTRHGPI ISDVRDDINE TLAFRWTALD DDDATLRAFL
NINRARNWDE FVAALRDYKA PMQNFVYADV DGNIGYYAAG AVPIRRNGDG RLPVPGWTDA
YEWIGYVPFE ELPHIANPSK DYVVTANHRV IGDDYPYLLG TSWAAPFRAQ RIIELIEQGD
KLTVDDMRAM LSDVVSIQAR ELLPILRNVT PTEPRETVAL ELLRSWDGAM NGESAAAAVY
QSYYYAALEA VFADELRGFF TDVYRLQNNF PALALRSVLL GGHDEWCDNV TTPDIVEDCP
TTLAQALRKG LATMATLQGE NDPTRWRWDR VHQAIFPHNP FSQVEALRGF FERRVPTGGD
TFTINVGPVR IREPYLQYNG PSYRQIIDLS NLANSRFMHT TGQSGNVLSS RYSDFLGLWQ
RGEDIPMRFD AEVDGERLVL RPTP