Gene Cagg_0047 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_0047 
Symbol 
ID7269044 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp69782 
End bp71182 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content57% 
IMG OID643564920 
Productbeta-lactamase domain protein 
Protein accessionYP_002461436 
Protein GI219847003 
COG category[R] General function prediction only
[P] Inorganic ion transport and metabolism 
COG ID[COG0491] Zn-dependent hydrolases, including glyoxylases
[COG0607] Rhodanese-related sulfurtransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTGCTGA AATACTTCTA CGATGAGATG TTGGCGCAAG CCTCTTATCT CATCGGCTGC 
GCGAAGACCG GCGAAGCGAT GGTGATCGAC CCGATGCGTG ATGTAGAGCC CTACTTGCGG
GTAGCAGCCA AGGAAGGTTT GCGGATCACA CACGTGACCG AAACGCACAT CCACGCCGAC
TTCGTGAGTG GGGTTCGTGA GCTGGCTGCG CGTACCGGTG CGCAGATGTA TCTGAGCGAC
ATGGGTGACG CCAACTGGAA ATATGTCTAT CCTGAGATCG ATAAGGCGAT TTTGGTTAAA
GACGGCGATA CGTGGATGGT CGGTAATATC AAGGTGCAGG TAATCGCGAC CCCCGGCCAC
ACCCCAGAGC ACATTGCCTT CATGATCACC GATACTGCCG GCGCCGATCA GCCGATGGGT
ATCTTCACCG GTGACTTCCT CTTCGTTGGT GACGTAGGAC GGCCTGACCT TTTGGAGGAG
GCAGCCGGTA TCGCCGGAAC CAAGGAACCG GGTGCTCGCC GCCAGTTCCA GTCGGTGCAG
CGTATCAAAG CCCTCCCCGA CTACCTGCAA GTCTGGCCGG CACACGGATC GGGCAGTGCA
TGTGGAAAGG CGCTGGGTGC AATTCCATCA AGCACACTCG GCTACGAAAA GCGTTTTAAC
CCGGCATTCC AGTTCGACGA TGAAGATGCG TTTGTGAAAT GGCTGCTGCA CGGTCAACCC
GAACCACCAC GCTACTTCGC CCGCATGAAG CACGTCAACA AGGTCGGACC GGCACTCTTG
CACGACCTTG CTACGCCGGT TGAGGTCGAA CGGCCCGTGC TTGATCAGGC ATTAGCCGAA
GGGGCAACGG TGTTCGATCT CCGCAGTGCG GCTGAATTCG TTGCCGGCCA CGTACCGGGC
AGTATCAGCG TCCCAATCCG CAAGATGTAC AGCACCTATG TCGGCTGGTT TGTCGATTTC
AACAAACCGA CCTACCTCGT GGTGCCTGAT GATGCCGATT TGCAGCAGAT TTTGAAAGAT
CTCCGCGCGA TCGGTGTTGA CGATATTCCG GGATACATAC CGGCGTCGGC TTTGGGAGGT
AATCTGGCCC AACTGCCTAC TGCAACGGTT AAAGACTTGA GTGAGGCTCT CGCCAATCAA
TCGGCAGTCA TCCTCGATAT GCGCAACCAG ACGGAGTTTG ACGAGATTCA CTTGCCCGGC
GCGCTACACA TTCCGGTTGG CTATCTCCCA CGTCGTCTCG CCGACATCCC GCGTGACACG
CCGATCATTG CGCATTGCGC CACCGGCTAT CGTTCGCAGG TCGGTACGAG TGTCCTGCAC
CGGTTGGGCT TCACCAACGT CGCAACGCTT GTTGATCCAC AAGGCACGTG GCGCGAACTG
GTGTCGGCTG TAACCGTGTA A
 
Protein sequence
MLLKYFYDEM LAQASYLIGC AKTGEAMVID PMRDVEPYLR VAAKEGLRIT HVTETHIHAD 
FVSGVRELAA RTGAQMYLSD MGDANWKYVY PEIDKAILVK DGDTWMVGNI KVQVIATPGH
TPEHIAFMIT DTAGADQPMG IFTGDFLFVG DVGRPDLLEE AAGIAGTKEP GARRQFQSVQ
RIKALPDYLQ VWPAHGSGSA CGKALGAIPS STLGYEKRFN PAFQFDDEDA FVKWLLHGQP
EPPRYFARMK HVNKVGPALL HDLATPVEVE RPVLDQALAE GATVFDLRSA AEFVAGHVPG
SISVPIRKMY STYVGWFVDF NKPTYLVVPD DADLQQILKD LRAIGVDDIP GYIPASALGG
NLAQLPTATV KDLSEALANQ SAVILDMRNQ TEFDEIHLPG ALHIPVGYLP RRLADIPRDT
PIIAHCATGY RSQVGTSVLH RLGFTNVATL VDPQGTWREL VSAVTV