Gene Cagg_3506 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_3506 
Symbol 
ID7266434 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp4273720 
End bp4274940 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content63% 
IMG OID643568314 
Productamidohydrolase 
Protein accessionYP_002464781 
Protein GI219850348 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3454] Metal-dependent hydrolase involved in phosphonate metabolism 
TIGRFAM ID[TIGR02318] phosphonate metabolism protein PhnM 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAATATC TCTTCACCAA CGCGACTGTT GTCTTACCCG ATCGAGTCAT TGAAGAGGGT 
TGGGTGGTGA TCGACCGAGG CCGGATCGGG GCGATTGGAC GCGGCAAGCA CCCGTATGCG
GCAACGATGC CGCAGTTTGA TCTCGACGGG GCGTATTTGT TGCCCGGCCT GATCGATCTG
CACTGTGACG CTATCGAAAA ACTCGTCCAA CCTCGGCCCG GTGTCGAGAT TGAAGTTGGT
ATCGCATTGC ACGCTGCCGA CCGGCTCTTG CTAGGGTGCG GCGTGACCTG CCAATTCCAC
GCCTTATCGC TCGACGACGC CGAATTCGGT GTGCGCAGCG ATCGCTTTGT CAGCGACTTC
CTTCACCAGC TCAGCGCCGA ACGGCACTGC GGCGCACGCC ATCTCGTGCA TGCCCGGGTC
GAGGTCAGCA GTGAGCGCGG CCTTGAGGCG CTGAAGACAA TGCTGGGCCA TCCCTTATTG
CGGCTGGTCT CAATCATGGA TCACAGTCCG GGGCAGGGGC AATATACCAC CGAAGCTGCG
TTTCGCCATT ACGTCGCCAA GACCACTGGG CGCAGCGATG CCGAGATCGA CGAATTGTTG
GCCCGCAAGC GAGCGGCGCA GAGCGATGTG CCCAACCGGA TCCGGCAAGT GATTGCATGG
GCGACGAAGT ATGGCTTGCC GGTCGCCAGC CACGATGACG ATACACCTGA ACGGGTGGCG
CAGTGGGTGG AGTTGGGGGT GAAACTTGCC GAATTCCCCA CCACCTTAAC CGCGGCCCAA
ACGGCGCATA CCGCCGGGAT GGCGGTGGGG ATGGGAGCGC CGAACGTGCT GCGCGGCAAG
TCAAGTGGCG GCAACCTGAG CGCGTTAACC GCAATCGAGG CCGGCGTTGT TGACTGGCTG
TGCGCCGACT ACTATCCGGC CTCGCTATTG CCGGTCATCT TCCGCCTCGC CGACCGCGGT
ACCCTTAGCT TGCCCGCCGC AGTGGCGCTA GTCAGCCATC ACCCGGCCTG CGCCGCCGGC
ATCGGCCATC TGATCGGCAG TATCCAGCCG GGGCTGATCG CCGATCTGAT CGTCGTGCGC
CGGATGCCCG ATCCGGTGGT GCAGCAGGTC TTTGTCAGCG GGAAGCCGGT GTACACCTTG
CAGGAGCAGA ACGAACCGCT ACCCTTCCCC GACCGCCACC AGATGGGTGA AGAATTGCCG
GCCCCACGCG AGCATCGTTA G
 
Protein sequence
MQYLFTNATV VLPDRVIEEG WVVIDRGRIG AIGRGKHPYA ATMPQFDLDG AYLLPGLIDL 
HCDAIEKLVQ PRPGVEIEVG IALHAADRLL LGCGVTCQFH ALSLDDAEFG VRSDRFVSDF
LHQLSAERHC GARHLVHARV EVSSERGLEA LKTMLGHPLL RLVSIMDHSP GQGQYTTEAA
FRHYVAKTTG RSDAEIDELL ARKRAAQSDV PNRIRQVIAW ATKYGLPVAS HDDDTPERVA
QWVELGVKLA EFPTTLTAAQ TAHTAGMAVG MGAPNVLRGK SSGGNLSALT AIEAGVVDWL
CADYYPASLL PVIFRLADRG TLSLPAAVAL VSHHPACAAG IGHLIGSIQP GLIADLIVVR
RMPDPVVQQV FVSGKPVYTL QEQNEPLPFP DRHQMGEELP APREHR