Gene Cagg_1419 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_1419 
Symbol 
ID7269251 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp1747474 
End bp1749444 
Gene Length1971 bp 
Protein Length656 aa 
Translation table11 
GC content56% 
IMG OID643566262 
Productamidohydrolase 
Protein accessionYP_002462762 
Protein GI219848329 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGCGTG TTGATACCCT GCTCATCGGC GGTACCGTCG TCACGATGGA TGCAGCATGG 
CGTATTTTTT CCGACGGTGC TGTCGCGATC CGTGATGGGA TGATCGTTGC CGTCGGACCG
AGTGCTGAGA TCGCAGCCCA TTACACGGCG ACCGAGACGA TTGATTGTCG TGATTGTGCG
ATTATTCCCG GTCTGATCAA CGGTCATGCC CATGTGCCGA TGAGCCTGTT ACGTGGTTTG
GTGGCCGACC AGCAACTTGA TGTCTGGCTG TTCGGTTATA TGTTTCCGGT CGAAAGTCAG
TTTGTCGATG CTGAGTTTAG CTACACCGGC ACACTGCTCA GTTGTGCCGA GATGATCCGT
GGCGGCACAA CAACTTTTGT CGATATGTAT TATTTTGAAG AGGAAGTTGC ACGCGCAGCC
GATGAAGTCG GGATGCGGGC GATTTGTGGT CAGACAGTCA TGCGTTTGCC AACACCTGAC
GCCGACTCAT TCGATGCCGG GATCGAGCGT GCCCGTCGAT TTATGGCCGA GTGGCGTGGT
CATCCACGGA TTATCGCTAC GGTTGCACCA CATGCCCCCT ATACCTGCAC CGATGAAATC
TATCGCCAAG CTGCTGCGCT TTGTGCCGAG TTTGGGGCAC CACTGATCAC CCACTTGTCG
GAGACGGCAC GCGAGGTAGA AGAGAGTATT CGCGACCGTG AAGTGACGCC GATCCGTTAT
GCTAAACGCG TGGGAGCGTT CGATGTGCCC TGCATCGCTG CGCACTGTGT GCATGCCACT
GAAGACGATC TGCGGCTCTT GCGCGAAGCT CGCGCCGGTG CTGTACCCTG TCCGACAAGT
AATCTTAAGC TCGCCAGTGG TGTCGCGCCT TTTCGCCGTA TGATCGAGAC CGGCGTGCGG
GTTGGTCTCG GCACCGACGG GCCGGCCAGC AATGATGACC AGGATATGTT TACGGAAATC
CAACTGGCTG CGCTGTTGCC AAAGGGATTG AGCGGTGACC CGACCGCAGT ACCGGCACGA
GAGGCATTTG CGCTGGCGAC GTGCTGGGGC GCACGCGCAG TCCATCTCGA TCACCTGGTC
GGTTCCCTCG AAGTAGGTAA ACGGGCCGAC ATTGCGGTGG TTGAGTTGCG GCGGTTGCAC
AGTGCTCCTC GCTACACCTA CGCTCCTGAT GCAATCTACT CACATCTCGT CTACAGCAGC
CGAGCCGCCG ATGTGCGTGA TGTGCTTGTT GATGGGAAGT TGTTGCTGCG CAATCGCCAT
CTCTTGACTA TTGATGAAGA TGCGATCATC GGACGTGCGC AAGCGATTGC TGCACGAATT
AACGAGTTTC TCGCTACGCG CGAACGCAAC CTGTTGGCGA AGATTTTGGC CCTTGGTGGT
GTGCAGCAAG CCGAAATCTT CGAGATTCAA GTCAAGGCCC GACTACAACC CGCCGATGTC
GAGCGCGTGC TGGCGCTGCT CAACCACCCG GCGATTACCA TTACGAAAAC GAGTGAGCGC
ACCCAATACG ACACTTACTT TATTTTCGCC AACGGTGAAC GTATCCGCAT TCGCGAGGAT
CACCGGACCG ATCCCGGTGC TCGTCCGCAA CCTAAGTACA CGATCACTTT GATGGCTGAA
GCTCGGCGCT TTGATCAATC GAAGTCGATG ATGATCTCGC GCGCACGGTA CACCGCTCCC
GCCGATCATA CGGTACGTTT CTACCGTGAA TATTTTCAAC CCGACCGGAT TGAAGAGTTA
GAGAAGCGAC GGCGGCGTTG GCGAATTCTG TACAAAGACC ACGACTTTGC CATCAACCTC
GATACACTGG TCGGGCACTC TGATCCCGGG CCTTTCCTCG AAATCAAGAG TCGCACATGG
AGTCGGCGCG ATGCCGAACG ACGCGCCGAG CTGATCGGTG AATTACTCGA ACTGGCCGGC
ATTGCCGATG AGGCGCTTGT GTTGCAGGAA TATGTGGAGA TGGCGGTATG A
 
Protein sequence
MERVDTLLIG GTVVTMDAAW RIFSDGAVAI RDGMIVAVGP SAEIAAHYTA TETIDCRDCA 
IIPGLINGHA HVPMSLLRGL VADQQLDVWL FGYMFPVESQ FVDAEFSYTG TLLSCAEMIR
GGTTTFVDMY YFEEEVARAA DEVGMRAICG QTVMRLPTPD ADSFDAGIER ARRFMAEWRG
HPRIIATVAP HAPYTCTDEI YRQAAALCAE FGAPLITHLS ETAREVEESI RDREVTPIRY
AKRVGAFDVP CIAAHCVHAT EDDLRLLREA RAGAVPCPTS NLKLASGVAP FRRMIETGVR
VGLGTDGPAS NDDQDMFTEI QLAALLPKGL SGDPTAVPAR EAFALATCWG ARAVHLDHLV
GSLEVGKRAD IAVVELRRLH SAPRYTYAPD AIYSHLVYSS RAADVRDVLV DGKLLLRNRH
LLTIDEDAII GRAQAIAARI NEFLATRERN LLAKILALGG VQQAEIFEIQ VKARLQPADV
ERVLALLNHP AITITKTSER TQYDTYFIFA NGERIRIRED HRTDPGARPQ PKYTITLMAE
ARRFDQSKSM MISRARYTAP ADHTVRFYRE YFQPDRIEEL EKRRRRWRIL YKDHDFAINL
DTLVGHSDPG PFLEIKSRTW SRRDAERRAE LIGELLELAG IADEALVLQE YVEMAV