Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cagg_1419 |
Symbol | |
ID | 7269251 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chloroflexus aggregans DSM 9485 |
Kingdom | Bacteria |
Replicon accession | NC_011831 |
Strand | + |
Start bp | 1747474 |
End bp | 1749444 |
Gene Length | 1971 bp |
Protein Length | 656 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 643566262 |
Product | amidohydrolase |
Protein accession | YP_002462762 |
Protein GI | 219848329 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGCGTG TTGATACCCT GCTCATCGGC GGTACCGTCG TCACGATGGA TGCAGCATGG CGTATTTTTT CCGACGGTGC TGTCGCGATC CGTGATGGGA TGATCGTTGC CGTCGGACCG AGTGCTGAGA TCGCAGCCCA TTACACGGCG ACCGAGACGA TTGATTGTCG TGATTGTGCG ATTATTCCCG GTCTGATCAA CGGTCATGCC CATGTGCCGA TGAGCCTGTT ACGTGGTTTG GTGGCCGACC AGCAACTTGA TGTCTGGCTG TTCGGTTATA TGTTTCCGGT CGAAAGTCAG TTTGTCGATG CTGAGTTTAG CTACACCGGC ACACTGCTCA GTTGTGCCGA GATGATCCGT GGCGGCACAA CAACTTTTGT CGATATGTAT TATTTTGAAG AGGAAGTTGC ACGCGCAGCC GATGAAGTCG GGATGCGGGC GATTTGTGGT CAGACAGTCA TGCGTTTGCC AACACCTGAC GCCGACTCAT TCGATGCCGG GATCGAGCGT GCCCGTCGAT TTATGGCCGA GTGGCGTGGT CATCCACGGA TTATCGCTAC GGTTGCACCA CATGCCCCCT ATACCTGCAC CGATGAAATC TATCGCCAAG CTGCTGCGCT TTGTGCCGAG TTTGGGGCAC CACTGATCAC CCACTTGTCG GAGACGGCAC GCGAGGTAGA AGAGAGTATT CGCGACCGTG AAGTGACGCC GATCCGTTAT GCTAAACGCG TGGGAGCGTT CGATGTGCCC TGCATCGCTG CGCACTGTGT GCATGCCACT GAAGACGATC TGCGGCTCTT GCGCGAAGCT CGCGCCGGTG CTGTACCCTG TCCGACAAGT AATCTTAAGC TCGCCAGTGG TGTCGCGCCT TTTCGCCGTA TGATCGAGAC CGGCGTGCGG GTTGGTCTCG GCACCGACGG GCCGGCCAGC AATGATGACC AGGATATGTT TACGGAAATC CAACTGGCTG CGCTGTTGCC AAAGGGATTG AGCGGTGACC CGACCGCAGT ACCGGCACGA GAGGCATTTG CGCTGGCGAC GTGCTGGGGC GCACGCGCAG TCCATCTCGA TCACCTGGTC GGTTCCCTCG AAGTAGGTAA ACGGGCCGAC ATTGCGGTGG TTGAGTTGCG GCGGTTGCAC AGTGCTCCTC GCTACACCTA CGCTCCTGAT GCAATCTACT CACATCTCGT CTACAGCAGC CGAGCCGCCG ATGTGCGTGA TGTGCTTGTT GATGGGAAGT TGTTGCTGCG CAATCGCCAT CTCTTGACTA TTGATGAAGA TGCGATCATC GGACGTGCGC AAGCGATTGC TGCACGAATT AACGAGTTTC TCGCTACGCG CGAACGCAAC CTGTTGGCGA AGATTTTGGC CCTTGGTGGT GTGCAGCAAG CCGAAATCTT CGAGATTCAA GTCAAGGCCC GACTACAACC CGCCGATGTC GAGCGCGTGC TGGCGCTGCT CAACCACCCG GCGATTACCA TTACGAAAAC GAGTGAGCGC ACCCAATACG ACACTTACTT TATTTTCGCC AACGGTGAAC GTATCCGCAT TCGCGAGGAT CACCGGACCG ATCCCGGTGC TCGTCCGCAA CCTAAGTACA CGATCACTTT GATGGCTGAA GCTCGGCGCT TTGATCAATC GAAGTCGATG ATGATCTCGC GCGCACGGTA CACCGCTCCC GCCGATCATA CGGTACGTTT CTACCGTGAA TATTTTCAAC CCGACCGGAT TGAAGAGTTA GAGAAGCGAC GGCGGCGTTG GCGAATTCTG TACAAAGACC ACGACTTTGC CATCAACCTC GATACACTGG TCGGGCACTC TGATCCCGGG CCTTTCCTCG AAATCAAGAG TCGCACATGG AGTCGGCGCG ATGCCGAACG ACGCGCCGAG CTGATCGGTG AATTACTCGA ACTGGCCGGC ATTGCCGATG AGGCGCTTGT GTTGCAGGAA TATGTGGAGA TGGCGGTATG A
|
Protein sequence | MERVDTLLIG GTVVTMDAAW RIFSDGAVAI RDGMIVAVGP SAEIAAHYTA TETIDCRDCA IIPGLINGHA HVPMSLLRGL VADQQLDVWL FGYMFPVESQ FVDAEFSYTG TLLSCAEMIR GGTTTFVDMY YFEEEVARAA DEVGMRAICG QTVMRLPTPD ADSFDAGIER ARRFMAEWRG HPRIIATVAP HAPYTCTDEI YRQAAALCAE FGAPLITHLS ETAREVEESI RDREVTPIRY AKRVGAFDVP CIAAHCVHAT EDDLRLLREA RAGAVPCPTS NLKLASGVAP FRRMIETGVR VGLGTDGPAS NDDQDMFTEI QLAALLPKGL SGDPTAVPAR EAFALATCWG ARAVHLDHLV GSLEVGKRAD IAVVELRRLH SAPRYTYAPD AIYSHLVYSS RAADVRDVLV DGKLLLRNRH LLTIDEDAII GRAQAIAARI NEFLATRERN LLAKILALGG VQQAEIFEIQ VKARLQPADV ERVLALLNHP AITITKTSER TQYDTYFIFA NGERIRIRED HRTDPGARPQ PKYTITLMAE ARRFDQSKSM MISRARYTAP ADHTVRFYRE YFQPDRIEEL EKRRRRWRIL YKDHDFAINL DTLVGHSDPG PFLEIKSRTW SRRDAERRAE LIGELLELAG IADEALVLQE YVEMAV
|
| |