Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cagg_0872 |
Symbol | |
ID | 7268323 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chloroflexus aggregans DSM 9485 |
Kingdom | Bacteria |
Replicon accession | NC_011831 |
Strand | + |
Start bp | 1081758 |
End bp | 1083455 |
Gene Length | 1698 bp |
Protein Length | 565 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 643565718 |
Product | peptidase M9A collagenase domain-containing protein |
Protein accession | YP_002462227 |
Protein GI | 219847794 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0507103 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGTACC TGATCGTCGG GTTCGTTGTT CTGATCCTGC TTATTCCGTC CGGTTCGCGG CTTCCACTGG CGGTTACCCC GGCTTATCCA CCACCGATCC GCCTGCCGGC GTGCCCACAG CCCACTGTGG CCGATGTCGA CGCAATGCTC GCGTTGTTGC CACAGGCTGG CTATGATTGC ACTGAGCAGA TCGCGGTAGC ACTTCGCCCC CGCATCGAGC CGGTATACGT GCAGGAGTTA CTCGCGATTG CTGTTGAGCC GACCTTCGAT ACCCGTACCC GCCGCAATGC ACTACGCATT TTGGGCCGCT TGGCCGAAAG TGGGCCGGTT ACACGTGCTC GTGAGTTGAT GGTGCAGCAG CAGTCGATGG TGCAGATGAC GGCGCTCACC CTCCTTGAAC GCGAACGCGA CAATTTTCTG TTGCAAGATG CGGTCTGGCT GCTCGACAGC CACTACTATC CAAGCTGGGT AGCAGCACCG GCACTCGAAC AGATTGCTCT CGGTGGCGAG TATGCGCCGG CGTTGCGTTA CCGCGCCGCC CGTGCTCGTG CTCGGCTCAT CGCTGCTGAA TACGGACCAT TGCGTGATGA TTCGCAGCGT TTCATCGTGG CGGCACTGCG CAGCGCCGAT CCCGGTGTAC GCACCGCTGC CGCTGAAGCG ATTAGTTTCT TGCGCGACGA CCAATTGACA GCACGGACAG ATTGGTTACA GCTTGTGGAG GAAGCGCTGG TCCACGAGCC GCCGTTGCAC GTAGCTACCG ATAGCGGCGA TCCACGTGGA GCCGCACTGT TAACCTTCCT TGAGAGTACG CCAACGACGC TGACGGCACG GGCTGCATTA GCACGCGCTG CCGACCGATT GGCCGGCGAA ACTGCGACGG TGCCACGGCT CAATGCCTTG CGCATAGCAT ACGAACACCT TGCCCTACCG TTGCAACACG AAAGTGCAAC AGTCGTGCTG CGCACCGGTC CTGCCGAGAC GAGCGATGGC GATGAACTGT TAGCGATCGT AGCTACAACG TATGCACAGG CTCGTCGCTT CCTGGGTTCG GTAGGAGAAA CGCCAATCCC CGGCGAAGAG CATCTCCCAT TACAGGTGCT AATCTTCCCC GGCCAGGCTG CATATCGTGA CTATATGCGC GCCTTTACCC CCTTCACCGT TGATGTCGAT GGCATTTACG ACGTGCAGCA GAACACGCTC TACAGCTACC GGCGTCGTGA TGATCAGACT GCGAACACGC TCGCCGAGAC GCTCCGCCAC GAAGTAGCCC ATGCAGTCAC AGCGGCCTAT CTCTTTCCCG GCCAGTGGCA CACGCCCGGC TACCACGCCG AACCGAAGGG CTGGTTTGAT GAAGGATTTG CTGAAGTATT GGCCGCACAA ACCAAACCCG ACGCGCCACT TCAGCCGCAT CCGCGTCATT TGGCGACTAT CTGTGCACAA CCGCTCAAGC CGGCTCTCGC CGATCTGGTG GCGTTACGTA CCGGGTATGA CCAGTATGGG ACGTTCGATT ACCCGGCAGC TTGGGCATTG ATGCATTTCT TGCTCGCCGA ACGACCCGCA GCAGCGGCAG CCTTGATCTC GGCATGGCGC AACCAGACGT ATCATCTAGC TAACTGGCCA ACGTTGGGTG GTTGGTCAGA TTGGACCAGC GCCGAATCCG ATTGGCACTT TGCGATTGAG CGTTGGTGTA GGCTATAA
|
Protein sequence | MRYLIVGFVV LILLIPSGSR LPLAVTPAYP PPIRLPACPQ PTVADVDAML ALLPQAGYDC TEQIAVALRP RIEPVYVQEL LAIAVEPTFD TRTRRNALRI LGRLAESGPV TRARELMVQQ QSMVQMTALT LLERERDNFL LQDAVWLLDS HYYPSWVAAP ALEQIALGGE YAPALRYRAA RARARLIAAE YGPLRDDSQR FIVAALRSAD PGVRTAAAEA ISFLRDDQLT ARTDWLQLVE EALVHEPPLH VATDSGDPRG AALLTFLEST PTTLTARAAL ARAADRLAGE TATVPRLNAL RIAYEHLALP LQHESATVVL RTGPAETSDG DELLAIVATT YAQARRFLGS VGETPIPGEE HLPLQVLIFP GQAAYRDYMR AFTPFTVDVD GIYDVQQNTL YSYRRRDDQT ANTLAETLRH EVAHAVTAAY LFPGQWHTPG YHAEPKGWFD EGFAEVLAAQ TKPDAPLQPH PRHLATICAQ PLKPALADLV ALRTGYDQYG TFDYPAAWAL MHFLLAERPA AAAALISAWR NQTYHLANWP TLGGWSDWTS AESDWHFAIE RWCRL
|
| |