Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cagg_3443 |
Symbol | |
ID | 7269668 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chloroflexus aggregans DSM 9485 |
Kingdom | Bacteria |
Replicon accession | NC_011831 |
Strand | + |
Start bp | 4183322 |
End bp | 4185481 |
Gene Length | 2160 bp |
Protein Length | 719 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 643568253 |
Product | CRISPR-associated protein, Csm1 family |
Protein accession | YP_002464721 |
Protein GI | 219850288 |
COG category | [R] General function prediction only |
COG ID | [COG1353] Predicted hydrolase of the HD superfamily (permuted catalytic motifs) |
TIGRFAM ID | [TIGR02578] CRISPR-associated protein, Csm1 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.791619 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCACTT CCCACCCAAC CGATCCGGTT GCCACGGCCA CGCACGTGTT GCAGGCGTGG GTGGCGGCTG CCGTGGGCGC AACCATAACC GCCTCACCCG ATCTGGTAGC AACTGCGTGG CAGATCGCCG GCCGGTCGCC ACAAGCGCAG TGGCCGCCGT TGCCGGATCA CCTGGCGACC GTCTTCACCC GATTGGTCGG TCAAGCACCG CCGTCGGCCT GGCAACCACC GACGGTCTTG CAGTGTGACC CTGACGCGCT TTTTCCGAGG GCGCAACCGG CTACGCCTAC GGATGAGTTG CGCAATGGCT TACGTACTGG GCTCGCTACA GCAGCACAGG CCACCGATCC GGCCAACCGC CTTGAACTGC TTCTGAACGC GCTGCAACGC TACGGATCGG CCTGGCCATC ACCACTAGCG GCAGTGTCAC TCTACGATCT TGCGCGGTTG CATGCAGCCG TCGCGGCGGC GTTGACCGGT GATGCACAAC AGCAGATCTG TCTGCTCGGC GGTGATCTGT CTGGCTTGCA AGAGTTTCTC TACAGTATTC CGGCGGAAGG CGCGGCACGT CAGTTGCGTG GACGTTCCCT GTACTTGCAG TTGCTTACCG ATGCATGTGC GCAATGGGTC TTGCGGAAGA GCGGGATGCC GCTCTGTAAT CTGCTGTATG CCGGTGGTGG TCGGTTCTAC GCTGTGCTGC CGGGACGTTT CGCCAATGAG GTGTCGGCTT GGCGGCGTGA ATTGGGGCAG GTGCTGCTCG AATACCATCG TGGTGCGCTC TATGTGGCAC TCGGTGCAAC CGCGCCGTTT GCCCCCGATC AGTACAACGA GCAAACATGG CACGAGTTGA CGCGCGCGAT TGACACCGAC AAGCGACGAC GGTTCGCAGC GTTGGACGAT GAATCGTTTG CCCGTATCTT CCAACCGCGC CCACCACGTC CGCCGCGCAG TGATGGTAAG GAAGCTCCCG ATCCGCTTGG CGAGTCGTTA GCCGATTTGG GGCGGCAACT GACCCGTGCT GCTGTGCTGT TGGTCGATTC CCAGGCGTCG GCGGTGTCCG GTACGACGTG GCGCAGTGTG GTGAGCAAAT TGGGAGTGAA TTACGAACTG ACAGAACAAG TTCGCTTGCC GGTCGCCAAG CGGCGTGCAT TGGCATTGAC CGACGAGGTG GTGCCGTCCG AACCGGCGCA AGCGATTTTG ATCGGTCAGC GTTATCTGGT GGCCGAAGCC TACCGTTTGC GCGACACCGA TCTGTCGCGT TACCGTGCGC TTGATCCGAG TCAAGCCGAA GAACTACGTC CCGGCGATGT TGCTCCGTTC AACTTGTTGG CCGACCAGAG CGAAGGGATT AGCCGGATTG CCGTCTTGCG GATGGACGTA GATAACCTCG GTGATTTGTT TGGGCGCGGG TTACAGCGCC CCGCCGGTTT GGCCGGTTTA GCGGTGACTG CGGCGCTCAG TAGTGCGCTG AGTCGCTTTT TTGAAGGTTG GGTTGGCGAA CTCTGTCGCC AGGTTAACGA ACGCAGCCAA GGTCAGGGTG GAATCTATGC AGTCTACAGC GGTGGCGACG ATCTGTTTCT GGTCGGGTCG TGGCATCTGT TGCCCGAACT GGCCCGCACC ATTCGCGCCG ACTTTATCCG CTACGCCGGC GGTGCGGTGA CGGTCTCGGC CGGAATTACC TTGCATCAGG CCGGCTACCC GCTCTATCAG GCGGCAGAAG ATGCCGGCAA CGCATTGGAC GCGGCCAAAT CCTACGAACA TCCTGATGGG CGGAGCAAGG ATGCGATTAC GTTTTTGGGA CAGACGTTGA GCTGGGTGGA GTTTGAACAG GCTGCCACCC TGAAAGATGA GCTGATCGAT TTGATTGCAA ATGGTGCGCC GCGCAGCTTG CTCATGACGA TCCAAGCGTT GGCCGTCCGT GCCCGCCAGC GGTTTAATCG GAGCGGTCAG GTGCAGCTTC TCGTCGGGCC GTGGGTGTGG CAGGGTGCGT ACCAACTGAC CCGCCTCGCC GAACGGTCGG GTGATCTGCG TCGGCGGATT GAAGCCCTGC GCGAGCAGCT ACTCAGCAGC GAAGGGATTG CGTCCCGCAC CATTATTCCC GCCGGGTTAG CGGCGCGGTG GGCACAGTTG CTACTCCGCG GGCGTGATAC GCGGCGGTGA
|
Protein sequence | MTTSHPTDPV ATATHVLQAW VAAAVGATIT ASPDLVATAW QIAGRSPQAQ WPPLPDHLAT VFTRLVGQAP PSAWQPPTVL QCDPDALFPR AQPATPTDEL RNGLRTGLAT AAQATDPANR LELLLNALQR YGSAWPSPLA AVSLYDLARL HAAVAAALTG DAQQQICLLG GDLSGLQEFL YSIPAEGAAR QLRGRSLYLQ LLTDACAQWV LRKSGMPLCN LLYAGGGRFY AVLPGRFANE VSAWRRELGQ VLLEYHRGAL YVALGATAPF APDQYNEQTW HELTRAIDTD KRRRFAALDD ESFARIFQPR PPRPPRSDGK EAPDPLGESL ADLGRQLTRA AVLLVDSQAS AVSGTTWRSV VSKLGVNYEL TEQVRLPVAK RRALALTDEV VPSEPAQAIL IGQRYLVAEA YRLRDTDLSR YRALDPSQAE ELRPGDVAPF NLLADQSEGI SRIAVLRMDV DNLGDLFGRG LQRPAGLAGL AVTAALSSAL SRFFEGWVGE LCRQVNERSQ GQGGIYAVYS GGDDLFLVGS WHLLPELART IRADFIRYAG GAVTVSAGIT LHQAGYPLYQ AAEDAGNALD AAKSYEHPDG RSKDAITFLG QTLSWVEFEQ AATLKDELID LIANGAPRSL LMTIQALAVR ARQRFNRSGQ VQLLVGPWVW QGAYQLTRLA ERSGDLRRRI EALREQLLSS EGIASRTIIP AGLAARWAQL LLRGRDTRR
|
| |