Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tfu_1578 |
Symbol | |
ID | 3581942 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermobifida fusca YX |
Kingdom | Bacteria |
Replicon accession | NC_007333 |
Strand | + |
Start bp | 1819567 |
End bp | 1821351 |
Gene Length | 1785 bp |
Protein Length | 594 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637685273 |
Product | CRISPR-associated Cmr2 family protein |
Protein accession | YP_289637 |
Protein GI | 72161980 |
COG category | [R] General function prediction only |
COG ID | [COG1353] Predicted hydrolase of the HD superfamily (permuted catalytic motifs) |
TIGRFAM ID | [TIGR02577] CRISPR-associated protein, Crm2 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGACGGCC AAGACCTCGT GGTGATCGCA CTGTCCGGAG TCCAGCGATT CATCGAAGAA TCCAGAAGCA CCTCGGACCT GCGGGCAGGC AGCCGAATCA TCGCTGAGCT CACCGAACAA GCCGTGTCCG TGTGCCAGGA GTCCGCTGCC GAACTCGTCT TCCCCGCCCC GAGCAAGAAA AGGAAAGACG TGGGAATGCC TAACCGGGTG GTGGCACTCG CCCCCGCAGG GCAGGGGACG ACACTTGCTC AGCGAGCCAC CGCACGGGTG AATGAGAAGT GGAAGGAGTG GTTGCAAAAG ACTCTTGGCC GTATCGAAGA AACCCCCGGT ATGCCCGCCG TGACGTGGGC CGTAGCTCCC CCACGGCCAG GCGGCTACCC CGAACAGTGG AAAGAAGCCC AACGTCGCCT AGCCGCACGC AAACAGGTCC GGAACTTCAC CGCCCTGGAG GCACGCAACG GCCCCTGCCA GCTGAGTCCT CACTGGCCCG CGACCACGAA GCCACCGAAA GGTCTTCCCT CTTTTGAGCG TGACACGCTG AGCGCTGCCA ACTGGCTGAA ACGCAAGCTT GGCCGCCCCA ATGATAAGGA GAGCGTGTCA CCTTACGGGA TCCCCTCGAC CTACGGGATC GCGTCGGCCC CATTCCGCAT CGAAGTTCTG AAAAATATCA ACGATCCGGA TGTAACTGAA CAGGTGGACT ACTTGCATCA AGTAGTCCAG GACGAAGATA CGGGCATACA TATCCCCAAC TGGCCACTGC CAGGCATGCC GAAACCTCAG GGAGAATTAG CCCAGTGGCT GTGGCGGCAC GGAGGACAGT GGGTTTACAC AGACTCATGG TCCGCAGAGT CGCTGGCCCG CGAATTCAGG AAGCCGACTT CTGGAGAAGC ATTCGACAAC TTCTCCAGAA CAGTAGAGAT AGGCCGTATT GCTGCCGCTG ATCTTCAAAA GATCATGCGG GACAAGTTCC AGGTAGCGCC TCCCTCCTCC TACCTGGCGA TTCTTGTCCA AGACCTGGAC GGCATGGGCT CCTACCTCTC CGAGGAAAAG AACCTGTCGC ACAAGCGGCA CACCGAAATT TCTGACCAGT TGCGGAGAGT CGCTGAAAGG CAGACAGAAC TGCTGCACGA CTCGGCGCTG CTGGGCGTTG CAGTCTATGC CGGCGGAGAC GACCTGCTGG CGTTCCTTCC CGCAGCGACA GCCCTTGCGG GGGCACGCGC CTGCAGACAG GCAGTCACCG AAGTATCCCC GGAACTGCCC ACGGCGAGCA GTGCCTTACT GTTCTTCCAC CGCAAATACC CGTTGCGGCT CGCCTTGGCC GCGAGCCGGG AAGCCCTGGC TGCCGCCAAA AACGTTCCTG GAAAGAACGC TCTCGCCGTG AGCTACCTGC GCCGCTCCGG AACCCAGGAA ACATACGTCC AGAAATGGGC TCTTCCTGAC CACAACGATC CCGAGGCCTT GGTGTGTGAC CGCCTGTCCC TCTTCACCAG GAGTGAGACG GGAGCACGGC TATCCCCCGG GCTACTGCGG GACCTGGAAC GCGACAGTCA CGCCCTATGC ACGCTCAGTC TGGACCTCTT TACCGCTGAG CTCCAGCGCC TCGTCCACCG GCATACTCTC GCGCCCACCA GCGAGCAACA AAAAGCATTT GCCCTCAAGG CTGCAGAGCA GCTCCGCCTT CTGGGACGGC CCACCAGCGC CCTAGAAGAC TCAGAAAAGA TTTCCGAACA ATCACTGATC GCGGCGGCTC GAGTCGCAGT GTTCTTGCGT CAGGAGTGCC GATGA
|
Protein sequence | MDGQDLVVIA LSGVQRFIEE SRSTSDLRAG SRIIAELTEQ AVSVCQESAA ELVFPAPSKK RKDVGMPNRV VALAPAGQGT TLAQRATARV NEKWKEWLQK TLGRIEETPG MPAVTWAVAP PRPGGYPEQW KEAQRRLAAR KQVRNFTALE ARNGPCQLSP HWPATTKPPK GLPSFERDTL SAANWLKRKL GRPNDKESVS PYGIPSTYGI ASAPFRIEVL KNINDPDVTE QVDYLHQVVQ DEDTGIHIPN WPLPGMPKPQ GELAQWLWRH GGQWVYTDSW SAESLAREFR KPTSGEAFDN FSRTVEIGRI AAADLQKIMR DKFQVAPPSS YLAILVQDLD GMGSYLSEEK NLSHKRHTEI SDQLRRVAER QTELLHDSAL LGVAVYAGGD DLLAFLPAAT ALAGARACRQ AVTEVSPELP TASSALLFFH RKYPLRLALA ASREALAAAK NVPGKNALAV SYLRRSGTQE TYVQKWALPD HNDPEALVCD RLSLFTRSET GARLSPGLLR DLERDSHALC TLSLDLFTAE LQRLVHRHTL APTSEQQKAF ALKAAEQLRL LGRPTSALED SEKISEQSLI AAARVAVFLR QECR
|
| |