Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tfu_1601 |
Symbol | |
ID | 3581088 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermobifida fusca YX |
Kingdom | Bacteria |
Replicon accession | NC_007333 |
Strand | - |
Start bp | 1844722 |
End bp | 1847634 |
Gene Length | 2913 bp |
Protein Length | 970 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637685295 |
Product | CRISPR-associated helicase Cas3 family protein protein |
Protein accession | YP_289659 |
Protein GI | 72162002 |
COG category | [R] General function prediction only |
COG ID | [COG1203] Predicted helicases |
TIGRFAM ID | [TIGR01587] CRISPR-associated helicase Cas3 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0436721 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTTTCG ATGACTCTGC GTCGCCGTGG CCGGGCAGAC TGTCTGCTGC AGCACGGTCC GTGTGGGCCA AACACGACCG TGACACGGGT GACTGGTTAC CGCTGTGGCG GCACATGGCA GACAGTGCCG CGGTTGCAGG CCGGCTGTGG GACGAATGGC TACCCCGCCA GGTGCGGCGC CTGATCGCCG CGGCCCTGCC CGGCGGGGAA GACGACGCCC GGCGGCTAGC GGTGTGGCTG GCCGCAACCC ACGACATTGG GAAAGCCACT CCGGCATTCG CCTGCCAGGT CGAGGAGCTG GCTGCCGCCA TGCGCCGCAC CGGCCTGGAC ATGCCGCTGC ACAAGCAGAT GCCGGACCGT AAGGACGCCC CCCATAACCT GGCCGGGCAG ATCCTGCTCC AGCAGTGGCT CATCGAACGG CATGGTTGGC ACAAGCGCAA AACACTTCCG TGGACGATCG TCGCGGGCGG CCACCACGGG GTGCCACCCA CGCACGATGA GCTGAGGAAG CTCGCCCAGT CCCGCGAACT GCTGTGTACC TCCTCCTATG AAGACATGTG GCGCGGTGTC CAGACCGAAC TGCTCGACAC TGCTGCCGCG GCCTGCGGGG TGACTGAGCG GCTGGGGGAG TGGCAGCACG TCGACCTGCC TCAACCAGTG CAGGTACTCC TCACCGCGCT GGTCATTGTT GCCGACTGGA TAGCCAGCAA CGCCGACCTT TTCCCCTACT TTCCCGACGG GGTCGCCACC GACCCTGAAC GCATCGACAC AGCCTGGAAG CAACTGGACC TGCCACACCC TTGGCAGGCA GTGGAACCGG AAGAGGATCC CGCAACCCTG TTCGCCACCC GCTTCGACTT GCCGAAAGGG GCACGGATCC GTCCCGTTCA GGAAGAAGCC GTGCGACTGG CCCGCGCCCT CCCCGCCCCG GGAATGATGA TCATCGAAGC GCCGATGGGG GAGGGGAAAA CGGAGGCGGC GCTCGCTGTC ACCGAGATCT TCGCTGCCCG CTCCGGAGCC GGCGGATGCT TCATCGCCCT GCCAACCATG GCCACCGGGA ATGCCATGTT CCCGCGGATG CTCCACTGGT TGAAGCGTCT CCCCAACAAA GCCGGTACGC ACTCGGTGTT CCTGGCACAC TCCAAGGCTG CTCTCAACGA GGAGTACACC ACGCTGGCCC GCCAGGACAA TGGACGGATC ACCGACGTCG ACCGCGACGG CACAGAAGGG GAATGGCAGC CCCGCAGCGA TGAACGAGTG GCCCCCGCTC AACTCGTCGC TCACCACTGG CTGCGCGGCC GGAAGAAAGG AATGCTGTCC TCGTTTGTGA CAGGCACCAT CGACCAGCTT CTCTTCACCG GACTGAAGAG CCGCCACCTG GCGTTACGCC ACCTCGCGAT GGCCGGAAAA GTGGTTGTCG TCGACGAAGC CCACGCCTAC GACACCTACA TGAACTCCTA CCTCGACCGG GTACTGTCCT GGCTGGGTGC CTACGGCGTG CCAGTAGTGG TACTTTCCGC GACCCTGCCG GCTCGTCGCC GCCGCGAACT CCTCGAAGCC TATGCCGGGG CCGCAAGTAT CGACAGCGGT TTCGCCGAGG TGGAGAAAGC AGACGGCTAC CCACTGCTCA CCGCAGTCGC CCCGGGAACA GCCCCGCGTA TCGTGGCGCC AGCCGCCTCA GGCCGCGGCA CCACCGTGTG GCTGGAACGC ATAGACGACG ACCTGGACAC ACTCGCGGAC CGGTTGGAAG AAGAGCTCGC CGAGGGCGGC TGTGCCCTAG TCGTCCGCAA CACGGTCGAC CGAGTCCACG AGGCCGCCGC CCACCTGCGC CAGCGTTTCG GGAACGACCA GGTGAGCGTC GCCCACGCGC GTTTCGTGGA TGCGGACCGG GTCGAGAACG ACACGCGGCT ACTGGAAACC TTCGGCTCCC CCGACAAAGT CGCGCAAGCG GGAGGCAGCC GCCCCACAAA ACACATTGTG GTCGCCAGCC AGGTAGTGGA ACAGTCTTTA GACGTTGACT TCGACCTGCT GGTCAGCGAT CTCGCACCAA TCGACCTGCT GTTGCAGCGG ATGGGCCGCC TGCACCGCCA CCAGCGCGGG AAAGAGCAGT GCGAGCGTCC CCCGCGGCTG CGCGCCGCGC GCTGCCTGAT CACCGGGGTG GACTGGACAA CCAGGGTCCC CCAACCAGTG AAAGGCTCTC GGACCATCTA CCGGGACTAT CCACTGCTGC GCTCCCTGGC GGTCCTGGAA CCCTACCTCG CCGAGACTGC TGGGCAAGGA CAAGCGATCC GCCTGCCGGA CGCTATCCAC CCGCTGGTGC AGGCAGCCTA CGGCTCCGAA ACGGTCGGCC CCGCCGAATG GCAGGAAACC CTGGAAAAAG CCCGAATCGC CGACGACGCC CACCAAGCCG ACCAGCGCGG CCGTGCCAGG GACTTCCTCC TCGGACCAGT GAAGAAACCC GGACAACCCC TGACCGGATG GGTAAACGCC GGGTTGGGAG ACGCCGACGA CACTCGCCGC GGTGCCGCCC AAGTCCGTGA CAGTCAGGAA ACCCTGGAAG TCCTCGTCGT GCAGCGGCGC GCCGACGGTA CGCTCACCAC CCTGCCGTGG CTAGGCAAGA ACCGCGGCGG GTTGAAACTG CCCACCGATG CTGCTCCTCC TGTCCGGCTT GCCCGCATCG TCGCTTCTTC AGCGCTGCGC TTGCCCTACC AATTCAGTTC CACAGCCAAA GTTCTTGACC AAGTCATCGC CGAGCTTGAA GCCAACTGCA TCCCGGCGTG GCAAACGAAA GAAGCGTATT GGCTGGCAGG CGAACTGATC CTGTTTCTCG ACGAGAACTG TCGTAGCCAC CTGGCGGGCT ACGAACTCCG CTACACCCGT GCCAACGGCC TTGAGGTCCA CCGTGACAAC TGA
|
Protein sequence | MAFDDSASPW PGRLSAAARS VWAKHDRDTG DWLPLWRHMA DSAAVAGRLW DEWLPRQVRR LIAAALPGGE DDARRLAVWL AATHDIGKAT PAFACQVEEL AAAMRRTGLD MPLHKQMPDR KDAPHNLAGQ ILLQQWLIER HGWHKRKTLP WTIVAGGHHG VPPTHDELRK LAQSRELLCT SSYEDMWRGV QTELLDTAAA ACGVTERLGE WQHVDLPQPV QVLLTALVIV ADWIASNADL FPYFPDGVAT DPERIDTAWK QLDLPHPWQA VEPEEDPATL FATRFDLPKG ARIRPVQEEA VRLARALPAP GMMIIEAPMG EGKTEAALAV TEIFAARSGA GGCFIALPTM ATGNAMFPRM LHWLKRLPNK AGTHSVFLAH SKAALNEEYT TLARQDNGRI TDVDRDGTEG EWQPRSDERV APAQLVAHHW LRGRKKGMLS SFVTGTIDQL LFTGLKSRHL ALRHLAMAGK VVVVDEAHAY DTYMNSYLDR VLSWLGAYGV PVVVLSATLP ARRRRELLEA YAGAASIDSG FAEVEKADGY PLLTAVAPGT APRIVAPAAS GRGTTVWLER IDDDLDTLAD RLEEELAEGG CALVVRNTVD RVHEAAAHLR QRFGNDQVSV AHARFVDADR VENDTRLLET FGSPDKVAQA GGSRPTKHIV VASQVVEQSL DVDFDLLVSD LAPIDLLLQR MGRLHRHQRG KEQCERPPRL RAARCLITGV DWTTRVPQPV KGSRTIYRDY PLLRSLAVLE PYLAETAGQG QAIRLPDAIH PLVQAAYGSE TVGPAEWQET LEKARIADDA HQADQRGRAR DFLLGPVKKP GQPLTGWVNA GLGDADDTRR GAAQVRDSQE TLEVLVVQRR ADGTLTTLPW LGKNRGGLKL PTDAAPPVRL ARIVASSALR LPYQFSSTAK VLDQVIAELE ANCIPAWQTK EAYWLAGELI LFLDENCRSH LAGYELRYTR ANGLEVHRDN
|
| |