Gene Tfu_1601 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTfu_1601 
Symbol 
ID3581088 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermobifida fusca YX 
KingdomBacteria 
Replicon accessionNC_007333 
Strand
Start bp1844722 
End bp1847634 
Gene Length2913 bp 
Protein Length970 aa 
Translation table11 
GC content65% 
IMG OID637685295 
ProductCRISPR-associated helicase Cas3 family protein protein 
Protein accessionYP_289659 
Protein GI72162002 
COG category[R] General function prediction only 
COG ID[COG1203] Predicted helicases 
TIGRFAM ID[TIGR01587] CRISPR-associated helicase Cas3 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0436721 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTTTCG ATGACTCTGC GTCGCCGTGG CCGGGCAGAC TGTCTGCTGC AGCACGGTCC 
GTGTGGGCCA AACACGACCG TGACACGGGT GACTGGTTAC CGCTGTGGCG GCACATGGCA
GACAGTGCCG CGGTTGCAGG CCGGCTGTGG GACGAATGGC TACCCCGCCA GGTGCGGCGC
CTGATCGCCG CGGCCCTGCC CGGCGGGGAA GACGACGCCC GGCGGCTAGC GGTGTGGCTG
GCCGCAACCC ACGACATTGG GAAAGCCACT CCGGCATTCG CCTGCCAGGT CGAGGAGCTG
GCTGCCGCCA TGCGCCGCAC CGGCCTGGAC ATGCCGCTGC ACAAGCAGAT GCCGGACCGT
AAGGACGCCC CCCATAACCT GGCCGGGCAG ATCCTGCTCC AGCAGTGGCT CATCGAACGG
CATGGTTGGC ACAAGCGCAA AACACTTCCG TGGACGATCG TCGCGGGCGG CCACCACGGG
GTGCCACCCA CGCACGATGA GCTGAGGAAG CTCGCCCAGT CCCGCGAACT GCTGTGTACC
TCCTCCTATG AAGACATGTG GCGCGGTGTC CAGACCGAAC TGCTCGACAC TGCTGCCGCG
GCCTGCGGGG TGACTGAGCG GCTGGGGGAG TGGCAGCACG TCGACCTGCC TCAACCAGTG
CAGGTACTCC TCACCGCGCT GGTCATTGTT GCCGACTGGA TAGCCAGCAA CGCCGACCTT
TTCCCCTACT TTCCCGACGG GGTCGCCACC GACCCTGAAC GCATCGACAC AGCCTGGAAG
CAACTGGACC TGCCACACCC TTGGCAGGCA GTGGAACCGG AAGAGGATCC CGCAACCCTG
TTCGCCACCC GCTTCGACTT GCCGAAAGGG GCACGGATCC GTCCCGTTCA GGAAGAAGCC
GTGCGACTGG CCCGCGCCCT CCCCGCCCCG GGAATGATGA TCATCGAAGC GCCGATGGGG
GAGGGGAAAA CGGAGGCGGC GCTCGCTGTC ACCGAGATCT TCGCTGCCCG CTCCGGAGCC
GGCGGATGCT TCATCGCCCT GCCAACCATG GCCACCGGGA ATGCCATGTT CCCGCGGATG
CTCCACTGGT TGAAGCGTCT CCCCAACAAA GCCGGTACGC ACTCGGTGTT CCTGGCACAC
TCCAAGGCTG CTCTCAACGA GGAGTACACC ACGCTGGCCC GCCAGGACAA TGGACGGATC
ACCGACGTCG ACCGCGACGG CACAGAAGGG GAATGGCAGC CCCGCAGCGA TGAACGAGTG
GCCCCCGCTC AACTCGTCGC TCACCACTGG CTGCGCGGCC GGAAGAAAGG AATGCTGTCC
TCGTTTGTGA CAGGCACCAT CGACCAGCTT CTCTTCACCG GACTGAAGAG CCGCCACCTG
GCGTTACGCC ACCTCGCGAT GGCCGGAAAA GTGGTTGTCG TCGACGAAGC CCACGCCTAC
GACACCTACA TGAACTCCTA CCTCGACCGG GTACTGTCCT GGCTGGGTGC CTACGGCGTG
CCAGTAGTGG TACTTTCCGC GACCCTGCCG GCTCGTCGCC GCCGCGAACT CCTCGAAGCC
TATGCCGGGG CCGCAAGTAT CGACAGCGGT TTCGCCGAGG TGGAGAAAGC AGACGGCTAC
CCACTGCTCA CCGCAGTCGC CCCGGGAACA GCCCCGCGTA TCGTGGCGCC AGCCGCCTCA
GGCCGCGGCA CCACCGTGTG GCTGGAACGC ATAGACGACG ACCTGGACAC ACTCGCGGAC
CGGTTGGAAG AAGAGCTCGC CGAGGGCGGC TGTGCCCTAG TCGTCCGCAA CACGGTCGAC
CGAGTCCACG AGGCCGCCGC CCACCTGCGC CAGCGTTTCG GGAACGACCA GGTGAGCGTC
GCCCACGCGC GTTTCGTGGA TGCGGACCGG GTCGAGAACG ACACGCGGCT ACTGGAAACC
TTCGGCTCCC CCGACAAAGT CGCGCAAGCG GGAGGCAGCC GCCCCACAAA ACACATTGTG
GTCGCCAGCC AGGTAGTGGA ACAGTCTTTA GACGTTGACT TCGACCTGCT GGTCAGCGAT
CTCGCACCAA TCGACCTGCT GTTGCAGCGG ATGGGCCGCC TGCACCGCCA CCAGCGCGGG
AAAGAGCAGT GCGAGCGTCC CCCGCGGCTG CGCGCCGCGC GCTGCCTGAT CACCGGGGTG
GACTGGACAA CCAGGGTCCC CCAACCAGTG AAAGGCTCTC GGACCATCTA CCGGGACTAT
CCACTGCTGC GCTCCCTGGC GGTCCTGGAA CCCTACCTCG CCGAGACTGC TGGGCAAGGA
CAAGCGATCC GCCTGCCGGA CGCTATCCAC CCGCTGGTGC AGGCAGCCTA CGGCTCCGAA
ACGGTCGGCC CCGCCGAATG GCAGGAAACC CTGGAAAAAG CCCGAATCGC CGACGACGCC
CACCAAGCCG ACCAGCGCGG CCGTGCCAGG GACTTCCTCC TCGGACCAGT GAAGAAACCC
GGACAACCCC TGACCGGATG GGTAAACGCC GGGTTGGGAG ACGCCGACGA CACTCGCCGC
GGTGCCGCCC AAGTCCGTGA CAGTCAGGAA ACCCTGGAAG TCCTCGTCGT GCAGCGGCGC
GCCGACGGTA CGCTCACCAC CCTGCCGTGG CTAGGCAAGA ACCGCGGCGG GTTGAAACTG
CCCACCGATG CTGCTCCTCC TGTCCGGCTT GCCCGCATCG TCGCTTCTTC AGCGCTGCGC
TTGCCCTACC AATTCAGTTC CACAGCCAAA GTTCTTGACC AAGTCATCGC CGAGCTTGAA
GCCAACTGCA TCCCGGCGTG GCAAACGAAA GAAGCGTATT GGCTGGCAGG CGAACTGATC
CTGTTTCTCG ACGAGAACTG TCGTAGCCAC CTGGCGGGCT ACGAACTCCG CTACACCCGT
GCCAACGGCC TTGAGGTCCA CCGTGACAAC TGA
 
Protein sequence
MAFDDSASPW PGRLSAAARS VWAKHDRDTG DWLPLWRHMA DSAAVAGRLW DEWLPRQVRR 
LIAAALPGGE DDARRLAVWL AATHDIGKAT PAFACQVEEL AAAMRRTGLD MPLHKQMPDR
KDAPHNLAGQ ILLQQWLIER HGWHKRKTLP WTIVAGGHHG VPPTHDELRK LAQSRELLCT
SSYEDMWRGV QTELLDTAAA ACGVTERLGE WQHVDLPQPV QVLLTALVIV ADWIASNADL
FPYFPDGVAT DPERIDTAWK QLDLPHPWQA VEPEEDPATL FATRFDLPKG ARIRPVQEEA
VRLARALPAP GMMIIEAPMG EGKTEAALAV TEIFAARSGA GGCFIALPTM ATGNAMFPRM
LHWLKRLPNK AGTHSVFLAH SKAALNEEYT TLARQDNGRI TDVDRDGTEG EWQPRSDERV
APAQLVAHHW LRGRKKGMLS SFVTGTIDQL LFTGLKSRHL ALRHLAMAGK VVVVDEAHAY
DTYMNSYLDR VLSWLGAYGV PVVVLSATLP ARRRRELLEA YAGAASIDSG FAEVEKADGY
PLLTAVAPGT APRIVAPAAS GRGTTVWLER IDDDLDTLAD RLEEELAEGG CALVVRNTVD
RVHEAAAHLR QRFGNDQVSV AHARFVDADR VENDTRLLET FGSPDKVAQA GGSRPTKHIV
VASQVVEQSL DVDFDLLVSD LAPIDLLLQR MGRLHRHQRG KEQCERPPRL RAARCLITGV
DWTTRVPQPV KGSRTIYRDY PLLRSLAVLE PYLAETAGQG QAIRLPDAIH PLVQAAYGSE
TVGPAEWQET LEKARIADDA HQADQRGRAR DFLLGPVKKP GQPLTGWVNA GLGDADDTRR
GAAQVRDSQE TLEVLVVQRR ADGTLTTLPW LGKNRGGLKL PTDAAPPVRL ARIVASSALR
LPYQFSSTAK VLDQVIAELE ANCIPAWQTK EAYWLAGELI LFLDENCRSH LAGYELRYTR
ANGLEVHRDN