Gene Tfu_1074 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTfu_1074 
Symbol 
ID3580066 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermobifida fusca YX 
KingdomBacteria 
Replicon accessionNC_007333 
Strand
Start bp1254785 
End bp1256110 
Gene Length1326 bp 
Protein Length441 aa 
Translation table11 
GC content69% 
IMG OID637684769 
Productendoglucanase 
Protein accessionYP_289135 
Protein GI72161478 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG5297] Cellobiohydrolase A (1,4-beta-cellobiosidase A) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.0857507 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCCCCA GACCTCTTCG CGCTCTTCTG GGCGCCGCGG CGGCGGCCTT GGTCAGCGCG 
GCTGCTCTGG CCTTCCCGTC GCAAGCGGCG GCCAATGATT CTCCGTTCTA CGTCAACCCC
AACATGTCCT CCGCCGAATG GGTGCGGAAC AACCCCAACG ACCCGCGTAC CCCGGTAATC
CGCGACCGGA TCGCCAGCGT GCCGCAGGGC ACCTGGTTCG CCCACCACAA CCCCGGGCAG
ATCACCGGCC AGGTCGACGC GCTCATGAGC GCCGCCCAGG CCGCCGGCAA GATCCCGATC
CTGGTCGTGT ACAACGCCCC GGGCCGCGAC TGCGGCAACC ACAGCAGCGG CGGCGCCCCC
AGTCACAGCG CCTACCGGTC CTGGATCGAC GAATTCGCTG CCGGACTGAA GAACCGTCCC
GCCTACATCA TCGTCGAACC GGACCTGATC TCGCTGATGT CGAGCTGCAT GCAGCACGTC
CAGCAGGAAG TCCTGGAGAC GATGGCGTAC GCGGGCAAGG CCCTCAAGGC CGGGTCCTCG
CAGGCGCGGA TCTACTTCGA CGCCGGCCAC TCCGCGTGGC ACTCGCCCGC ACAGATGGCT
TCCTGGCTCC AGCAGGCCGA CATCTCCAAC AGCGCGCACG GTATCGCCAC CAACACCTCC
AACTACCGGT GGACCGCTGA CGAGGTCGCC TACGCCAAGG CGGTGCTCTC GGCCATCGGC
AACCCGTCCC TGCGCGCGGT CATCGACACC AGCCGCAACG GCAACGGCCC CGCCGGTAAC
GAGTGGTGCG ACCCCAGCGG ACGCGCCATC GGCACGCCCA GCACCACCAA CACCGGCGAC
CCGATGATCG ACGCCTTCCT GTGGATCAAG CTGCCGGGTG AGGCCGACGG CTGCATCGCC
GGCGCCGGCC AGTTCGTCCC GCAGGCGGCC TACGAGATGG CGATCGCCGC GGGCGGCACC
AACCCCAACC CGAACCCCAA CCCGACGCCC ACCCCCACTC CGACCCCCAC GCCGCCTCCC
GGCTCCTCGG GGGCGTGCAC GGCGACGTAC ACGATCGCCA ACGAGTGGAA CGACGGCTTC
CAGGCGACCG TGACGGTCAC CGCGAACCAG AACATCACCG GCTGGACCGT GACGTGGACC
TTCACCGACG GCCAGACCAT CACCAACGCC TGGAACGCCG ACGTGTCCAC CAGCGGCTCC
TCGGTGACCG CGCGGAACGT CGGCCACAAC GGAACGCTCT CCCAGGGAGC CTCCACAGAG
TTCGGCTTCG TCGGCTCTAA GGGCAACTCC AACTCTGTTC CGACCCTTAC CTGCGCCGCC
AGCTGA
 
Protein sequence
MSPRPLRALL GAAAAALVSA AALAFPSQAA ANDSPFYVNP NMSSAEWVRN NPNDPRTPVI 
RDRIASVPQG TWFAHHNPGQ ITGQVDALMS AAQAAGKIPI LVVYNAPGRD CGNHSSGGAP
SHSAYRSWID EFAAGLKNRP AYIIVEPDLI SLMSSCMQHV QQEVLETMAY AGKALKAGSS
QARIYFDAGH SAWHSPAQMA SWLQQADISN SAHGIATNTS NYRWTADEVA YAKAVLSAIG
NPSLRAVIDT SRNGNGPAGN EWCDPSGRAI GTPSTTNTGD PMIDAFLWIK LPGEADGCIA
GAGQFVPQAA YEMAIAAGGT NPNPNPNPTP TPTPTPTPPP GSSGACTATY TIANEWNDGF
QATVTVTANQ NITGWTVTWT FTDGQTITNA WNADVSTSGS SVTARNVGHN GTLSQGASTE
FGFVGSKGNS NSVPTLTCAA S