Gene Tfu_1627 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTfu_1627 
Symbol 
ID3580405 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermobifida fusca YX 
KingdomBacteria 
Replicon accessionNC_007333 
Strand
Start bp1887518 
End bp1890514 
Gene Length2997 bp 
Protein Length998 aa 
Translation table11 
GC content67% 
IMG OID637685321 
Productcellulose 1,4-beta-cellobiosidase 
Protein accessionYP_289685 
Protein GI72162028 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG5297] Cellobiohydrolase A (1,4-beta-cellobiosidase A) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.234171 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGAGCGC TCCCATGGTG GGCCTCCGCT GTGAGGTCAT CCTCCCAGTT CGAATCCCCC 
TACGGAAGGA CTTCCGTGCT TAGGAGACCC AGATCTCGAT CCCCCCTTGT CGCCCTCACC
GCGGCGACTT GCGCAGTCGC GCTCGGGGGT ACGGCGGTTC CCGCCCAGGC AGACGAAGTC
AACCAGATTC GCAACGGCGA CTTCAGCTCC GGCACCGCAC CCTGGTGGGG AACCGAGAAC
ATCCAACTCA ACGTCACCGA CGGGATGCTG TGCGTCGACG TCCCCGGCGG CACCGTCAAC
CCGTGGGACG TGATCATCGG CCAGGACGAC ATCCCCCTCA TCGAAGGTGA GTCCTACGCC
TTCTCCTTCA CTGCCTCCAG CACCGTCCCC GTCTCCATCC GCGCCCTGGT GCAAGAGCCC
GTGGAGCCGT GGACCACCCA GATGGACGAG CGTGCCCTGC TCGGCCCCGA GGCAGAAACC
TACGAATTCG TCTTCACCTC CAACGTCGAC TGGGACGACG CCCAAGTCGC CTTCCAGATC
GGCGGCTCCG ACGAACCGTG GACCTTCTGC CTCGACGACG TCGCCCTGCT CGGCGGCGCC
GAACCCCCGG TCTACGAACC CGACACCGGA CCGCGGGTCC GCGTCAACCA GGTCGGCTAC
CTCCCGCACG GTCCCAAGAA GGCGACCGTG GTCACCGACG CCACCAGCGC GCTCACCTGG
GAGCTTGCCG ACGCCGACGG TAACGTGGTC GCCAGCGGCC AGACCAAGCC GCACGGCGCG
GACTCCAGCT CCGGGCTCAA CGTCCACACC GTCGACTTCA GCTCCTACAC CACGAAGGGA
AGCGACTACA CGCTCACCGT CGACGGTGAA ACCAGCTACC CCTTCGACAT CGACGAAAGC
GTCTACGAGG AACTGCGCGT CGACGCGCTG TCGTTCTACT ACCCGCAGCG CAGCGGCATC
GAGATCCTCG ACTCCATCGC CCCCGGCTAC GGACGCCCGG CCGGCCACAT CGGCGTGCCC
CCCAACCAGG GCGATACCGA CGTGCCGTGC GCGCCCGGCA CCTGCGACTA CTCCCTGGAC
GTCTCCGGCG GCTGGTACGA CGCGGGCGAC CACGGCAAAT ACGTGGTCAA CGGCGGTATC
TCGGTGCACC AGATCATGAG CATCTACGAG CGCTCCCAGC TCGCCGACAC CGCCCAGCCC
GACAAGCTGG CCGACTCCAC CCTGCGCCTG CCCGAAACCG GCAACGGCGT GCCCGACGTG
CTCGACGAAG CACGCTGGGA GATGGAGTTC CTCCTCAAGA TGCAGGTGCC CGAAGGCGAA
CCGCTCGCCG GCATGGCGCA CCACAAGATC CACGACGAAC AGTGGACCGG GCTGCCGCTG
CTGCCCTCCG CTGACCCGCA GCCGCGCTAC CTGCAGCCGC CGTCCACCGC GGCCACGCTG
AACCTGGCCG CCACCGCCGC CCAGTGCGCT CGCGTGTTCG AACCCTTCGA CGAGGATTTC
GCCGCCGAGT GCCTGGCTGC CGCGGAAACC GCGTGGGACG CCGCCAAGGC CAACCCGAAC
ATCTACGCGC CTGCCTTCGG TGAAGGCGGC GGCCCGTACA ACGACAACAA CGTCACCGAC
GAGTTCTACT GGGCCGCGGC CGAACTGTTC CTCACCACCG GCAAGGAGGA GTACCGCGAC
GCGGTGACCT CGTCGCCGCT GCACACCGAC GACGAAGAGG TCTTCCGCGA CGGCGCCTTC
GACTGGGGAT GGACTGCTGC GCTGGCCCGC CTCCAGCTGG CCACGATCCC CAACGACCTC
GCCGACCGCG ACCGGGTGCG CCAGTCCGTG GTCGATGCCG CCGACATGTA CCTCGCCAAC
GTCGAGACCA GCCCGTGGGG CCTGGCCTAC AAGCCGAACA ACGGCGTGTT CGTCTGGGGC
TCCAACAGCG CTGTCCTCAA CAACATGGTG ATCCTGGCGG TCGCCTTCGA CCTCACCGGT
GACACCAAAT ACCGCGACGG CGTGCTGGAA GGCATGGACT ACATCTTCGG CCGCAACGCG
CTGAACCAGT CCTACGTCAC CGGCTACGGC GACAAGGACT CCCGCAACCA GCACAGCCGC
TGGTACGCCC ACCAGCTCGA CCCCCGGTTG CCCAACCCGC CCAAGGGCAC GCTGGCCGGT
GGACCCAACT CCGACTCCAC CACCTGGGAC CCGGTGGCCC AGTCCAAGCT GACCGGGTGC
GCCCCCCAGA TGTGCTACAT CGACCACATC GAGTCGTGGT CCACCAACGA GCTGACCATC
AACTGGAACG CCCCCCTGTC GTGGATCGCG TCCTTCATCG CCGACCAGGA CGACGCCGGC
GAGCCCGGCG GAGAAGAGCC CGGACCGGGC GACGACGAGA CCCCGCCGAG CAAGCCTGGG
AACCTGAAGG CCAGCGACAT CACCGCGACC AGCGCCACCC TGACCTGGGA CGCCTCCACC
GACAACGTCG GAGTGGTCGG CTACAAGGTC TCCCTGGTCC GCGACGGTGA CGCTGAAGAG
GTGGGCACCA CCGCGCAGAC CAGCTACACG CTCACCGGGC TGAGCGCGGA CCAGGAGTAC
ACCGTCCAGG TGGTCGCCTA CGACGCGGCA GGCAACCTCT CCACGCCAGC CACCGTCACC
TTCACCACCG AGAAGGAGGA CGAGACTCCC ACGCCCAGCG CCTCCTGCGC GGTGACGTAC
CAGACCAACG ACTGGCCGGG CGGCTTCACC GCCTCGGTGA CGCTGACCAA CACCGGCAGC
ACCCCGTGGG ACTCCTGGGA ACTGCGCTTC ACCTTCCCGT CGGGACAGAC TGTCAGCCAC
GGCTGGAGCG CCAACTGGCA GCAGAGCGGC AGTGACGTGA CCGCCACCTC CTTGCCGTGG
AACGGATCAG TTCCGCCGGG CGGCTCAGTC AACATCGGCT TCAACGGAAC CTGGGGCGGT
TCGAACACCA AACCTGAGAA GTTCACCGTC AACGGCGCGG TCTGCTCCAT CGGCTGA
 
Protein sequence
MGALPWWASA VRSSSQFESP YGRTSVLRRP RSRSPLVALT AATCAVALGG TAVPAQADEV 
NQIRNGDFSS GTAPWWGTEN IQLNVTDGML CVDVPGGTVN PWDVIIGQDD IPLIEGESYA
FSFTASSTVP VSIRALVQEP VEPWTTQMDE RALLGPEAET YEFVFTSNVD WDDAQVAFQI
GGSDEPWTFC LDDVALLGGA EPPVYEPDTG PRVRVNQVGY LPHGPKKATV VTDATSALTW
ELADADGNVV ASGQTKPHGA DSSSGLNVHT VDFSSYTTKG SDYTLTVDGE TSYPFDIDES
VYEELRVDAL SFYYPQRSGI EILDSIAPGY GRPAGHIGVP PNQGDTDVPC APGTCDYSLD
VSGGWYDAGD HGKYVVNGGI SVHQIMSIYE RSQLADTAQP DKLADSTLRL PETGNGVPDV
LDEARWEMEF LLKMQVPEGE PLAGMAHHKI HDEQWTGLPL LPSADPQPRY LQPPSTAATL
NLAATAAQCA RVFEPFDEDF AAECLAAAET AWDAAKANPN IYAPAFGEGG GPYNDNNVTD
EFYWAAAELF LTTGKEEYRD AVTSSPLHTD DEEVFRDGAF DWGWTAALAR LQLATIPNDL
ADRDRVRQSV VDAADMYLAN VETSPWGLAY KPNNGVFVWG SNSAVLNNMV ILAVAFDLTG
DTKYRDGVLE GMDYIFGRNA LNQSYVTGYG DKDSRNQHSR WYAHQLDPRL PNPPKGTLAG
GPNSDSTTWD PVAQSKLTGC APQMCYIDHI ESWSTNELTI NWNAPLSWIA SFIADQDDAG
EPGGEEPGPG DDETPPSKPG NLKASDITAT SATLTWDAST DNVGVVGYKV SLVRDGDAEE
VGTTAQTSYT LTGLSADQEY TVQVVAYDAA GNLSTPATVT FTTEKEDETP TPSASCAVTY
QTNDWPGGFT ASVTLTNTGS TPWDSWELRF TFPSGQTVSH GWSANWQQSG SDVTATSLPW
NGSVPPGGSV NIGFNGTWGG SNTKPEKFTV NGAVCSIG