Gene Tfu_0901 details


Gene Information

Locus tag: Tfu_0901
Symbol:
ID: 3579455
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermobifida fusca YX
Kingdom: Bacteria
Replicon accession: NC_007333
Strand:
Start bp: 1062716
End bp: 1064116
Gene Length: 1401 bp
Protein Length: 466 aa
Translation table: 11
GC content: 68%
IMG OID: 637684596
Product: cellulase
Protein accession: YP_288962
Protein GI: 72161305
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG5297] Cellobiohydrolase A (1,4-beta-cellobiosidase A)
TIGRFAM ID:
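
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are inclusive, 1-based coordinates and that the gene length includes one terminal stop codon. The function names are hypothetical and are not part of any database tool.

```python
# Values copied from the Gene Information fields above.
START_BP = 1062716
END_BP = 1064116
GENE_LENGTH_BP = 1401
PROTEIN_LENGTH_AA = 466


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming inclusive 1-based start/end coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Amino-acid count, assuming the CDS length includes one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 1064116 - 1062716 + 1 gives 1401 bp, and 1401 / 3 - 1 gives 466 aa, which agrees with the listed lengths.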


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCGAAAT CCCCCGCCGC CCGGAAGGGC GGCCCTCCGG TCGCTGTCGC GGTGACCGCG 
GCCCTCGCCC TGCTGATCGC GCTCCTCTCC CCCGGAGTCG CGCAGGCCGC CGGTCTCACC
GCCACAGTCA CCAAAGAATC CTCGTGGGAC AACGGCTACT CCGCGTCCGT CACCGTCCGC
AACGACACCT CGAGCACCGT CTCCCAGTGG GAGGTCGTCC TCACCCTGCC CGGCGGCACT
ACAGTGGCCC AGGTGTGGAA CGCCCAGCAC ACCAGCAGCG GCAACTCCCA CACCTTCACC
GGGGTTTCCT GGAACAGCAC CATCCCGCCC GGAGGCACCG CCTCCTTCGG CTTCATCGCT
TCCGGCAGCG GCGAACCCAC CCACTGCACC ATCAACGGCG CCCCCTGCGA CGAAGGCTCC
GAGCCGGGCG GCCCCGGCGG TCCCGGAACC CCCTCCCCCG ACCCCGGCAC GCAGCCCGGC
ACCGGCACCC CGGTCGAGCG GTACGGCAAA GTCCAGGTCT GCGGCACCCA GCTCTGCGAC
GAGCACGGCA ACCCGGTCCA ACTGCGCGGC ATGAGCACCC ACGGCATCCA GTGGTTCGAC
CACTGCCTGA CCGACAGCTC GCTGGACGCC CTGGCCTACG ACTGGAAGGC CGACATCATC
CGCCTGTCCA TGTACATCCA GGAAGACGGC TACGAGACCA ACCCGCGCGG CTTCACCGAC
CGGATGCACC AGCTCATCGA CATGGCCACG GCGCGCGGCC TGTACGTGAT CGTGGACTGG
CACATCCTCA CCCCGGGCGA TCCCCACTAC AACCTGGACC GGGCCAAGAC CTTCTTCGCG
GAAATCGCCC AGCGCCACGC CAGCAAGACC AACGTGCTCT ACGAGATCGC CAACGAACCC
AACGGAGTGA GCTGGGCCTC CATCAAGAGC TACGCCGAAG AGGTCATCCC GGTGATCCGC
CAGCGCGACC CCGACTCGGT GATCATCGTG GGCACCCGCG GCTGGTCGTC GCTCGGCGTC
TCCGAAGGCT CCGGCCCCGC CGAGATCGCG GCCAACCCGG TCAACGCCTC CAACATCATG
TACGCCTTCC ACTTCTACGC GGCCTCGCAC CGCGACAACT ACCTCAACGC GCTGCGTGAG
GCCTCCGAGC TGTTCCCGGT CTTCGTCACC GAGTTCGGCA CCGAGACCTA CACCGGTGAC
GGCGCCAACG ACTTCCAGAT GGCCGACCGC TACATCGACC TGATGGCGGA ACGGAAGATC
GGGTGGACCA AGTGGAACTA CTCGGACGAC TTCCGTTCCG GCGCGGTCTT CCAGCCGGGC
ACCTGCGCGT CCGGCGGCCC GTGGAGCGGT TCGTCGCTGA AGGCGTCCGG ACAGTGGGTG
CGGAGCAAGC TCCAGTCCTG A
 
Protein sequence
MAKSPAARKG GPPVAVAVTA ALALLIALLS PGVAQAAGLT ATVTKESSWD NGYSASVTVR 
NDTSSTVSQW EVVLTLPGGT TVAQVWNAQH TSSGNSHTFT GVSWNSTIPP GGTASFGFIA
SGSGEPTHCT INGAPCDEGS EPGGPGGPGT PSPDPGTQPG TGTPVERYGK VQVCGTQLCD
EHGNPVQLRG MSTHGIQWFD HCLTDSSLDA LAYDWKADII RLSMYIQEDG YETNPRGFTD
RMHQLIDMAT ARGLYVIVDW HILTPGDPHY NLDRAKTFFA EIAQRHASKT NVLYEIANEP
NGVSWASIKS YAEEVIPVIR QRDPDSVIIV GTRGWSSLGV SEGSGPAEIA ANPVNASNIM
YAFHFYAASH RDNYLNALRE ASELFPVFVT EFGTETYTGD GANDFQMADR YIDLMAERKI
GWTKWNYSDD FRSGAVFQPG TCASGGPWSG SSLKASGQWV RSKLQS
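
The gene and protein sequences above are related by translation table 11, and the GC content field can be recomputed from the gene sequence. The following is a minimal sketch under stated assumptions, not the method the database uses: the helper names (`clean`, `gc_percent`, `translate`) are hypothetical, the codon table is built from the standard NCBI amino-acid string (table 11 shares its codon assignments with the standard code and differs only in permitted start codons), and translation simply stops at the first stop codon.

```python
from itertools import product

# NCBI ordering: bases T, C, A, G with the third codon position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(text: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(dna)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Codons containing characters other than A, C, G, T are reported as 'X'.
    """
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 12 bases of the gene sequence listed above.
    print(translate("ATGGCGAAAT CC"))
```

For example, the first four codons of the gene sequence (ATG GCG AAA TCC) translate to MAKS, which matches the start of the listed protein sequence. Pasting the full gene sequence into `translate` and `gc_percent` would allow the whole record to be compared against the Protein sequence and GC content fields.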