Gene Tfu_2990 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTfu_2990 
Symbol 
ID3581914 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermobifida fusca YX 
KingdomBacteria 
Replicon accessionNC_007333 
Strand
Start bp3501674 
End bp3502774 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content69% 
IMG OID637686721 
Productcellulose-binding family II protein 
Protein accessionYP_291046 
Protein GI72163389 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG5297] Cellobiohydrolase A (1,4-beta-cellobiosidase A) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value0.826139 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCCGTA CCTGGGCACG TGCGGTCACC GGCGCGTGCA GCGCACTGGT GGTAGCAGCG 
GCCACGCTCG TCATCGGCGG CGCCCCGGCA GCCTCGGCAG CCGACGGCTG CTCTGCGGAA
TACACGGTTG CCAGCGACTG GGGAAGCGGT TTCGTCGGCA ACGTCACCGT CACCAACACC
AGCGGAACGC CCGCCACCGG GTGGACGGTC CAGTGGACCC TGCCGTCCGG TCACACCATC
ACCAACACGT GGAACGCCGA ACTGTCCGTG AACGGCTCCA CAGTGACCGC CACCAACGCC
TCGTGGAACG GGTCGCTCCC CGTGGGCGGC AGCGCGTCCT TCGGTTTCCA GGGCACGGGG
TCGGGCGCCT CCTCCCTCCC CACCGACATC GCGTGCTTCC TCGACGCCCC GGGCGGCGGC
ACCCCGGGCG GCCCTAACGA GCCCGGTGGC CCCGATGAGC CCGGCGGTCC TGGAACCCCG
GGCGAACCGG TGCGGATCAT GCCGCTGGGC GACTCGATCA CCGGCTCCCC CGGCTGCTGG
CGGGCCCTCC TGTGGCGTGA CCTGACCGAC GCCGGATACA CCAACATCGA CTTCGTCGGC
TCCCGCGCCG GCGACGGCTG CGGCTTCCCC TACGACCACG AAAACGAAGG CCACGGGGGC
ATGCTGGTCA CCAACCTGGC TCGCAGCGGA CAACTGTCCA CCTGGCTGTC CGCCACCAAC
CCCGACATCG TGCTCATGCA CTTCGGCACC AACGACGTCT GGAGCTCTCT GCCCACCCAG
ACGATCCTCG ACGCCTACAG CACGCTGGTC AGTCAGATGC GGGCCAACAA CCCGAACATG
ACGATCCTCG TGGCCCAGAT CATCCCTATG GACTCGGCGC GAAGCTGCGC CACCTGCGCC
CAGGGCGTGC AAGCCCTCAA CGCTGCGATC CCCGCGTGGG CGGCCAGCGA AAGCACCGCC
CAGTCCCCCG TCATCGTCGT GGACCAGTGG ACCGGATTCG ACACTGACGC CGACACCTAC
GACGGGGTGC ACCCCAACGC TTCCGGGGAC GCCAAGATCG CGCAGAACTG GCTGGAGGCG
CTGATCCCGC TGCTTGACTA A
 
Protein sequence
MSRTWARAVT GACSALVVAA ATLVIGGAPA ASAADGCSAE YTVASDWGSG FVGNVTVTNT 
SGTPATGWTV QWTLPSGHTI TNTWNAELSV NGSTVTATNA SWNGSLPVGG SASFGFQGTG
SGASSLPTDI ACFLDAPGGG TPGGPNEPGG PDEPGGPGTP GEPVRIMPLG DSITGSPGCW
RALLWRDLTD AGYTNIDFVG SRAGDGCGFP YDHENEGHGG MLVTNLARSG QLSTWLSATN
PDIVLMHFGT NDVWSSLPTQ TILDAYSTLV SQMRANNPNM TILVAQIIPM DSARSCATCA
QGVQALNAAI PAWAASESTA QSPVIVVDQW TGFDTDADTY DGVHPNASGD AKIAQNWLEA
LIPLLD