Gene Cfla_1332 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCfla_1332 
Symbol 
ID9145212 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCellulomonas flavigena DSM 20109 
KingdomBacteria 
Replicon accessionNC_014151 
Strand
Start bp1477450 
End bp1480482 
Gene Length3033 bp 
Protein Length1010 aa 
Translation table11 
GC content73% 
IMG OID 
ProductEndo-1,3(4)-beta-glucanase 
Protein accessionYP_003636429 
Protein GI296129179 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.137473 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00217358 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCAACAC ATCACAGGAC CCCCCAGGGG TCCCCATCCC ACCATGCCCG CAGCAGGGGC 
CCGGCGCCGC ACGCGCAGCG GCGGACAGCG GCCGGAACCG CACTGGCGCT GGCCGTCGCA
GCCGCACTCG TCGTCGTGCC GTCGGGCGCG TCGACGGCGG CCGAGGGCGT CGTCGACGTC
GGTGCCGGCG GGTACGCCGC CGCGCCCGTC GGTCCGACGC CCGAGGGGTG CGACTCGATC
GAGGCCGATC CCCGCTCGGC CCTGACCGAG GACGCGCCGC AGGGCCCGCT GCCGACCAAC
GACTGGTGGT CGTCGCTGCT GTACAAGCGC CTCGACTGCC GCATGAGCGA ACCCCTGCAC
GCGCACCCCG CGTCCTACCA GCCCACCCCC GCGGGCCTGG GGATCTCGAC GCCTCGCGAG
GCGACGCTGT CCGGGACGAA GGGCGGCATC GGTGAGTTCC ACTTCACGTA CGTCCAGGAC
GTCCTGGTCG GTGTCGCGGG CCTCGACGCC CCGACGGTCC AGGTCGCCGG CTGGACCGAC
TGGACCGTGA CACCGTCGTG GTCCGACGGG ACGCGTTCGC TGCGCGCCAC GATCGGGCAC
GGCCTGCCCA CGTCCTGGTA TCACGTCGAG GGCGGTGACG CCCTGCTGCG CTCGCAGCAC
GACGTGCGCG TCTGGCAGCG CGACGGCAGC ACCGTCGGGT TCACGGCCAA CGGTCACGAC
TACGCGGCGT TCGCGCCGTC GGGCGCGAGC TGGGACGTCT CCGGTTCCAC GCTGCGCTCC
TCGCTCGCGG GCCAGGGCTA CCTCGCGGTG ACGGCGCTTG CCACGGCCGC CGACGCGACC
GACGCCGACC GCGAGGCGGC GCTCCAGGCG GTCGCCGGCT CCGCGTTCGC CGAGGTCACC
AGCACCGAGG CGAGCTACAG CTACGACGCT GCCGGCGCCG TGGTCTCGAC GACCTACGAG
ATCGGGACGA GCGCGCTCGA GGGCTACGCC GAGGGTGCCG TCGTCGCGCT CTACCCGCAC
CAGCAGCGCT ACCTCGCGGA CGTCGACGGC GACGAGCTCG ACGCGACCTA CCCGAGCCCG
CGCGGCACGA TGACGGCGTA CGCGGGCACC ACGTCGTTCA CCACCGAGAC GCCGTTCACC
GGCATCCTCC CCGAGGTCCC GGCCGTCGCG ACGGCCGACG GCGAGGGCCG TGCCACGCTC
GACCGCCTGC TCGCCGAGGC CGCAGCCGAC CCGCTGCCGA TCCTGCGGGC CGACACCTAC
TGGACCGGCA AGGCCCTGGG CCGTGCCACG CGGATCATCG AGATCGCCGA CCAGCTCGGC
GAGACCGAGG TGCGCGACCG CACTCTGCGG CTGGTCCGCG ACACCCTGAC CGACTGGTTC
ACCGCCGAGC CCGGCAAGTC CGAGCAGGTC TTCGCCTACG ACGAGCGCTG GGGCACCCTC
ATCGGCTACC CCGCGTCCTA CGGCTCGGAC ACCGAGCTCA ACGACCACCA CTTCCACTAC
GGCTACTTCA TCGCCGCAGC GGCCACGCTC GCACGCTTCG ACCCCGCGTG GGCGTCGGAC
GAGCAGTACG GCGGCATGGT CGACCTGCTG ATCCGCGACG CCAACGGGTA CGACCGCGCC
GAGACGCGGT TCCCGTACCT GCGGGACTTC GACATCTACG CCGGGCACGA CTGGGCCTCG
GGACACGGTG CGTTCGCGGC CGGCAACAAC CAGGAGTCGA GCTCCGAGGG CCAGAACTTC
GCGGGCGCGC TCGTCCAGTG GGGCGAGGCG ACGGGGAACA CGGCGGTGCG CGACGCCGGT
GCGTACCTCT ACGCCACGCA GGCCGCGACG ATCCAGGAGT ACTGGTTCGA CCAGGCCAAG
GCGATCCCGG ACGAGTTCGG CCACACGACC CTCGGCATGG TCTGGGGCGA CGGCGGCACG
TACTCGACGT GGTTCTCCGC CGAGGCGGAG ATGATCCAGG GCATCAACAC GCTGCCCATC
ACCGGCTCGC ACCTGTACCT CGGTATCCGG CCCGACGACG TGGTCGAGAA CTACGCCGAG
CTCGTCAAGG CCAACGGCGG CAAGCCCACG GTCTGGCAGG ACATCCTGTG GAGCTATCTC
GCGCTCGGTA ACGGCGAGGA GGCGCTGGAG CAGCTGGAGG CCGACCCCGG CTACGCCGTC
GAGGAGGGCG AGTCGCGGGC GCACACCTAC CACTGGGTCG CCAACCTGGC GGCGCTGGGG
AACCTCGACA CCACCGTGCG CGGGTCGAGC CCGCTGTCGG CGGCGTTCGT CAAGGACGGT
GCGCGGACGT ACGTCGCCGC CAACGTCTCG TCGAAGGCCC GCACGGTCGT CTTCAGTGAC
GGGACGAAGG TCGAGGTCCC GCCGGGCAAG ACCGTCGCGA CGGGTGCGCA CACGTGGTCC
GGCGGCGGCG CGGTCGGCGG TCCCGGTGGT CCGCAGCCGA CCACCGAGCC GAAGCCGACG
GTGACGCCGG CGCCCACGGC CACGCCGAAG CCGACGCCCA CGGCCACGCC GAAGCCGACG
GTGAGCCCGA AGCCGACCCC GAAGCCCCAG CCGACGACGC CCGCGGGCGG GTTCCGTCTG
GCCTTCGGTC CCGGCGGCAC GCTCGTGCCG TCGCCGGGCG CGCCGGGCGC CCTCGAGGTG
CCGGCGGCGC GCGGGATCGA CACGGCCACC GAGGCGCCGG ACGCGGTCGT GGCGCAGGCC
ACGGGCCTGA ACGGCACCGC CACGGGCGCC GCGACGGCGT TCGACCTCGC GCTGGACGCG
GGCACCCGGG TCGGCAACGG CACGCGCGTG TCGGTCTCGT ACGACCTCAC GGGCGACGGG
ACGTGGGACC GGGTCGAGGT GTACCGGTAC TTCGCGACCG ACCCGGTGCC GGGCCCCGAG
CGCTACACGC AGACCGTCGG CCTGGACCGG GTGACCGGAG AGCTCGGTGA CCTGCGCAAC
GGCACGGTGC GGGTCGCGGT GTGGAACGCG ATCGGCAGCA GCCCGACGTC GGTGAGCACC
GGTGACTCCG TGGTGGAGCT GCCGTTCCGC TGA
 
Protein sequence
MSTHHRTPQG SPSHHARSRG PAPHAQRRTA AGTALALAVA AALVVVPSGA STAAEGVVDV 
GAGGYAAAPV GPTPEGCDSI EADPRSALTE DAPQGPLPTN DWWSSLLYKR LDCRMSEPLH
AHPASYQPTP AGLGISTPRE ATLSGTKGGI GEFHFTYVQD VLVGVAGLDA PTVQVAGWTD
WTVTPSWSDG TRSLRATIGH GLPTSWYHVE GGDALLRSQH DVRVWQRDGS TVGFTANGHD
YAAFAPSGAS WDVSGSTLRS SLAGQGYLAV TALATAADAT DADREAALQA VAGSAFAEVT
STEASYSYDA AGAVVSTTYE IGTSALEGYA EGAVVALYPH QQRYLADVDG DELDATYPSP
RGTMTAYAGT TSFTTETPFT GILPEVPAVA TADGEGRATL DRLLAEAAAD PLPILRADTY
WTGKALGRAT RIIEIADQLG ETEVRDRTLR LVRDTLTDWF TAEPGKSEQV FAYDERWGTL
IGYPASYGSD TELNDHHFHY GYFIAAAATL ARFDPAWASD EQYGGMVDLL IRDANGYDRA
ETRFPYLRDF DIYAGHDWAS GHGAFAAGNN QESSSEGQNF AGALVQWGEA TGNTAVRDAG
AYLYATQAAT IQEYWFDQAK AIPDEFGHTT LGMVWGDGGT YSTWFSAEAE MIQGINTLPI
TGSHLYLGIR PDDVVENYAE LVKANGGKPT VWQDILWSYL ALGNGEEALE QLEADPGYAV
EEGESRAHTY HWVANLAALG NLDTTVRGSS PLSAAFVKDG ARTYVAANVS SKARTVVFSD
GTKVEVPPGK TVATGAHTWS GGGAVGGPGG PQPTTEPKPT VTPAPTATPK PTPTATPKPT
VSPKPTPKPQ PTTPAGGFRL AFGPGGTLVP SPGAPGALEV PAARGIDTAT EAPDAVVAQA
TGLNGTATGA ATAFDLALDA GTRVGNGTRV SVSYDLTGDG TWDRVEVYRY FATDPVPGPE
RYTQTVGLDR VTGELGDLRN GTVRVAVWNA IGSSPTSVST GDSVVELPFR