Gene Cfla_3193 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCfla_3193 
Symbol 
ID9147109 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCellulomonas flavigena DSM 20109 
KingdomBacteria 
Replicon accessionNC_014151 
Strand
Start bp3552476 
End bp3554035 
Gene Length1560 bp 
Protein Length519 aa 
Translation table11 
GC content72% 
IMG OID 
Productglycoside hydrolase family 26 
Protein accessionYP_003638274 
Protein GI296131024 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000819014 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCTGAGC ACACCGGAAA GCACTGGTGG ACCCTCACGG GTCGCCTCGG TCTCGTCGGC 
AAGCTGAGCG CGTTCGTCCT GGCGCTGGCC CTCGGTGTCG TGACCGGCAT CGTGTGGCTC
TCACCGTCGG GCGCCACCCC CGGGCCCCTG AAGACGGCGG CCGAGAAGCA GCTCGAGCAG
GAGAACGCCG ACCTCGAGGC GATCCTGCGG GACCGCGACG AGGAGCTCGA GAACCTTCGG
CGCGGGCAGG AGAAGGCCGC CGCCGAGCGT GCCGCCGGCG TGGCGTCGGG CACGGCGAAG
GCGGAGGCCG AGAAGGCCGC CGCGGCGGAG CGCGGCAGCG CCAAGGCGCA GGCCGAGAAG
GCCGCCGCCG CTGCGCGCGG CAAGCAGAAG GCCGCCGCGG ACAAGGCTGC GGCGCAGGCG
AGCGCCCAGC AGAAGGCGGC TCTCGAGCGC CGGCTCGCCG ACGCCAGGGG CGACGCCGAG
GGCGCCGCCA TCCGCGAGGC CGTCGCGCGG TCGCAGGTCG AGTTCGCCCG GGCGGCGGCC
GCGGCCCGCG CGGAGGCCGC CAGGCCCGTC CGCCCGACCG CGCCCGCCAG GGACGCGATC
GTGCACCCCA CGGACCGCTA CTTCGGCCTG TACACCGCCC AGTCGCCGTT CTCCTGGGCG
GAGTTCGACG AGGTCAGCGC GACCGTCGGC GTCCAGCCGA ACCTGGGCGG GTACTTCCAG
GGCTGGGACA CGCCGTTCCG CCCCGACGCG GTCGAGCGCT CCTGGACCAA GGGCGTCCTG
CCCATGGTGA CGTGGGAGTC CCGTCCGATG GAGGCGAGCA ACAGCCAGGC GACCGAGCCC
GAGTACTCCT TGCCGCGCAT CATCGGCGGC GCGTTCGACG ACTACCTGCG GCAGTACGCC
CGGGACGTCA AGGCGCTCGG GCTGCCCGTC GCGATCCGCC TCAACCACGA GATGAACGGC
GGCTGGTACC CGTGGGGCGA GCTCGGCTCC AACGGCGTGC AGGTCAACGG CAACAACCGC
GGTGACTACG TGAAGATGTG GCGGCACGTG CACGACATCT TCCAGGCCGA GGGCGCCAAC
GAGCACGTCA TCTGGGTCTG GTCGCCGAAC ATCGTCAACA ACCTGCCCCA GCGCAACGTG
TCGTTGGCGT ACACGGCGAG CATGTACCCG GGGGACGAGT ACGTCGACTG GGTCGGCCTG
TCCGGCTACT ACCGCCCGCC GTTCGCGGCG GACCAGACCG CGACGTTCTC GTACACGTTC
GACCGGTCGC TGAAGCAGCT GCGCGCCATC ACGAGCAAGC CGATCCTCCT CGCGGAGATC
GGCGCCTCGG AGACCGGCGG GGGCGACAAG CCCGCCTGGG TCGCGGACCT GTTCCGCTCG
CTCGCGAAGC CGGAGAACTC CGACGTCATC GGGTTCGCCT GGTTCCACCA CACCGTCACG
ACGATCAGCG GCGGCCAGCG GGTCACCAAC GACTGGAAGA TCACCTCGCG CGAGGACTCC
CAGCGGGCCT TCGTCGACGG CATCCACGCA CCTTCCGCCG GCTTCGTCCG CGGCAACTGA
 
Protein sequence
MSEHTGKHWW TLTGRLGLVG KLSAFVLALA LGVVTGIVWL SPSGATPGPL KTAAEKQLEQ 
ENADLEAILR DRDEELENLR RGQEKAAAER AAGVASGTAK AEAEKAAAAE RGSAKAQAEK
AAAAARGKQK AAADKAAAQA SAQQKAALER RLADARGDAE GAAIREAVAR SQVEFARAAA
AARAEAARPV RPTAPARDAI VHPTDRYFGL YTAQSPFSWA EFDEVSATVG VQPNLGGYFQ
GWDTPFRPDA VERSWTKGVL PMVTWESRPM EASNSQATEP EYSLPRIIGG AFDDYLRQYA
RDVKALGLPV AIRLNHEMNG GWYPWGELGS NGVQVNGNNR GDYVKMWRHV HDIFQAEGAN
EHVIWVWSPN IVNNLPQRNV SLAYTASMYP GDEYVDWVGL SGYYRPPFAA DQTATFSYTF
DRSLKQLRAI TSKPILLAEI GASETGGGDK PAWVADLFRS LAKPENSDVI GFAWFHHTVT
TISGGQRVTN DWKITSREDS QRAFVDGIHA PSAGFVRGN