Gene Cfla_0016 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCfla_0016 
Symbol 
ID9143880 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCellulomonas flavigena DSM 20109 
KingdomBacteria 
Replicon accessionNC_014151 
Strand
Start bp18226 
End bp21198 
Gene Length2973 bp 
Protein Length990 aa 
Translation table11 
GC content72% 
IMG OID 
Productglycoside hydrolase family 9 
Protein accessionYP_003635135 
Protein GI296127885 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.114686 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGAGCA CCTCCGCCCA CCGCGCGCGC CGCCGCTCGC TGTGGGCCCG CACCGCCGCC 
GTCGCGCTCG TCGTCAGCGG CACCGTGCTG CCCCTCCAGG GCGCGCAGGC GGCGCCGGCG
TACAACTACG CCGAGGCGCT GCAGAAGTCG ATGTTCTTCT ACCAGGCGCA GCGCTCGGGC
GACCTCCCGG CGAACTACCC GGTCTCGTGG CGCGGCGACT CCGGGCTCGA CGACGGCAAG
GACGTCGGCA AGGACCTCAC GGGCGGCTGG TACGACGCCG GTGACCACGT GAAGTTCGGG
CTGCCGATGG CGTTCACCGC CACGATGCTG GCGTGGGGCG CGCTCGAGAG CCCCGACGGG
TACGCGGCGG CCGGCCAGAC CGACGAGCTG CGCGACAACC TGCGCGCCGT CAACGACTAC
TTCGTCAAGG CACACACCGC GCCCAACGAG CTGTACGTGC AGGTCGGCAA GGGCGACGAC
GACCACAAGT GGTGGGGCCC CGCCGAGGTC ATGACCATGG CCCGCCCCGC CTACAAGATC
ACGGCCGCCT GCCCGGGCTC GGACGCCGCA GCCGAGACGG CCGCCGCCAT GGCGTCCGCC
TCGCTCGTCT TCCAGGGCAG CGACCCCTCC TACGCGGCGA CGCTGCTGAC CCACGCCAAG
CAGCTCTACT CGTTCGCCGA CACCTACCGC GGCAAGTACT CCGACTGCGT CACCGACGCG
CAGTCGTTCT ACAAGTCGTG GTCCGGCTAC CAGGACGAGC TGGTCTGGGG CGCGTACTGG
CTCTACAAGG CCACGGGCGA CGCCACCTAC CTGGCGAAGG CCGAGGCGGA GTACGACAAG
CTGAGCAACG AGAACCAGAC GACGACGAAG TCGTACAAGT GGACCGTCGC GTGGGACGAC
AAGTCGTACG CGGCGTACGC GCTGCTCGCC ATGGAGACGG GCAAGCAGAA GTACGTCGAC
GACGCGAACC GCTGGCTCGA CTACTGGACC GTCGGCGTCA ACGGCGCCAA GGTGACCTAC
TCCCCCGGCG GCATGGCCGT GCTCGACTCG TGGGGCGCGC TGCGGTACGC GGCGAACACC
TCGTTCGTCG CCCTGGTGTA CTCCGACTGG CTGACCGACA GCACCCGGAA GGCGCGGTAC
CACGACTTCG GCGTCCGCCA GATCAACTAC GCGCTGGGCG ACAACCCCCG CAAGTCGTCG
TACGTCGTGG GCTTCGGGGC GAACCCGCCG AAGAACCCGC ACCACCGCAC GGCGCACGGC
TCGTGGCTGG ACTCCCTCAA GGACCCGGCC GAGACGCGGC ACGTGCTGTA CGGCGCGCTC
GTCGGCGGCC CCGGGTCGGC CAACGACGCG TACACCGACG ACCGCGGCGA CTACGTCGCC
AACGAGGTCG CCACGGACTA CAACGCCGGG TTCACGAGCG CGCTGGCGTA CCTGACGGCC
CAGTACGGCG GAACACCGCT GGCGTCGTTC CCGCGGCCCG AGACGCCGGA CCAGGACGAG
TTCTTCGTCG AGGCCAAGCT CAACCAGCCT GCCGGTGGCA CGTTCACCGA GGTCAAGGCC
GTCGTGCGCA ACCGGTCGGC GTTCCCGGCC CGGTCCCTGA CCAACGCCTC GATCCGTTAC
TGGTTCACGC TCGACGCGGG CGTCGACGCG TCCGCGGTCA CGGTCTCGAC CAACTACAGC
GAGTGCGGCA CGACGCCCTC GAAGGTCGTC CACGCCGGCG GCAGCCTGTA CTACGCCGAG
CTGAGCTGCG CCGGGCAGAA CATCCACCCG GGCGGGCAGT CGCAGCACCG GCGGGAGATC
CAGTTCCGCG TCGCCGGGAC CACCGGGTGG GACGCCACGA ACGACCCGTC GTTCGCCGGC
CTGCCGGCGT CCGGCGACCC CGTCAGGACG AAGGGCATCA CGCTGCACGA AGGCAGCACC
CTGCTGTGGG GGACGGTGCC GGGCGGGACC ACGCCGACGC CCACCCCCAC GCCCACCCCG
ACGCCGACGC CCACCCCGAC GCCGACGCCC ACGCCGACGC CCACGCCGAC GCCCTCGCCG
TCCCCGACGA TGCACCCGAT CATCGCGAGC CCGCAGGACG TGCGGCTGGT CGCCGCGGGT
GACGAGCTGC GCCTGACGTG GACCGCGCCG ACGATCGGCG CGGCCGTCAC GGGCTACCAG
ACGCAGCTCG ACGACGAGCC GCTCCAGAGC ACGCGCAACA CCTGGTACAA CCGGCTGGAC
CTGGCACCGG GCGAGCACAC GATCCGCGTC CGTTCGCAGG CGGCCGAGGG CACGTCGGCG
TGGGTCGAGC GGACCGTCAG CATCCCGGCC GACGGCGAGT GGCCGACGAC CGTGCCGGTG
CCGGCGAACT TCGAGGCCAA GGGCGGCTCG CACGGCTTCT CGTTCACCTG GGAGACGCCC
ACCGGCGGAC CGGAGGTCGT GGCGTTCGAG ACGATCATGA CGTGGTCCGG GAGCACGGGC
AGCCCGACGA TCTCGCGGGA CACCGCGCAC GCGATCATGG GGTACGACTT CCCGCGCGGT
GACTACACGG TGTCCGTCCG GTCGGTCGCC GCGGACAACT CCGTGTCGCC GTGGGTCACG
GGGACGGCCA CGATCACCGA GGGCGGGATC CCCGGGATCA CCCCGCCCCC GACCCCGCCC
CCGACCACGC CGCCGGCCGC GTCCTGCGCC GTCACGTACA CCGCGAACAG CTGGAACAGC
GGCTTCACGG CCTCGATCCG ACTGACGAAC ACCGGTGCGA CGGCCGTGAC CTGGAAGCTC
GCGTTCGACC TGGCGGCCGG GCAGAAGGTC CAGCAGGGCT GGAGCGCCAC GTGGGCCCAG
TCGGGCACCA CGGTGACGGC CACCGGTGCC GCCTGGAACG CGACCCTGGC GCCCGGCGGG
ACCATCGACC TGGGCTTCAA CGGGTCGCAC ACCGGCCAGA ACCCCGACCC GACGTCGTTC
ACGCTGAACG GCGCGACCTG CACGGCCGGG TAG
 
Protein sequence
MPSTSAHRAR RRSLWARTAA VALVVSGTVL PLQGAQAAPA YNYAEALQKS MFFYQAQRSG 
DLPANYPVSW RGDSGLDDGK DVGKDLTGGW YDAGDHVKFG LPMAFTATML AWGALESPDG
YAAAGQTDEL RDNLRAVNDY FVKAHTAPNE LYVQVGKGDD DHKWWGPAEV MTMARPAYKI
TAACPGSDAA AETAAAMASA SLVFQGSDPS YAATLLTHAK QLYSFADTYR GKYSDCVTDA
QSFYKSWSGY QDELVWGAYW LYKATGDATY LAKAEAEYDK LSNENQTTTK SYKWTVAWDD
KSYAAYALLA METGKQKYVD DANRWLDYWT VGVNGAKVTY SPGGMAVLDS WGALRYAANT
SFVALVYSDW LTDSTRKARY HDFGVRQINY ALGDNPRKSS YVVGFGANPP KNPHHRTAHG
SWLDSLKDPA ETRHVLYGAL VGGPGSANDA YTDDRGDYVA NEVATDYNAG FTSALAYLTA
QYGGTPLASF PRPETPDQDE FFVEAKLNQP AGGTFTEVKA VVRNRSAFPA RSLTNASIRY
WFTLDAGVDA SAVTVSTNYS ECGTTPSKVV HAGGSLYYAE LSCAGQNIHP GGQSQHRREI
QFRVAGTTGW DATNDPSFAG LPASGDPVRT KGITLHEGST LLWGTVPGGT TPTPTPTPTP
TPTPTPTPTP TPTPTPTPSP SPTMHPIIAS PQDVRLVAAG DELRLTWTAP TIGAAVTGYQ
TQLDDEPLQS TRNTWYNRLD LAPGEHTIRV RSQAAEGTSA WVERTVSIPA DGEWPTTVPV
PANFEAKGGS HGFSFTWETP TGGPEVVAFE TIMTWSGSTG SPTISRDTAH AIMGYDFPRG
DYTVSVRSVA ADNSVSPWVT GTATITEGGI PGITPPPTPP PTTPPAASCA VTYTANSWNS
GFTASIRLTN TGATAVTWKL AFDLAAGQKV QQGWSATWAQ SGTTVTATGA AWNATLAPGG
TIDLGFNGSH TGQNPDPTSF TLNGATCTAG