Gene Cfla_0245 details

Gene Information

Locus tag: Cfla_0245
Symbol:
ID: 9144111
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cellulomonas flavigena DSM 20109
Kingdom: Bacteria
Replicon accession: NC_014151
Strand:
Start bp: 283694
End bp: 286699
Gene Length: 3006 bp
Protein Length: 1001 aa
Translation table: 11
GC content: 68%
IMG OID:
Product: glycoside hydrolase family 10
Protein accession: YP_003635363
Protein GI: 296128113
COG category:
COG ID:
TIGRFAM ID:
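
The coordinates and lengths in the table above are related by simple arithmetic, which the following minimal sketch checks. It assumes 1-based, inclusive coordinates and a gene length that includes the stop codon; neither convention is stated in the record, but both are consistent with the listed values. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 283694
END_BP = 286699
GENE_LENGTH_BP = 3006
PROTEIN_LENGTH_AA = 1001


def span_length(start: int, end: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming the stop codon is part of the CDS."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # 3006
    print(expected_protein_length(GENE_LENGTH_BP))  # 1001
```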


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGAAACA CCAGATCCTC CGTTCCCCGC CGCACCCGGG GGCTACGCGC TGCCATCAGC 
AGCGTCGCCA CCGGTGCCCT CCTCGCGACC AGCGTCCTGC TCGCGGCCCC TGCCTCCGCG
CAGACGATCA CCAGCAACGG CACCGGCACC CACAGCGGGT GGTGGTACTC GTTCTGGACC
GACTCCCCGG GCACCGTCTC GGCGACGATG AAGAGCGGGG GCAGCTACTC CACGTCGTGG
CGCAACACCG GCAACTTCGT CATCGGCAAG GGCTGGAACC GCGGTGACGC GACCAAGGTC
GTGAACTACT CCGGGTCCTT CAACCCGTCC GGCAACGCCT ACCTCACGCT CTACGGGTGG
ACCAACGGCC CGCTGATCGA GTACTACATC GTCGACAACT GGGGCACCTA CCGCCCGACC
GGCACCTTCA AGGGCACGGT CACCTCCGAC GGCGGCACGT ACGACATCTA CGAGACCACC
CGCGTCAACG CCCCCTCCAT CGAGGGCGAC CGCTCGACGT TCAAGCAGTT CTGGTCCGTG
CGCCAGCAGA AGCGCAACGG CGGCGGCACC ATCACCGCGG CGAACCACTT CAACGCGTGG
TCCCGTGCGG GCATGCAGCT CGGCACGCAC AACTACATGA TCTTCGCGAC CGAGGGCTAC
CAGTCCTCCG GCTCCTCCGA CCTGACCGTC ACCTCCGGCG GCGGCAACAC CGGCGGCAAC
ACCGGCGGCA ACACCGGCGG CGGCACGTCC AACGGCTGCA CCGTGACGGC CACGCGCGGC
GAGTCGTGGT CCGACAGGTT CAACGTCACG TACACCGTGT CCGGCGCCTC GAACTGGACC
GTGACCGTCA ACCCCGGCTC GGGCCAGTCG ATCCAGAACT CCTGGAACGC CACCCGCTCA
GGCAACACCT TCACGTCGTC CGGCTCCAAC AGCTTCGGCG TGACGTACTA CTCGGGCGGC
AACACGAGCA TCCCGTCGGC CAGCTGCAAC TCGACCGGCG GCAGCACCGG CGGGGACACC
GGGGGCAACA CGCAGAACTG CTCCGCGGGC TACGTCGCGC TGACGTTCGA CGACGGCCCG
AACACCGGCA CGACCAACTC GCTCATCAAC GCGCTGAAGT CGGCCGGCGC CACCGCCACG
GTCTTCCCGA CGGGCCAGAA CGTCGCGGCG AACCCGTCGC TGGCGAAGGC CTACGCGGAT
GCCGGGTTCC AGATCGGGAA CCACAGCTGG GACCACCCGT ACCTGACGCA GCAGAGCCAG
AGCAACCAGC AGTCGCAGCT CTCGCGCACG CAGGACGCGA TCCGCAGCGC CACCGGCCAG
ACGCCGACCA TCTTCCGGCC GCCGTACGGT GACACCAACA GCCAGCTGCA GTCCGTGGCG
TCCGGCCTCG GCCTGCGCAC GGTCACGTGG GACGTCGACT CGACGGACTA CAACAACGCG
TCGGTCCAGA CGATCCGCAA CGCGGCCGCG CGTCTGACCA GCGGCCAGAT CATCCTCATG
CACGACTGGC CGGCCAACAC CATCCAGGCG ATCCCGGGCA TCGTGCAGGA CCTGCGCTCG
CGCAACCTGT GCACCGGTCA CATCTCGTCC TCGACGGGTC GCGCTGTCGC CCCGGCAGGT
GGCGGCACGC AGCCGACGCA GCAGCCCACG ACGCAGCCGA CGACGCAGCC GACGCAGCAG
CCGACGACGC AGCCGACGGC CCAGCCGACG AACGCGCCGG CCGGCACGAC GCTGCAGGCC
GCGGCCGCTC GCACGGGCCG GTACTTCGGT GCCGCGGCGG CGAACTTCTA CCTGACCAAC
TCGGGCATCT CGCCGATCCT CAACCGTGAG TTCAACATGA TCACGGCGGA GAACGAGATG
AAGGTCGACG CGATGCAGCC CAACCAGGGC CAGTTCAACT GGAACTCGGG CAACACGATC
GTCAACTGGG CGCTGCAGAA CAACAAGCGG GTCCGCGGGC ACGCCCTGGC GTGGCACTCC
CAGCAGCCCG GCTGGATGCA GAACCAGTCC GGCACCACGC TGCGCAACTC GATGCTCAAC
CACATCACCC AGGTCGCCGG CTACTACAAG GGCAAGATCT ACGCCTGGGA CGTGGTGAAC
GAGGCCTTCG CCGACGGCTC GTCCGGCGCG CGACGTGACT CCAACCTGCA GCGCACCGGC
AACGACTGGA TCGAGGCGGC GTTCCGCGCC GCTCGCGCCG CCGACCCGCA GGCCAAGCTC
TGCTACAACG ACTACAACAC CGACAACTGG TCGCACGCCA AGACGCAGGG CGTCTACAAC
ATGGTGCGCG ACTTCAAGGC CCGCGGTGTC CCGATCGACT GCGTCGGCTT CCAGGCGCAC
TTCAACTCGG GCAACCCCGT GCCGTCGAAC TACGACGTGA CCCTGCGCAA CTTCGCCGAC
CTCGGCGTGG ACGTGCAGAT CACCGAGCTC GACATCGAGG GCTCCGGCAC CTCGCAGGCC
GAGCAGTTCC GCGGCGTGAC CCAGGCCTGC CTGTCGGTGC CGCGCTGCAC GGGTATCACC
GTGTGGGGCG TCCGCGACAG CGAGTCGTGG CGCTCCTACG GCACCCCGCT GCTGTTCGAC
GGGTCCGGCA ACAAGAAGGC CGCGTACAAC TACGTGCTCG ACGCCCTCAA CGCCGGCTCC
GGCCAGTCGA CCGGTGGCAA CACCGGCGGC AACACGGGTG GCGGCGACAC CGGCGGCAAC
ACCGGTGGCG GCGACGGCGG TGCGACGGGT GGCTACGACA GCGGCAGCTA CGTCAACAAC
GGCTGCACGG TCTCCTGGAC CCGTGACAAC GACTGGAGCG ACCGCTTCAA CGTCACCTAC
ACCGTGACGG GCAAGTCGAA CTGGACCGTG ACCATGCACC CGAACTCCGG TCAGTCGATC
CAGAGCTCCT GGAACGCCAC CCGGTCGGGC AACACGTTCA CGCCCGCCGG TAGCAACTCG
TTCGGCGTCA CCTGGTACAA GGGCTCGTCG AACAACGGCT GGATCCCGTG GGGCATCTGC
TCCTGA
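
The gene sequence is printed in blocks of ten bases separated by spaces, so it has to be joined before any calculation. The sketch below shows one way to do that and to compute a GC percentage that can be compared with the "GC content" field of the record. The helper names are hypothetical, and the record does not say how its own figure was calculated or rounded.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence (all other characters count toward the length)."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


if __name__ == "__main__":
    # Paste the full block-formatted gene sequence here to evaluate the whole CDS.
    example = "GGCC AATT"
    print(gc_content(example))  # 50.0
```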
 
Protein sequence
MRNTRSSVPR RTRGLRAAIS SVATGALLAT SVLLAAPASA QTITSNGTGT HSGWWYSFWT 
DSPGTVSATM KSGGSYSTSW RNTGNFVIGK GWNRGDATKV VNYSGSFNPS GNAYLTLYGW
TNGPLIEYYI VDNWGTYRPT GTFKGTVTSD GGTYDIYETT RVNAPSIEGD RSTFKQFWSV
RQQKRNGGGT ITAANHFNAW SRAGMQLGTH NYMIFATEGY QSSGSSDLTV TSGGGNTGGN
TGGNTGGGTS NGCTVTATRG ESWSDRFNVT YTVSGASNWT VTVNPGSGQS IQNSWNATRS
GNTFTSSGSN SFGVTYYSGG NTSIPSASCN STGGSTGGDT GGNTQNCSAG YVALTFDDGP
NTGTTNSLIN ALKSAGATAT VFPTGQNVAA NPSLAKAYAD AGFQIGNHSW DHPYLTQQSQ
SNQQSQLSRT QDAIRSATGQ TPTIFRPPYG DTNSQLQSVA SGLGLRTVTW DVDSTDYNNA
SVQTIRNAAA RLTSGQIILM HDWPANTIQA IPGIVQDLRS RNLCTGHISS STGRAVAPAG
GGTQPTQQPT TQPTTQPTQQ PTTQPTAQPT NAPAGTTLQA AAARTGRYFG AAAANFYLTN
SGISPILNRE FNMITAENEM KVDAMQPNQG QFNWNSGNTI VNWALQNNKR VRGHALAWHS
QQPGWMQNQS GTTLRNSMLN HITQVAGYYK GKIYAWDVVN EAFADGSSGA RRDSNLQRTG
NDWIEAAFRA ARAADPQAKL CYNDYNTDNW SHAKTQGVYN MVRDFKARGV PIDCVGFQAH
FNSGNPVPSN YDVTLRNFAD LGVDVQITEL DIEGSGTSQA EQFRGVTQAC LSVPRCTGIT
VWGVRDSESW RSYGTPLLFD GSGNKKAAYN YVLDALNAGS GQSTGGNTGG NTGGGDTGGN
TGGGDGGATG GYDSGSYVNN GCTVSWTRDN DWSDRFNVTY TVTGKSNWTV TMHPNSGQSI
QSSWNATRSG NTFTPAGSNS FGVTWYKGSS NNGWIPWGIC S
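
The protein sequence corresponds to the gene sequence translated with translation table 11. As a rough illustration, the sketch below builds a codon table and translates the first and last few codons of the gene. It relies on the fact that table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons, which does not matter here because this gene starts with ATG). The `translate` helper is written for this example and is not part of any library.

```python
from itertools import product

# Codon order follows the NCBI convention: bases in the order T, C, A, G,
# with the first codon position varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; '*' marks a stop, 'X' an unknown codon."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, usable, 3)
    )


if __name__ == "__main__":
    # First 30 bases and last 12 bases of the gene sequence above.
    print(translate("ATGAGAAACA CCAGATCCTC CGTTCCCCGC"))  # MRNTRSSVPR
    print(translate("ATCTGC TCCTGA"))                     # ICS*
```

The two outputs match the start (MRNTRSSVPR) and end (GICS) of the listed protein sequence, with the final TGA codon read as the stop.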