Gene Cfla_3558 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCfla_3558 
Symbol 
ID9147474 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCellulomonas flavigena DSM 20109 
KingdomBacteria 
Replicon accessionNC_014151 
Strand
Start bp3946852 
End bp3949314 
Gene Length2463 bp 
Protein Length820 aa 
Translation table11 
GC content68% 
IMG OID 
Productglycoside hydrolase family 10 
Protein accessionYP_003638629 
Protein GI296131379 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCCCC CATCCATGCT CCGAGCACAG CGCCGGGCAG CGCGAGGCGC GTCGCTCGTC 
GCCGTGCTCG CCCTCTCGGT CACCTGTGCC GCCGCCATCC CGGCGCAGGC CGCCGGCTCG
ACGCTGCAGG AGGCAGCAGC CATCAGCGGC CGCTACTTCG GCACCGCGAT CGCCGCGGGC
CGTCTGAACG ACTCCACCTA CTCGTCCATC GCGAACCGTG AGTTCAACAT GATCACGGCC
GAGAACGAGA TGAAGATGGA CGCCACGGAG CCGAACCAGA ACCAGTTCAA CTTCTCCCAG
GGCGACCGCA TCTACAACTG GGCCGTGCAG AACGGCAAGC GGGTTCGTGG GCACGCTCTC
GCGTGGCACT CCCAGCAGCC GGGCTGGATG CAGAACATGG GCGGCACGCA GCTGCGCAAC
GCCATGCTGA ACCACGTCAC GAAGGTCGCG GAGTACTACA AGGGCAAGAT CTACGCCTGG
GACGTCGTGA ACGAGGCGTT CGCCGACGGC AACGGCGGCG GTCGTCGCAA CTCGAACCTC
GAGCAGACCG GTTCGGACTG GATCGAGGCC GCCTTCCGCG CGGCTCGCTC GGCCGACCCG
AGCGCGAAGC TCTGCTACAA CGACTACAAC ATCGACAACT GGAACTGGGA CAAGACGCAG
GCCGTGTACC GCATGGTCCG CGACTTCAAG TCCCGTGGTG TCCCGATCGA CTGCGTCGGC
CTCCAGTCCC ACTTCAACTC CGGTAGCGCG TACAACAGCA ACTACCGCAC CACGATCTCC
AGCTTCGCCG CCCTGGGCGT CGAGGTGCAG ATCACGGAGC TGGACATCGA GGGCTCGGGC
TCGCAGCAGG CACAGACGTA CGCGAACGTC GTCAACGACT GCCTCGCCGT GCCGCGCTGC
ACCGGCATCA CCGTGTGGGG CGTGCGCGAC ACCGACTCCT GGCGCGCCTC GGGCACGCCG
CTGCTGTTCG ACGGCTCGGG CAACAAGAAG CAGGCGTACA CCTCCACGCT CAACGCGCTG
AACGCCGCGA CGCCGGTCCA GACGACGCCG CCGACGCAGA CGACGCCGCC CACCCAGACG
ACGCCGCCGA CGCAGACGAC GCCGCCGACC CAGACGACGC CGCCCACCCA GACGACGCCG
CCCACCCAGA CCACCCCGCC GACCCAGACG ACGCCGCCGT CCCAGCCCGG CGCGACGCTG
CAGGCCGCGG CGGCCCGCAC GGGCCGGTAC TTCGGTGTGG CCCTGGCGGC GGGCAAGCTC
AACGACTCGA CCTACACGAC CATCGCGAAC CGTGAGTTCA ACATGGTGAC GGCCGAGAAC
GAGATGAAGA TGGACGCCAC GGAGCCGAAC CAGAACCAGT TCAACTTCTC CCAGGGCGAC
CGGATCCTGA ACTGGGCGAC GCAGAACGGC AAGCAGGTGC GTGGGCACGC GCTGGCGTGG
CACTCCCAGC AGCCGGGCTG GATGCAGAAC ATGTCCGGCA CGCAGCTGCG CAACGCGATG
CTCAACCACG TCACGCGGGT CGCGACGTAC TACAAGGGCA AGATCCACAG CTGGGACGTG
GTGAACGAGG CGTTCGCCGA CGGCAACGGC GGTGCGCGTC GTGACTCGAA CCTGCAGCGC
ACCGGTGACG ACTGGATCGA GGCCGCGTTC CGCGCCGCGC GCGCCGCGGA CCCGGGCGCC
AAGCTCTGCT ACAACGACTA CAACACCGAC AACTGGACGT GGGACAAGAC CCAGGCCGTC
TACCGCATGG TCCGTGACTT CAAGTCGCGC GGCGTGCCGA TCGACTGCGT GGGTTTCCAG
TCGCACTTCA ACGCCCAGTC GGCGTACAAC AGCAACTACC GCACGACGCT GTCGAGCTTC
GCCGCCCTGG GTGTCGAGGT GCAGATCACC GAGCTGGACA TCGAGGGCTC GGGTCAGCAG
CAGGCGCAGA CGTACGCGAA CGTCGTGAAC GACTGCCTCG CCGTGCCGGC CTGCAAGGGC
ATCACGGTCT GGGGCGTGCG TGACTCCGAC TCGTGGCGCT CGTACGGCAC CCCGCTGCTG
TTCGACAACT CGGGCAACAA GAAGGAGGCC TACACGGCCA CCCTCAACGC CCTGAACGCG
GGCCAGACCC CGACCGAGAC GACGCCGCCG ACCGAGACGA CGCCGCCCAC GCAGACCACC
CCGCCGACGG AGACCACCCC GCCCCCGGGC AACGGGGCCT GCTCCGCGGC GCTGACGGTC
GCCAACTCGT GGCCCGGTGG CTACCAGGGC ACCGTCACGG TCACGGCCGG CTCGTCCGCG
ATCAACGGCT GGCGCGTCAC GCTCGGCAGC GTGTCGACCA ACAACGTGTG GAACGGCACC
CTGTCCGGCG GTGTCGTCTC GAACGCCCCG TACAACGGGT CGCTCGGGGC GGGCCAGTCG
ACGACCTTCG GGTTCGTCGG CTCGGGCAGC GCACCGGCGA GCACCTCGCT GGCCTGCGCC
TGA
 
Protein sequence
MTPPSMLRAQ RRAARGASLV AVLALSVTCA AAIPAQAAGS TLQEAAAISG RYFGTAIAAG 
RLNDSTYSSI ANREFNMITA ENEMKMDATE PNQNQFNFSQ GDRIYNWAVQ NGKRVRGHAL
AWHSQQPGWM QNMGGTQLRN AMLNHVTKVA EYYKGKIYAW DVVNEAFADG NGGGRRNSNL
EQTGSDWIEA AFRAARSADP SAKLCYNDYN IDNWNWDKTQ AVYRMVRDFK SRGVPIDCVG
LQSHFNSGSA YNSNYRTTIS SFAALGVEVQ ITELDIEGSG SQQAQTYANV VNDCLAVPRC
TGITVWGVRD TDSWRASGTP LLFDGSGNKK QAYTSTLNAL NAATPVQTTP PTQTTPPTQT
TPPTQTTPPT QTTPPTQTTP PTQTTPPTQT TPPSQPGATL QAAAARTGRY FGVALAAGKL
NDSTYTTIAN REFNMVTAEN EMKMDATEPN QNQFNFSQGD RILNWATQNG KQVRGHALAW
HSQQPGWMQN MSGTQLRNAM LNHVTRVATY YKGKIHSWDV VNEAFADGNG GARRDSNLQR
TGDDWIEAAF RAARAADPGA KLCYNDYNTD NWTWDKTQAV YRMVRDFKSR GVPIDCVGFQ
SHFNAQSAYN SNYRTTLSSF AALGVEVQIT ELDIEGSGQQ QAQTYANVVN DCLAVPACKG
ITVWGVRDSD SWRSYGTPLL FDNSGNKKEA YTATLNALNA GQTPTETTPP TETTPPTQTT
PPTETTPPPG NGACSAALTV ANSWPGGYQG TVTVTAGSSA INGWRVTLGS VSTNNVWNGT
LSGGVVSNAP YNGSLGAGQS TTFGFVGSGS APASTSLACA