Gene Cfla_0165 details

Gene Information

Locus tag: Cfla_0165
Symbol:
ID: 9144031
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cellulomonas flavigena DSM 20109
Kingdom: Bacteria
Replicon accession: NC_014151
Strand:
Start bp: 197910
End bp: 200456
Gene length: 2547 bp
Protein length: 848 aa
Translation table: 11
GC content: 77%
IMG OID:
Product: protein of unknown function DUF404
Protein accession: YP_003635283
Protein GI: 296128033
COG category:
COG ID:
TIGRFAM ID:
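
The coordinate and length fields above are arithmetically related, and a quick consistency check can be sketched as follows. This is a minimal illustration, not part of the original record: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon (the gene sequence in this record ends in TGA). The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS length is a whole number of codons and, by default,
    that the final codon is a stop codon that is not translated.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record for Cfla_0165.
length = gene_length_bp(197910, 200456)
print(length)                     # 2547
print(protein_length_aa(length))  # 848
```

Under these assumptions the computed values match the listed gene length (2547 bp) and protein length (848 aa).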


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.089788
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00349746
Fosmid hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGACCGACC TGCTCTCGTC GCCGCGCCCC GTGGGCGACG GGACGGTCGT GCACCGCCCG 
GACTGGACGT GGCTGCCGCC CGGCACGGCC CCCACGGAGG CCGACCTCGC GCGCGCCGCC
CGCCAGGCCG AGCAGCTCCT CGCCGCGCAC GGGGTCACCT ACGGTGCCGA GGCGGTGGAC
GGCGACCACC CCTGGCGCCT GGACCCCGAG CCCGTCGTCG TCGACGAGCC CGAGTGGACG
CGCCTCGAGG CCGCGCTCAC GCAGCGTGCG GAGCTGCTCG ACGCGGTCCT GCACGACCTG
TACGGCCCGC GTCGCCTGCT CGACGACCGC ATCCTGCCGC CGACGACCGT GCTCGCCCAC
CCCGGGTTCC TGCGCGCCGT CGACGGGCTG CGGCTGCCCG GGGGGCGCGA GCTGGTCCTG
TCCGCCACCG ACCTCGTGCG CGACCGCGCC GGGGAGTGGT GCGTCGTCGC CGACCGCACC
CAGGCGCCGT CGGGCGCCGG GTACGCGATG GAGGACCGGC GCATCACCGC GCAGGTGCTC
GCGCCCGTCT ACCGGCAGGC GCCCATCGCG CGGCTCGGGC CGTTCTTCCA CGCGCTGCGC
AAGGCCCTGC GCGAGGTCGC GCCGCCCACC GCGGGCGACG AGCCGCGTGC GGTCCTGCTC
TCCCCCGGCC CCGCGAGCGA GACGGCGTTC GACCAGGCGT ACCTCGCCTC GATGCTCGGT
CTCCCGCTCG TCGAGGGCAG CGACCTCGTG GTGCGGGCGG GCCGCGTGTA CCTGCAGGGC
ATCGACGGGC TCGAGCACGT CGACGTCGTG CTGCGGCGCG TCGACGGCGA CTGGTGCGAC
CCGCTCGACC TGCGCGCCGG CTCCCGCCTC GGCGTGCCCG GGCTCGTGCA CGCCGTGCGG
CAGGGCACCG TGAGCGTCGT CAACCCGCTC GGCACGTCCG TGCTGGACAA CCCCGCGCTG
CTCGCGTACC TGCCGCGGCT CGCGCGCGCC GTGCTCGACC AGGACCTCAC GCTGGCGTCC
GCGCCCACGT GGTGGTGCGG CGAGGAGCGC GCGCTGCGGC ACGTGCTCGG GCGCCTCGAC
CGGCTCGTGC TCAAGCCGGT CGTGCACGGT GCGGAGAGCA CGACCGTGGT CGGCGAGCGG
CTGAGCGCGG CCGAGCGCGA GGACCTCGCG GCGCGCATCA CCGCCGAGCC GTGGCGGTGG
GTCGGCCAGG AGCGCGTCGG ACCGGAGGAG CCCGGGTCCC GCGCCGCGGT CCTGCGGACC
TTCGCGGTCG CGCACGCCGG GTCGTACACC GTGATGTCCG GGGGGCTTGC GCGCGTCGCC
GACGACCCCG TGGTCACGTC GTCCGCCCCG GGCGCGGTGG CCAAGGACGT CTGGGTGCTC
ACCTCCCGGC CCGCCGCGAC GGGCGCGGTG CTGCGGGAGG ACGACGCCGC GGCCGGTGGC
CGCACGCTCG CCTACGGCAT CTCGCCGCGC GCCGCCGAGA ACCTCTACTG GATGGGCCGC
TACGCCGAGC GCGCCGAGGA CGGTGTGCGC GTGCTGCGGG CCGTCGCCGA CCGGTGGGAC
GACTACCACC GCACGCCCGG CACGGCGGGC GGCCAGGCCC TGGCGGTGCT CCTGCAGGCG
CTCACCCCCG CGGCCCTGCC CGACGGCGGC GAGGCCGCGG TCCCCGCGCC CGAGCACGTC
GGACCGCGCG TGCCGGCGCT GCGCGACCTG CTGCTCGACC GCCGCACCCC CGGCTCCGTC
GCACGTGCCG TGCACCGGCT GCGCACCTCG GCCGCGACCG TGCGCGACCA GCTCTCCACC
GACACGTTCG GCCCGATCGC GCGCATCGAG AGCACGCTGC GCGACGAGCG CGCGCGGCTG
CGGGCGCGCC GGCAGCCCGA CGCCGGTCTC GCGGCGCCCG CGTCCGTCAC CGCGGGTCTG
CGCCCCACGC TCGACGGCGT CCTGGAGAGC CTGCTCGCGA TCTCCGGCAT CGCCGCCGAG
GGCCTCACAC GGGACGTCGG GTGGCACCTG CTCGACGCGG GCCGCCGCAT CGAGCGCGCG
CAGCGGCTCG TGGCCATGCT CCGGGCGACC CTCGTCGAGC ACCGGCCCGC CGAGGTCGAG
GACCTGCTGC TCGAGTCCGT GCTGCTGGCC ACCGAGTCCG CGATCACCTA CCGCCGTCGG
CACCAGTCGC GCACCGACGT CGCGCGCGTC CTCGACCTCC TCGTCCACGA CCGCACCAAC
CCGCGCTCGC TGGCGTTCGC GCTCGACCGG CTGCTCGCGG ACCTCGAGGC GGTGCCCGCG
CCGCGGTCCG CCACGCAGCG CGACCACCTG CTGCACGGCG TCGCCGGGCT CGTCGCCGAG
CTCGACACCG TCGTCGTCGG CAACGAGGTG TCCGACGACG GGCGGCGCGT ACGGCTCGCC
GACGCGCTGG ACTCGATGCT GTGGCGGCTG CGCGAGGCGT CCGACGAGAT CGAGCGCGTG
CACTTCGTGC GGCCCGCGCC GAGCCGGGCG CTCGACGACG TGTGGGGCGC GGGCACGGAC
GACCGGACCG GGGAGGAGGA GCGGTGA
 
Protein sequence
MTDLLSSPRP VGDGTVVHRP DWTWLPPGTA PTEADLARAA RQAEQLLAAH GVTYGAEAVD 
GDHPWRLDPE PVVVDEPEWT RLEAALTQRA ELLDAVLHDL YGPRRLLDDR ILPPTTVLAH
PGFLRAVDGL RLPGGRELVL SATDLVRDRA GEWCVVADRT QAPSGAGYAM EDRRITAQVL
APVYRQAPIA RLGPFFHALR KALREVAPPT AGDEPRAVLL SPGPASETAF DQAYLASMLG
LPLVEGSDLV VRAGRVYLQG IDGLEHVDVV LRRVDGDWCD PLDLRAGSRL GVPGLVHAVR
QGTVSVVNPL GTSVLDNPAL LAYLPRLARA VLDQDLTLAS APTWWCGEER ALRHVLGRLD
RLVLKPVVHG AESTTVVGER LSAAEREDLA ARITAEPWRW VGQERVGPEE PGSRAAVLRT
FAVAHAGSYT VMSGGLARVA DDPVVTSSAP GAVAKDVWVL TSRPAATGAV LREDDAAAGG
RTLAYGISPR AAENLYWMGR YAERAEDGVR VLRAVADRWD DYHRTPGTAG GQALAVLLQA
LTPAALPDGG EAAVPAPEHV GPRVPALRDL LLDRRTPGSV ARAVHRLRTS AATVRDQLST
DTFGPIARIE STLRDERARL RARRQPDAGL AAPASVTAGL RPTLDGVLES LLAISGIAAE
GLTRDVGWHL LDAGRRIERA QRLVAMLRAT LVEHRPAEVE DLLLESVLLA TESAITYRRR
HQSRTDVARV LDLLVHDRTN PRSLAFALDR LLADLEAVPA PRSATQRDHL LHGVAGLVAE
LDTVVVGNEV SDDGRRVRLA DALDSMLWRL REASDEIERV HFVRPAPSRA LDDVWGAGTD
DRTGEEER