Gene Cfla_3049 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCfla_3049 
Symbol 
ID9146961 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCellulomonas flavigena DSM 20109 
KingdomBacteria 
Replicon accessionNC_014151 
Strand
Start bp3392862 
End bp3395291 
Gene Length2430 bp 
Protein Length809 aa 
Translation table11 
GC content75% 
IMG OID 
Productglycoside hydrolase family 3 domain protein 
Protein accessionYP_003638131 
Protein GI296130881 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.340429 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.82066 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCGAGA CCCTCGGCAC CGCCGCCCAG CCCGCGGCCC CGTCGGACGC CCTGCCGCCC 
GTCCTGCCCG TCGTCTCCCA GCGGGTCCGT GACCTGCACG CGCGCATGAC GCTCGAGGAG
AAGCTGGCCC AGCTGGTCGG CTACTGGTTG GACCAGAACG GCACCGTCGC CCCGATGCAG
TCGGAGATGG CCGCCGGCCA GAAGGGCTCC GACGAGCTCG CGGAGATCAC GCGGCACGGC
CTGGGCCACT ACACGCGCGT CTACGGCACG CGCCCGGTCG ACCCGGCCGA GCGCGCCGCG
TGGCTGTGGG CCGAGCAGCG CCGGCTCAAG CGCGAGACGC GCCTGGGCAT CCCGGCGCTG
GTGCACGAGG AGTGCCTCAC GGGCCTCGCC GCCTGGAAGG CGGCGAGCTA CCCGACGCCG
CTCGCGTGGG GCGCGTCGTT CGACCCCGAG CTCGTCCACG CCGCCGCGCG CGCGATCGGC
GACTCGATGC GCGAGCTCGG CATCCACCAG GGCCTCGCGC CCGTCCTCGA CGTCGTCGGC
GACCCCCGCT GGGGCCGCGT CGACGAGTGC ATCGGGGAGG ACCCGTACCT CGTCGGGACC
GTCGGCACGG CGTACGTGCG CGGTCTGCAG GAGGCCGGCG TCCACGCGAC GCTCAAGCAC
TTCGTCGGGT ACTCCGCGTC CGCCGCGGGC CGTAACCACG CGCCCGTGCA CGCGGGCCCG
CGCGAGCTCG CCGAGATCTA CCTCCCGCCG TTCGAGATGG CCGTGCGCGA CGGCGGCGTC
CGCTCGGTGA TGAACTCCTA CGCGGACGTC GACGGCGTGC CCGTCGCGGC CGACCCGCAC
TACCTCACCG AGGTGCTGCG CGAGCAGTGG GGCTTCGACG GCGTCGTCGT CGCCGACTAC
TTCGCGGTCG CGTTCCTGCA GGTCATGCAC CAGGTCGCGG CCGACCGCGG CGAGGCGGCA
GCGCTCGCGC TGGCCGCGGG CCTGGACATC GAGCTGCCGA CGGGCGACGC GTTCCTCGCG
CCGCTGGCCG AGCGCGTGCG CGCCGGGCTG ACCGACGAGG CCCTCGTCGA CCGGGCCGTG
CTGCGCGCGC TGGCGCAGAA GGAGGAGCTC GGGCTGCTCG ACGCCGACGC GTTCGAGGAC
GAGCCGCCCG CGCACGTCGA CCTGGACTCC CCGCGGCACC GCGAGCTCGC CCGCAAGCTG
GCCGAGGAGT CGGTCGTGCT GCTGTCGAAC GACGGCGTGC TGCCGCTGGC TCCGGGGCGG
CGCGTCGCCG TCGTCGGCCC CAACGCCGCG CGGCCCGAGG CGCTCATGGG CTGCTACTCG
TTCGCCAACC ACGTGCTGGC GCACCACCCG GGCCTGCCGC TGGGCTTCGA GATCCGCAGC
GTGCACGAGG CGCTCGCCGC GGCCGTGCCC GGCGTCACGT ACGTCGAGGG CTGCACGGTC
GAGGGCGACG ACACCGGCGG CTTCGACGCT GCCGTCGCGG CCGCGGCCGA CGCGGACGTC
GCGGTCGTGG TGGTCGGCGA CCAGGCCGGG CTGTTCGGCC GCGGCACGGT GGGCGAGGGC
AACGACGTGC AGTCGCTCGA GCTGCCGGGC GTGCAGCGGC AGCTCGTCGA GGCGCTGGTG
GCCACGGGGA CGCCCGTCGT CATGCTGCTG CTCACCGGCC GCCCGTACGC GATCGGCTGG
GCACTCGACG GGCAGGGCGC CAAGCCCGCC GCCGTGCTGC AGGCGTTCTT CCCCGGCGAG
GGCGGCGGCG ACGCGATCGC CGACCTGCTC ACGGGCGTCG CCAACCCGTC CGGGCGTCTG
CCCGTCTCGC TGCCGCGCGC CGCGGGCGCG CAGCCGTTCC GCTACCTGCA CCCCGTGCTC
GGGGGCCCGT CCGACGTCAC GTCGACCGAC CCGACGCCCG TGCGGCCCTT CGGGTTCGGG
CTGTCGTACA CGACCTTCGC GTACGACGAC CTCGCGGTCG ACGAGACGGT CGAGTCGGCC
GGCACGTTCA CGACGTCCGT CACGGTGACG AACACCGGTG ACGTCGACGG CGCCGAGGTC
GTGCAGCTCT ACGGGCGGGA CGTCGTCGCG TCCGTCGTGC GCCCCGTCGT GCAGCTCCTC
GGGTACGCGC GCGTCGAGCT CGCCGCGGGG CAGTCCCGCC GCGTGACGTT CCGCGTCCCC
ACCACGCGCC TCGCGCTGGC CGACCGCCGC CTCGTGCGCG TCGTCGAGCC CGGCGACGTG
CAGGTCTGGG TCGCCTCGCA CGCGGCCGTC GCCGCGCCGG ACGCCCCCAC GGACGCCACG
GGCGGCGCCA TCACCAGCAC GCGCGAGCAC GAGAAGCGCA CACTGCCGGG GCAGAGCACA
CCGCACGCCG TCCTGCGGGT CACGGGCGCC GTGCACGAGA TCACCGCCGA GGACCGGCGG
ATCGTCGACG TGGAGGTCAA CGACGCATGA
 
Protein sequence
MTETLGTAAQ PAAPSDALPP VLPVVSQRVR DLHARMTLEE KLAQLVGYWL DQNGTVAPMQ 
SEMAAGQKGS DELAEITRHG LGHYTRVYGT RPVDPAERAA WLWAEQRRLK RETRLGIPAL
VHEECLTGLA AWKAASYPTP LAWGASFDPE LVHAAARAIG DSMRELGIHQ GLAPVLDVVG
DPRWGRVDEC IGEDPYLVGT VGTAYVRGLQ EAGVHATLKH FVGYSASAAG RNHAPVHAGP
RELAEIYLPP FEMAVRDGGV RSVMNSYADV DGVPVAADPH YLTEVLREQW GFDGVVVADY
FAVAFLQVMH QVAADRGEAA ALALAAGLDI ELPTGDAFLA PLAERVRAGL TDEALVDRAV
LRALAQKEEL GLLDADAFED EPPAHVDLDS PRHRELARKL AEESVVLLSN DGVLPLAPGR
RVAVVGPNAA RPEALMGCYS FANHVLAHHP GLPLGFEIRS VHEALAAAVP GVTYVEGCTV
EGDDTGGFDA AVAAAADADV AVVVVGDQAG LFGRGTVGEG NDVQSLELPG VQRQLVEALV
ATGTPVVMLL LTGRPYAIGW ALDGQGAKPA AVLQAFFPGE GGGDAIADLL TGVANPSGRL
PVSLPRAAGA QPFRYLHPVL GGPSDVTSTD PTPVRPFGFG LSYTTFAYDD LAVDETVESA
GTFTTSVTVT NTGDVDGAEV VQLYGRDVVA SVVRPVVQLL GYARVELAAG QSRRVTFRVP
TTRLALADRR LVRVVEPGDV QVWVASHAAV AAPDAPTDAT GGAITSTREH EKRTLPGQST
PHAVLRVTGA VHEITAEDRR IVDVEVNDA