Gene Cfla_1741 details

Gene Information

Locus tag: Cfla_1741
Symbol:
ID: 9145630
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cellulomonas flavigena DSM 20109
Kingdom: Bacteria
Replicon accession: NC_014151
Strand:
Start bp: 1935988
End bp: 1937865
Gene Length: 1878 bp
Protein Length: 625 aa
Translation table: 11
GC content: 75%
IMG OID:
Product: malto-oligosyltrehalose trehalohydrolase
Protein accession: YP_003636837
Protein GI: 296129587
COG category:
COG ID:
TIGRFAM ID:
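
The length fields above follow from the coordinates. The short Python sketch below is an illustrative consistency check, not part of the record: it assumes that Start bp and End bp are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1935988, 1937865)
print(length, protein_length_aa(length))  # prints "1878 625"
```

Under those assumptions the result agrees with the reported Gene Length (1878 bp) and Protein Length (625 aa).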


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.0932578
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACCGGGG GGGACGCGAC GAAGGGCACG ACGGAGGGCA CGGCAGGCAG CGCACGGCGG 
CTCACGCCGC GCGTGTGGGC GCCGTACCCG CAGCGCGTGG AGCTCGTGCT GCCCGGCAGC
GACGAGCGCA CGGCGATGGT GCGTGACGAC GAGGGGTGGT GGACCGCGCC TGCCCCGCTG
GCGCACGGCA CGGACTACGG GTTCTCGCTC GACGGCGGGC CGCCACGTCC CGACCCGCGC
GCCGCGTGGC TGCCCCACGG CGTGCACGGG CCGGCGCGCA CCTTCGACCC CTCGCGCTTC
ACGTGGACGG ACGCGGGGTG GAGCGGCGTC GACGTCCGCG GCGCGGTGAC GTACGAGCTG
CACGTCGGCA CGTTCACGCC CGCCGGGACG CTGGCCGCGG CGGCGGAGCG CCTCGAGCAC
CTCGTGCGGC TCGGTGTCGA CGTGGTCGAG CTGATGCCGC TCGCCGCGTT CAACGGTCGG
CACGGCTGGG GCTACGACGG CGTCTCGCTC TACGCCGTGC ACGAGCCGTA CGGCGGTCCC
GAGGCGCTGC AGGCGTTCGT CGAGGCCGCG CACGCATACG GCCTGGCGGT GTGCCTCGAC
GTCGTGCACA ACCACCTCGG CCCGTCGGGC AACTACCTGG GCGAGCTCGG TCCGTACTTC
ACGGACGCGC ACCGCACGCC GTGGGGCGAC GCGGTGAACC TCGACGGACC GGGCTCGGAG
CACGTGCGGC GGTGGATCTG CGACTCGGTC CTGGGCTGGG CACGCGACTT CCACGTCGAC
GCGTTCCGGC TCGACGCCGT GCACGCGCTG CGCGACGACT CCCCGCGCCA CCTGCTCGCC
CAGCTCTCCG ACGAGGTCGC CGACCTGGCA GCCGAGCTGG GCCGGCCGAT CGGCCTGGTC
GCGGAGTCCG ACCTCAACGA CGTCGTCAGC CTGACCACCA CGCAGGACGG CGGCTGGGGC
ATGACGGGCC AGTGGGCCGA CGACGTGCAC CACGCCGTGC ACGCCCTGGT GTCGGGCGAG
CGGCACGGCT ACTACTGCGA CTTCGGCACG CCCGAGGTGC TGCGCACGGC CCTCACGCGG
GTGTTCGTGC ACGACGGGTC GATGTCGACC TTCCGCGGGG AGCCGTGGGG GGCGCCGGTA
CCGGACGACG TCGACGGGCA CCGGTTCGTC GTGTTCGGCG CCAACCACGA CCAGGTGGGC
AACCGTGCAC TGGGCGACCG GCCGGCCGCG CACGACGACG CGGGCGGCCT CGCCGTGCGG
GCCGCGCTCG TCCTGCTCTC GCCGTTCACG CCGCTGGTGT TCATGGGCGA GGAGTGGGGC
GCCCGCACGC CGTGGCGGTT CTTCACGGAC CATCCGGAGC CGGAGCTTGC CGCCGCGGTG
CGCGAGGGTC GGACGCGGGA GTTCGGCGGG CACGGCTGGA CCGACCTCTA CGGCGGCCCG
GTCGACGTGC CGGACCCGCA GGATCCCGGG ACGTTCGCGG CGAGCGTCCT GGACTGGGAC
GAGCCGGCAC GACCGGAGCA CGCGCGCCTG CTGGAGTGGC ACCGTGTGCT CGTCGCGCTG
CGGCGCGCGG TGCCGGACCT CGCGTCCGGG GACCGCCACC GCACGTCGCT CGACGTCCAC
GAGGTCACGC CGGACGTCGA CCCGCACGGC ACGCAGGAAC CGGGCGCCGG TGGGTGGCAC
GGCGTGCTCG TGCTGCACCG CGGTGACGCA CGTGTCGTGC TCAACCTCGC GCACCGGCCC
GTCGCGGTGC CCGTGCCGGT GGCGCGCCCC GTGCGGGTGG TCGCCGCGTG GGACGGGGGC
ACGGTGCACG CGCCCGGCGG GGCGGACGAG CCGCTGGTCG TCGACGTCCC CGCCCGCAGC
GTCGTGGTAC TGGCCTGA
 
Protein sequence
MTGGDATKGT TEGTAGSARR LTPRVWAPYP QRVELVLPGS DERTAMVRDD EGWWTAPAPL 
AHGTDYGFSL DGGPPRPDPR AAWLPHGVHG PARTFDPSRF TWTDAGWSGV DVRGAVTYEL
HVGTFTPAGT LAAAAERLEH LVRLGVDVVE LMPLAAFNGR HGWGYDGVSL YAVHEPYGGP
EALQAFVEAA HAYGLAVCLD VVHNHLGPSG NYLGELGPYF TDAHRTPWGD AVNLDGPGSE
HVRRWICDSV LGWARDFHVD AFRLDAVHAL RDDSPRHLLA QLSDEVADLA AELGRPIGLV
AESDLNDVVS LTTTQDGGWG MTGQWADDVH HAVHALVSGE RHGYYCDFGT PEVLRTALTR
VFVHDGSMST FRGEPWGAPV PDDVDGHRFV VFGANHDQVG NRALGDRPAA HDDAGGLAVR
AALVLLSPFT PLVFMGEEWG ARTPWRFFTD HPEPELAAAV REGRTREFGG HGWTDLYGGP
VDVPDPQDPG TFAASVLDWD EPARPEHARL LEWHRVLVAL RRAVPDLASG DRHRTSLDVH
EVTPDVDPHG TQEPGAGGWH GVLVLHRGDA RVVLNLAHRP VAVPVPVARP VRVVAAWDGG
TVHAPGGADE PLVVDVPARS VVVLA
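
The sequences above are printed in rows of six 10-character blocks, so the spaces and line breaks have to be removed before any computation. The following Python sketch shows one way to do that and to compute a GC percentage comparable to the GC content field. It is a minimal illustration with hypothetical helper names, and it assumes the sequence contains only the letters A, C, G and T.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a (possibly blocked) nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence, copied from the record.
first_row = "GTGACCGGGG GGGACGCGAC GAAGGGCACG ACGGAGGGCA CGGCAGGCAG CGCACGGCGG"
print(len(clean_sequence(first_row)))  # 60 bases in one full row
print(round(gc_percent(first_row), 1))
```

Passing the complete Gene sequence block to `gc_percent` gives the value for the whole gene, which can then be compared with the reported GC content.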