Gene Cfla_0959 details

Gene Information

Locus tag: Cfla_0959
Symbol:
ID: 9144834
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cellulomonas flavigena DSM 20109
Kingdom: Bacteria
Replicon accession: NC_014151
Strand:
Start bp: 1062473
End bp: 1063894
Gene Length: 1422 bp
Protein Length: 473 aa
Translation table: 11
GC content: 73%
IMG OID:
Product: fumarate lyase
Protein accession: YP_003636065
Protein GI: 296128815
COG category:
COG ID:
TIGRFAM ID:
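
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1062473
end_bp = 1063894
gene_length_bp = 1422
protein_length_aa = 473

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the gene length includes the terminal stop codon,
# so the protein has one residue fewer than the number of codons.
codons = gene_length_bp // 3
expected_protein_length = codons - 1

print(span_bp == gene_length_bp)                     # coordinate span vs. listed gene length
print(gene_length_bp % 3 == 0)                       # CDS length is a whole number of codons
print(expected_protein_length == protein_length_aa)  # codons minus stop vs. listed protein length
```

Under these assumptions all three checks agree with the listed values: a 1422 bp span, 474 codons, and 473 residues.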


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.05909
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000463461
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGACTGACA TGAACGCTCC CGACGGCGCG GGCGAGTACC GCATCGAGCA CGACACGATG 
GGCGAGGTGC GCGTCCCGGC GCACGCGCTC TACCGCGCGC AGACGCAGCG CGCGGTCGAG
AACTTCCCGA TCTCGGGCAG CACGCTCGAG CGCGGGCACG TCGAGGCGCT GGCGCGCGTG
AAGAAGGCCG CGGCGAAGGC CAACGCCGAG CTGGACGTCC TGCCGCAGGA CGTCGCCGAC
GCGATCGTCG CCGCCGCCGA CGAGGTCGCC TCGGGCGTGC ACGACGCGCA CTTCCCCGTC
GACGTCTACC AGACGGGCTC GGGCACGAGC TCGAACATGA ACACCAACGA GGTCCTGGCG
ACGCTCGCGA CGCGCCTGCT GGGGCGCGAC GTGCACCCGA ACGACCACGT CAACGCGTCG
CAGTCGTCGA ACGACGTGTT CCCGACGTCG GTGCACGTGG CGGCCACGGC CGGCGTCGTG
CGCGACCTCG TGCCCGCGCT CGAGCACCTC GCCGGCGCGC TGGAGGAGAA GGCCACCGCG
TGGGCCACCG TCGTGAAGTC CGGCCGCACG CACCTCATGG ACGCCACGCC CGTGACGCTC
GGCCAGGAGT TCGGCGGGTA CGCCGCGGCC GTGCGGTACG GCGTCGAGCG CCTGCAGGCC
GCGCTCCCCC GCGCCGCGGA GGTCCCCCTG GGCGGTACCG CCGTCGGCAC CGGCATCAAC
ACCCCGGCGG GCTTCCCGCA GCGGGTCATC GCGCTGCTGG TCGAGGACAC CGGCCTGCCG
CTGACCGAGG CACGGGACCA CTTCGAGGCG CAGAGCTCGC GTGACGGCCT CGTCGAGCTG
TCGGGTGCGC TGCGCACCAT CGCCGTGAGC CTGACGAAGA TCTGCAACGA CCTGCGCTGG
ATGGGCTCGG GCCCCAACAC CGGCCTCGGC GAGATCGCGC TGCCCGACCT GCAGCCCGGC
TCGTCGATCA TGCCCGGCAA GGTCAACCCG GTCGTCCCGG AGGCCGTGCT CATGGTGTGC
TCGCGCGTCG TCGGCAACGA TGCGACGGTC GCGTGGGCGG GCGCGAGCGG CTCGTTCGAG
CTCAACGTGC AGATCCCCGT CATCGCCTCG GCCGTGCTGG AGTCGATCCG GCTGCTGGCG
AACGCGTCGC GCGTGCTGGC CGACCGGACG GTCGCGGGCA CCACGCCCAA CGTCGAGCGC
GCCCGCGCGC TGGCCGAGTC CTCGCCGTCG ATCGTCACAC CGCTCAACCG CACGATCGGG
TACGAGAACG CCGCGAAGAT CGCGAAGCAC GCCGTGAAGC AGGGCGTGAC GATCCGCCAG
GCGACGCTCG ACCTCGGGTT CGTCGAGCGC GGCGAGCTGA CGCTCGAGCA GCTCGACACG
GCGCTCGACG TGCTCGCGAT GACGCGCCCG CCGCAGGCCT GA
 
Protein sequence
MTDMNAPDGA GEYRIEHDTM GEVRVPAHAL YRAQTQRAVE NFPISGSTLE RGHVEALARV 
KKAAAKANAE LDVLPQDVAD AIVAAADEVA SGVHDAHFPV DVYQTGSGTS SNMNTNEVLA
TLATRLLGRD VHPNDHVNAS QSSNDVFPTS VHVAATAGVV RDLVPALEHL AGALEEKATA
WATVVKSGRT HLMDATPVTL GQEFGGYAAA VRYGVERLQA ALPRAAEVPL GGTAVGTGIN
TPAGFPQRVI ALLVEDTGLP LTEARDHFEA QSSRDGLVEL SGALRTIAVS LTKICNDLRW
MGSGPNTGLG EIALPDLQPG SSIMPGKVNP VVPEAVLMVC SRVVGNDATV AWAGASGSFE
LNVQIPVIAS AVLESIRLLA NASRVLADRT VAGTTPNVER ARALAESSPS IVTPLNRTIG
YENAAKIAKH AVKQGVTIRQ ATLDLGFVER GELTLEQLDT ALDVLAMTRP PQA
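
The relationship between the two sequence blocks above can be illustrated with a short translation sketch. It assumes the amino-acid assignments of translation table 11, which match the standard genetic code (the tables differ only in which codons may act as start codons), and it ignores alternative start codons because this gene begins with ATG. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, not part of any database tooling.

```python
# Codon table built from the usual TCAG ordering; the amino-acid string
# is the one shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence, as printed in the record.
first_blocks = "ATGACTGACA TGAACGCTCC CGACGGCGCG"
print(translate(first_blocks))  # MTDMNAPDGA

# Last four codons of the gene sequence, ending in the TGA stop codon.
print(translate("CCGCAGGCCTGA"))  # PQA
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow a comparison against the listed protein sequence and GC content.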