Gene Cfla_1898 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCfla_1898 
Symbol 
ID9145791 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCellulomonas flavigena DSM 20109 
KingdomBacteria 
Replicon accessionNC_014151 
Strand
Start bp2113086 
End bp2114213 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content75% 
IMG OID 
ProductThreonine aldolase 
Protein accessionYP_003636994 
Protein GI296129744 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGGGCC CGGCGCTGTC CTGCACGACG AACGACGGCA GCAGGCCGGT CGTCGGCCAG 
ACTGGGGGAG TGAGCACGCC CGCCCCCTCC CGCCACTTCG CCTCGGACAA CTACGCCGGG
GTGCACCCCG AGGTCCTGCA GGCGGTCGCC GCCGCCAACG TCGGGCACGT GCCCGCCTAC
GGCGACGACC CGTGGACCGA GCGTCTGCAG GAGGTCGTGC GCGCGCAGCT CGGCGACACG
GCGGTCGCGT ACCCGGTGCT CAACGGCACG GGCGCCAACG TCGTCGCGCT CCAGGCGATG
CTGCCGCGCT GGGGCGCCGT CGTGTGCACC GAGGCCGCGC ACGTGCACAC GGACGAGAAC
GGTGCGCCCG AGCGAGTCGG GGGCCTGAAG CTGCTCACCG TGCCCGCGGC CGACGGGCGG
CTCACCCCCG AGCTCGTGGC CCGACAGGCG TGGGGCTTCG GTGACGTGCA CCGCGCCCAG
CCCGGGGTCG TGTCGATCAC GCAGGCCACC GAGCTCGGCA CCGTGTACAC GCCCGAGGCG
GTGCGTGCGC TGTGCGACCA GGCGCACGAG CTCGGGATGC GCGTGCACCT GGACGGTGCG
CGGCTCGCCA ACGCCGCCGC CCACCTCGGC CTGCCGCTGC GGGCGCTGAC CACCGACTGC
GGCGTCGACG TGCTCTCGCT CGGCGGCACC AAGAACGGGC TGCTGCTCGG TGAGGCGGTG
GTGGTGCTGG ACCCCGCCGC CGTGACGGGT GTGGAGTACC TGCGCAAGGC CGACATGCAG
CTCGCCTCGA AGCTGCGGTT CGTGTCCGCG CAGCTGGTCG CCCTCTACGA GGGCGACCTG
TGGCTGCGCT CGGCGCAGCG CGCCAACGCG GCGGCCGCCC GGCTGCGGGC GGGGATCGAC
GCCCTGGGGG TCCTCGAGGT CACGCAGCCC ACCGAGGCCA ACGCGGTGTT CGTCCGGCTT
CCCGGCCATG TCGCCGCAGC CCTGCGGCGG CGCTGGCGGT TCTACGACTG GGACGTCACG
GACGGCACCG TGCGTCTCAT GTGCGCGTTC GACACCACGG ACGAGGACGT CGACGACCTG
CTCGTGGCCC TCGCCACCGC GCTGCGCGAG GAGCCGGACG CCGGCTGA
 
Protein sequence
MPGPALSCTT NDGSRPVVGQ TGGVSTPAPS RHFASDNYAG VHPEVLQAVA AANVGHVPAY 
GDDPWTERLQ EVVRAQLGDT AVAYPVLNGT GANVVALQAM LPRWGAVVCT EAAHVHTDEN
GAPERVGGLK LLTVPAADGR LTPELVARQA WGFGDVHRAQ PGVVSITQAT ELGTVYTPEA
VRALCDQAHE LGMRVHLDGA RLANAAAHLG LPLRALTTDC GVDVLSLGGT KNGLLLGEAV
VVLDPAAVTG VEYLRKADMQ LASKLRFVSA QLVALYEGDL WLRSAQRANA AAARLRAGID
ALGVLEVTQP TEANAVFVRL PGHVAAALRR RWRFYDWDVT DGTVRLMCAF DTTDEDVDDL
LVALATALRE EPDAG