Gene Cfla_0074 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCfla_0074 
Symbol 
ID9143939 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCellulomonas flavigena DSM 20109 
KingdomBacteria 
Replicon accessionNC_014151 
Strand
Start bp94005 
End bp95672 
Gene Length1668 bp 
Protein Length555 aa 
Translation table11 
GC content71% 
IMG OID 
Productmalate synthase A 
Protein accessionYP_003635193 
Protein GI296127943 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.481582 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCATCA TCGACACCTC GCCCGTGGCG GCGAACCTGA CCCCCGCCTT CTCCCACCCC 
CGCCTGACGA TCACCGGCCC GCGCGTCGAC GGCATCGACC ACGTGCTCAC GCCGGAGGCC
ATGGACTTCC TCACCGACCT GCACCAGCGC TTCTCGGGCC GTCGGCACGA GCTGCTGCTG
GCCCGCCAGC GCCGCCGGGA CCGCTTCGGC AACGGCGCCG ACCCGGACTT CCTGCCGCTG
ACCGCGCACA TCCGCGCGGA CCCGACGTGG CGCGTCGCCG GCCCCGGACC GGGCCTGGAG
GACCGCCGCG TCGAGATCAC CGGCCCCACC GACCGCAAGA TGACCGTCAA CGCCCTGAAC
TCCGGCGCCA AGGTGTGGCT CGCCGACCAC GAGGACGCCA TGAGCCCCAC GTGGTCCAAC
GCGATCTCCG GCCAGGTCAA CCTGTACGAC GCGATCCGCG GGCAGATCGA CTTCACGTCG
CCCGAGGGCA AGGAGTACAA GGTCGGTGAG ACGACGCCCA CGATCGTGTT CCGCCCACGC
GGCTGGCACC TCACCGAGAA GCACCTGCGC TTCACGGACC GCGCCGGGCA GTCCTGCGCG
GCGTCAGCGT CGCTCGTGGA CGCGGGCCTG TACCTGTTCC ACAACGCGCA GGCGCTGATC
GACGCCGGCC GCGGCCCGTA CCTCTACCTG CCCAAGATCG AGGGGCACCT GGAGGCGCGG
CTGTGGGACG ACGTCTTCCG GTTCACCGAG ACCTATCTCA ACCTGCGGCA CGGCACGATC
CGCGCGACGG TCCTCATCGA GACCATCACG GCCGCGTTCG AGATGGAGGA GATCCTCTAC
GAGCTGCGCG ACCACTGCGC GGGCCTCAAC GCCGGGCGCT GGGACTACAT CTTCAGCGTC
ATCAAGAACT TCCGCGCCCG CGGTCCCCGG TTCGTCATGC CGGACCGCGG CAAGGTGACC
ATGACGGTGC CGTTCATGAA GGCGTACACC GAGCTGCTCG TGCAGACGTG CCACAAGCGC
GGCGCGCAGG CGATCGGCGG CATGAGCGCG TTCATCCCCA ACCGCCGTGA GCCCGAGGTG
ACCGCGCGGG CGCTGGAGCA GGTGCGCGCG GACAAGGAGC GCGAGGCCGG CCAGGGCTAC
GACGGGACGT GGGTCGCGCA CCCCGACCTG GTGCCGGTGG CGCGCGAGGT GTTCGACGCG
GCGTTCGGCG ACCGCACCGA CCAGCGGCAC CGCCTGCGCG AGGACGTCCG CGTGACCGCC
GCCGACCTGC TGAACGTGCC GTCCGCGGGC GGCGGCGAGC CGGGTGCCGT CACCGACGCC
GGTCTGCGGC AGAACGTGTC CGTGGCCGTC CGGTACCTCG AGGCGTGGCT GCGCGGGTCG
GGCGCCGTGG CGATCGACAA CCTCATGGAG GACGCGGCGA CGGCCGAGAT CTCGCGCTCG
CAGGTGTGGC AGTGGGTGCA CCAGGGCACG GTGACGGCCG AGGGCACCCG CATCGACGCG
ACGACCGTCG AGAAGGTGCT GGCGGACGTG CTCGGCAGCC TCGAGCGACG CGAGGGCGAC
CGGTACGACG CCGCGGCGAC CCTGTTCCGC GAGGTGTCGC TGTCGGAGGA CTTCCCGACG
TTCCTCACGA TCCCCGCCTA CACCCAGCAC CTGGTGACGC CGGCCTGA
 
Protein sequence
MTIIDTSPVA ANLTPAFSHP RLTITGPRVD GIDHVLTPEA MDFLTDLHQR FSGRRHELLL 
ARQRRRDRFG NGADPDFLPL TAHIRADPTW RVAGPGPGLE DRRVEITGPT DRKMTVNALN
SGAKVWLADH EDAMSPTWSN AISGQVNLYD AIRGQIDFTS PEGKEYKVGE TTPTIVFRPR
GWHLTEKHLR FTDRAGQSCA ASASLVDAGL YLFHNAQALI DAGRGPYLYL PKIEGHLEAR
LWDDVFRFTE TYLNLRHGTI RATVLIETIT AAFEMEEILY ELRDHCAGLN AGRWDYIFSV
IKNFRARGPR FVMPDRGKVT MTVPFMKAYT ELLVQTCHKR GAQAIGGMSA FIPNRREPEV
TARALEQVRA DKEREAGQGY DGTWVAHPDL VPVAREVFDA AFGDRTDQRH RLREDVRVTA
ADLLNVPSAG GGEPGAVTDA GLRQNVSVAV RYLEAWLRGS GAVAIDNLME DAATAEISRS
QVWQWVHQGT VTAEGTRIDA TTVEKVLADV LGSLERREGD RYDAAATLFR EVSLSEDFPT
FLTIPAYTQH LVTPA