Gene Cfla_1835 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCfla_1835 
Symbol 
ID9145728 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCellulomonas flavigena DSM 20109 
KingdomBacteria 
Replicon accessionNC_014151 
Strand
Start bp2046702 
End bp2047823 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content74% 
IMG OID 
Product3-dehydroquinate synthase 
Protein accessionYP_003636931 
Protein GI296129681 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.233017 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.221025 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGACG CGTCCGTCGT CCGGGTCGCC GGCGAGCAGC CGTACGACGT CGTCATCGGG 
CGCCACCTCC TCGGGCACCT GACCGGCATG CTCGGCGAGG GCGTGCGCCG CGTGCTCGTC
GTACATCCGG CGGCGCTCGC GACGTCCGCC GAGACCGTCC GCGCGGACCT CGTGGCCGCC
GGCTACGAGG TGTACCTCGC GCAGGTCCCC GACGCCGAGG AGCAGAAGAC CGCGCAGGTC
GCCGCCTTCT GCTGGGGTGT GCTCGGCCAG GCCGACTTCA CGCGCTCCGA CGCCGTCGTC
GGCCTCGGCG GGGGAGCGAC CACCGACCTC GCCGGGTTCG TGGCCGCCAC CTGGCTGCGC
GGCGTGCGCG TCGTGCAGGT CCCCACCACC GTGCTGGCGA TGGTCGACGC GGCGGTCGGC
GGCAAGACCG GCATCAACAC CGCCGAGGGC AAGAACCTCG TCGGGGCGTT CCACCCGCCG
GCCGGCGTGC TGTGCGACCT CGCGGCCGTC GAGTCGATGG TGCCGAACGA CTTCGTCGCC
GGGCTCGCCG AGATCGTCAA GTGCGGGTTC ATCGACGACC CCCGCATCCT CGAGCTCGTC
GAGGAGCACA CCGCCCTGCT GCTGGACCCC GTCGCCGCCG CGTCGTCGCC CGTGCTCGCG
GAGCTCGTCG AGCGGGCCGT GCGCACCAAG GCGCGCGTCG TGGGGGAGGA CCTGCGCGAG
GCCGGCCTGC GCGAGATCCT CAACTACGGC CACACGTTCG GCCACGCGGT CGAGCACGTC
GAGCGGTACC GCTGGCGCCA CGGCGCCGCC GTGTCGGTCG GCATGGTGTT CGTCGCCGAG
CTCGCGCGCC TCGCCGGGCG CCTGGACGAC GCGGTCGTCG AACGCCACCG CAGCGTGCTC
ACCTCGCTGG GCCTGCCGAC GACGTACCGG GCAGACCGGT GGGAGCAGCT GCTCACCGCC
ATGCGCCGCG ACAAGAAGAC CCGGGGCGAC CTCCTGCGCT TCGTGGTCCT CGAGGACCTC
GCGAGGCCGG CGCGTCTGGA AGGCCCCGAC CCCACCCTCC TCGCCGCCGC CTACGCCGAG
ATCTCCGCGA CCCCGCAGCG GACGAGCGGC ATCCTGCTCT GA
 
Protein sequence
MSDASVVRVA GEQPYDVVIG RHLLGHLTGM LGEGVRRVLV VHPAALATSA ETVRADLVAA 
GYEVYLAQVP DAEEQKTAQV AAFCWGVLGQ ADFTRSDAVV GLGGGATTDL AGFVAATWLR
GVRVVQVPTT VLAMVDAAVG GKTGINTAEG KNLVGAFHPP AGVLCDLAAV ESMVPNDFVA
GLAEIVKCGF IDDPRILELV EEHTALLLDP VAAASSPVLA ELVERAVRTK ARVVGEDLRE
AGLREILNYG HTFGHAVEHV ERYRWRHGAA VSVGMVFVAE LARLAGRLDD AVVERHRSVL
TSLGLPTTYR ADRWEQLLTA MRRDKKTRGD LLRFVVLEDL ARPARLEGPD PTLLAAAYAE
ISATPQRTSG ILL