Gene Cfla_1934 details


Gene Information

Locus tag: Cfla_1934
Symbol:
ID: 9145828
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cellulomonas flavigena DSM 20109
Kingdom: Bacteria
Replicon accession: NC_014151
Strand:
Start bp: 2149572
End bp: 2151740
Gene Length: 2169 bp
Protein Length: 722 aa
Translation table: 11
GC content: 73%
IMG OID:
Product: transketolase
Protein accession: YP_003637028
Protein GI: 296129778
COG category:
COG ID:
TIGRFAM ID:
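
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 2149572
END_BP = 2151740
GENE_LENGTH_BP = 2169
PROTEIN_LENGTH_AA = 722


def span_length(start, end):
    """Length of a coordinate span, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_length_aa(gene_length_bp):
    """Amino acids encoded by a CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks hold for this record under the stated assumptions.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert coding_length_aa(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Under these assumptions the 2169 bp span corresponds to 723 codons, which is 722 amino acids plus the stop codon.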


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAACGCGA AGGCTCCCCT GGCCACCACG GTCGGCTGGT CCGACCTGGA CCTGCGCGCG 
GTGGACACCA CCCGCGTGCT CGCGGCGGAC GCGGTCGAGA AGGTCGGCAA CGGCCACCCC
GGGACGGCGA TCAGCCTCGC CCCGGCGGCC TACCTGCTCT ACCAGAACGT GCTGCGCCAC
GACCCGACGG ACCCGCAGTG GCTCGGGCGC GACCGCTTCG TCCTGTCGGC GGGCCACTCC
AGCCTGACGC AGTACATCCA GCTCTACCTC GGCGGCTTCG GCCTGGAGCT CGAGGACCTG
CAGGCGCTGC GCACCTGGGG CTCCAAGACC CCGGGCCACC CGGAGTACCG CCACACGGCC
GGCGTCGAGA TCACGACCGG CCCGCTGGGC CAGGGCCTGG CCTCGGCCGT CGGCTTCGCG
ATGGCCGCAC GCCGCGAGCG CGGTCTGCTC GACCCCGAGG CGGCGCCCGG CGAGAGCCCC
TTCGACCACC ACGTGTACGT CATCGCCTCC GACGGCGACC TGCAGGAGGG CGTGACGAGC
GAGGCCTCGT CGATCGCCGG CACGCAGGAG CTCGGCAACC TCGTCGTGGT GTGGGACGAC
AACCACATCT CGATCGAGGG TGACACGTCG ATCGCGTTCA CCGAGGACGT GCTCGCGCGC
TACGAGTCCT ACGGCTGGCA CGTGCAGTCC GTCGACTGGA CCGCCGGCGG CGAGTACCGC
GAGGACGTCG ACGCGCTGCA CGCGGCGATC GAGGCCGCCA AGGCCGTCAC GGACAAGCCG
TCGTTCATCC GCCTGCGCAC GCTGATCGCG TGGCCCACAC CGGGCAAGAC CGACGACCAC
TCGTCGCACG GCTCGAAGCT GGGCGGCGAC GCGATCCGCG GCCTCAAGGA GATCCTCGGG
TTCGACCCCG AGCAGACCTT CGAGGTCGCC GACGAGGTCA TCGCCCACAC GCGCTCGCTC
GCCGGGCGCG CCGCGCAGGA CCGCGCCGGA TGGCAGCAGA AGTTCGACGC GTGGGCCGCG
GCGAACCCCG AGCGCAAGGC GCTGCTGGAC CGGCTCCGCG CCGACGAGCT CCCGGAGGGC
TGGGCCGAGG CGCTGCCGAC GTTCCCCGCC GGCAAGGCGG TCGCCACGCG TGCCGCGTCC
GGCGAGGTGC TCTCGGCGCT GGCCCCGGTG CTGCCCGAGC TGTGGGGTGG CTCCGCCGAC
CTCGCGGGCT CGAACAACAC GACCATGAAG GGCGAGCCGT CCTTCCTCCC CGCGCACCGC
TCGTCGCACG AGTTCGCGGG CGACCAGTAC GGTCGCACGC TGCACTTCGG CATCCGCGAG
CACGCGATGG GCTCGATCCT GTCGGGCATC CGGCTGCACG GCCTGACCCG CCCGTACGGC
GGCACGTTCT TCACGTTCTC CGACTACATG CGCGGCGCCG TGCGCCTCGC GGCGCTCATG
GGCGTCAACG TCACGTACGT GTGGACGCAC GACTCCATCG GCCTCGGCGA GGACGGCCCG
ACGCACCAGC CGGTCGAGCA CCTGACGGCG GTGCGCGCGA TCCCCGGCCT GGCAGTCGTG
CGCCCCGCCG ACGCCAACGA GACCGCGGCC GCCTGGAAGG CCTCGCTCGA GCGCACCGAC
GGTCCCGTCG CGCTCGTGCT GACCCGTCAG AACGTCCCGA CGTTCCCGCG CGGCGAGGAC
GGCTTCGCCA CGACCGACGG GGTCGCCAAG GGCGCGTACG TGCTGCTCGA GGCGTCGTCC
GGCACCCCCG ACGTCGTGCT GATCGCCACG GGCTCCGAGG TCCAGCTCGC CGTCGAGGCG
CGCGCGACCC TCGAGGCCGC GGGCGTCGCG ACGCGCGTCG TGTCGGCGCC CTGCCTCGAG
TGGTTCGCAG AGCAGGACGA GGAGTACCGC GAGTCGGTGC TGCCGTCGTC GGTCCGGGCC
CGGGTGTCGG TCGAGGCCGG CATCGCGCTG TCGTGGCACA AGATCGTCGG CGACGCGGGC
CGGACCGTCT CGATCGAGCA CTACGGTGCG TCGGCCGACT ACGAGCGCCT CTACCGGGAG
TTCGGGATCA CCGCCGAGGC CGTCGTGGCC GCGGCGCACG AGTCGGTCGC GGCGGCGCGT
GGGACGTCGG CGCCGTCCGA CCAGCCGGCC GCCCCGTCGC AGTCCGGCAC GGGCGACCTG
CCCGCCTGA
 
Protein sequence
MNAKAPLATT VGWSDLDLRA VDTTRVLAAD AVEKVGNGHP GTAISLAPAA YLLYQNVLRH 
DPTDPQWLGR DRFVLSAGHS SLTQYIQLYL GGFGLELEDL QALRTWGSKT PGHPEYRHTA
GVEITTGPLG QGLASAVGFA MAARRERGLL DPEAAPGESP FDHHVYVIAS DGDLQEGVTS
EASSIAGTQE LGNLVVVWDD NHISIEGDTS IAFTEDVLAR YESYGWHVQS VDWTAGGEYR
EDVDALHAAI EAAKAVTDKP SFIRLRTLIA WPTPGKTDDH SSHGSKLGGD AIRGLKEILG
FDPEQTFEVA DEVIAHTRSL AGRAAQDRAG WQQKFDAWAA ANPERKALLD RLRADELPEG
WAEALPTFPA GKAVATRAAS GEVLSALAPV LPELWGGSAD LAGSNNTTMK GEPSFLPAHR
SSHEFAGDQY GRTLHFGIRE HAMGSILSGI RLHGLTRPYG GTFFTFSDYM RGAVRLAALM
GVNVTYVWTH DSIGLGEDGP THQPVEHLTA VRAIPGLAVV RPADANETAA AWKASLERTD
GPVALVLTRQ NVPTFPRGED GFATTDGVAK GAYVLLEASS GTPDVVLIAT GSEVQLAVEA
RATLEAAGVA TRVVSAPCLE WFAEQDEEYR ESVLPSSVRA RVSVEAGIAL SWHKIVGDAG
RTVSIEHYGA SADYERLYRE FGITAEAVVA AAHESVAAAR GTSAPSDQPA APSQSGTGDL
PA
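
The gene sequence begins with GTG while the protein sequence begins with M. This is expected under translation table 11, where GTG is one of the alternative initiation codons and is read as methionine at the start of a CDS. The sketch below illustrates the relationship; it is not the tool that produced this record. The function name `translate_cds` is hypothetical, and the codon table is built from the standard NCBI base ordering (TCAG), whose codon assignments table 11 shares.

```python
# Codon assignments in NCBI order (bases T, C, A, G); table 11 uses the
# same assignments as the standard code and differs in its start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Initiation codons listed for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna):
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    protein = []
    for pos in range(0, usable, 3):
        codon = seq[pos:pos + 3]
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        # An alternative start codon in first position is read as methionine.
        if pos == 0 and codon in START_CODONS:
            residue = "M"
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGAACGCGA AGGCTCCCCT GGCCACCACG"))  # MNAKAPLATT
```

The printed residues match the first ten residues of the protein sequence, and the final nine bases (CCCGCCTGA) translate to PA followed by a stop, matching its last line.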