Gene Cfla_1049 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCfla_1049 
Symbol 
ID9144924 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCellulomonas flavigena DSM 20109 
KingdomBacteria 
Replicon accessionNC_014151 
Strand
Start bp1161352 
End bp1164111 
Gene Length2760 bp 
Protein Length919 aa 
Translation table11 
GC content69% 
IMG OID 
Productiron-containing alcohol dehydrogenase 
Protein accessionYP_003636153 
Protein GI296128903 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCCGAGA CCACCAAGAA GAGCACCCGC GCGAAGGCCC CCGCCGGCGC GTCGGCCGAG 
GCCCCCGCGG CGGCCGCGCA GCCCGCCGGC GTCGCCCTCG GCGTGAGCGA GGCCGACGCG
GTCGCGCAGT CCGTCGACCT GCTCGTCGCG AACGCCACCA AGGCGCTCGC CGAGTTCGAG
TCCATGACGC AGGAGGACGT CGACCGCTTC GTCAAGAAGG GCGCCGTCGC GGCCCTCGAC
CAGCACGGTC AGCTGGCCAA GCTGGCCGTC GAGGAGACCG GGCGCGGTGT CTTCGAGGAC
AAGGCCGTGA AGAACATCTT CGCGTGCGAG CACGTCACGA ACTCGATGGC GAACCTGCGG
ACCGTCGGCG TCATCAACGT CGACGACCTC AACGGCATCA CCGAGATCGC GGAGCCCGTG
GGCGTGATCG CCGGCATCAC CCCCGTCACC AACCCCACGT CGACCGCGAT CTTCAAGGCG
CTCATCTCGC TGAAGACCCG CAACCCGATC ATCTTCGCGT TCCACCCGAA CGCCCAGCAG
TGCTCCGTGG CCGCCGCCCG CATCGTGCGC GACGCCGCCG TGGCCGCCGG CGCCCCCGAG
CACTGCATCC AGTGGGTCGA GGCCCCGTCG CTCGCCGCGA CCGGTGCCCT CATGAACCAC
CCGGGTGTCG CCACCATCCT TGCGACCGGC GGCAACGCGA TGGTCAAGGC CGCGTACTCC
TGCGGCAAGC CCGCCCTGGG CGTCGGCGCC GGCAACGTCC CGGCGTACGT CGAGAAGTCC
GCCAAGCTCG CCCGCGCGAT CAACGACATC GTGCTGAGCA AGGCGTTCGA CAACGGCGTG
ATCTGCGCCT CCGAGCAGGC CGCCATCCTC GACGACGAGA TCTACGACGC CGCGATGGCG
GAGTTCGCGA AGCTCCACGC GTACCGCGCC ACCCCGGCGG AGAAGGCGAA GCTCGAGCGC
TTCATCTTCG GTGTCGAGGC CGACGGCGAG AACTGCGCGG GAGCCAAGCT CAACCCGGCG
GTCGTCGGCA AGTCGCCGGT GTGGATCGCC GAGCAGGCGG GCTTCACGGT CCCCGCGGAC
ACCTCGATCA TCCTGGCCGA GGTGTCGGGC GTCGGCCCCG CCGAGCCCCT GACCCGCGAG
AAGCTGTGCC CGGTCCTGGC CGTGCTGCGG GCGTCGTCGA CCGAGGAGGG CATCGCGCTC
GCCGAGAAGA TGGTCGAGTT CGACGGTCTG GGCCACTCGG CGGCCATCCA CACGCTCGAC
GAGGCGCTCA CGGTCGAGTT CGGCCGGCGC GTCAAGGCGA TCCGCGTCAT CTGCAACGCG
CCCTCGTCGC TCGGCGGCAT CGGTGACATC TACAACGCGT TCATCCCGTC GCTCACGCTC
GGCTGCGGCT CCTACGGCCA CAACTCGGTG TCCAACAACG TGTCGGCCGT CAACCTCGTC
AACGTCAAGC GCGTGGGCCG GAGGAACAAC AACTTGCAGT GGTTCAAGGT CCCCGCCAAG
ACGTACTTCG AGCCGAACGC GATCCGCTAC CTCGCGGACA TGGCCGACGT CGAGCGCGTC
ACGATCGTCA CCGACGCGAC CATGACGACC CTCGGGTTCG TCGACAAGGT CCTCGACGTG
CTGCGCCGCC GCGGCAACAA CGTGGCCGTG CAGATCATCG ACCAGGTCGA GCCCGAGCCG
TCCGTGAAGA CCGTCCAGGC CGGCGCCGCG CAGATGCGCC ACTTCCGGCC CGACACGATC
ATCGCGCTCG GCGGTGGGTC GCCCATGGAC GCCGCGAAGG TCATGTGGCT GCTGTACGAG
CACCCGGAGA TCGTCTTCTC CGACCTCAAG CAGAAGTTCT TCGACGTCCG CAAGCGCGCG
TTCAAGTTCC CGGTGCTGGG CGACCTGGCC AAGCTCGTGT GCATCCCCAC CACGTCGGGC
ACGGGCGCCG AGGTCACGCC GTTCGCCGTC ATCAGCGACG TCGAGGCGGG GAAGAAGTAC
CCGCTCGCCG ACTACGCGCT GACGCCGACC GTCGCGATCA TCGACCCGGT CCTCACGCAC
AAGATGCCGC GGTCGCTGGC CGCCGACTCC GGGTTCGACG CCCTGACGCA CGCCACCGAG
GCGTACGTCG CGGTGTACGC GAACGACTTC ACCGACGGCA TGGCGCTGCA GGCGATCCGC
CTGATCTTCG ACAACCTCGC GCAGTCGGTG AACGGCGACC CGAGCGACCC GCTCACGCAG
GACGCGCGGG AGAAGATGCA CAACGCCGGG ACGATCGCCG GCATGGCGTT CGGCAACGCG
TTCCTCGGCA TCGTGCACGC CATGGCGCAC GTCGTCGGCT CGACGTACCA CCTGGTGCAC
GGCCGCACGA ACGCCACCCT GCTGCCGCAC GTGATCCGCT ACAACGGCAC CGTCCCGACC
AAGCTCACGA GCTGGCCGAA GTACGAGTCC TACGTGGCGC CCGAGCGCTT CCAGCAGATC
GCGGCGATGC TCGGCCTGCC GGCCTCGACG CCCGAGGAGG GCGTGGAGTC CTACGCGCTG
GCCGTCGAGG CGCTGCGCGC CAAGGTCGGC ATCCCGCAGT CGTTCCAGGC ACAGGGCGTC
GACGAGCAGG AGTTCATGAG CCGGCTCGAC GAGGTCGCCA TGGGCGCCTA CGAGGACCAG
TGCGCCCCGG CGAACCCGCG CATGCCGATG ATCGACGACA TGAAGGACAT CATGACCGCG
GCCTACTACG GCACGTCGCT GGAGGACGTG CGTGGTCGCC GCGAGCGGGC GGAGGGCTGA
 
Protein sequence
MSETTKKSTR AKAPAGASAE APAAAAQPAG VALGVSEADA VAQSVDLLVA NATKALAEFE 
SMTQEDVDRF VKKGAVAALD QHGQLAKLAV EETGRGVFED KAVKNIFACE HVTNSMANLR
TVGVINVDDL NGITEIAEPV GVIAGITPVT NPTSTAIFKA LISLKTRNPI IFAFHPNAQQ
CSVAAARIVR DAAVAAGAPE HCIQWVEAPS LAATGALMNH PGVATILATG GNAMVKAAYS
CGKPALGVGA GNVPAYVEKS AKLARAINDI VLSKAFDNGV ICASEQAAIL DDEIYDAAMA
EFAKLHAYRA TPAEKAKLER FIFGVEADGE NCAGAKLNPA VVGKSPVWIA EQAGFTVPAD
TSIILAEVSG VGPAEPLTRE KLCPVLAVLR ASSTEEGIAL AEKMVEFDGL GHSAAIHTLD
EALTVEFGRR VKAIRVICNA PSSLGGIGDI YNAFIPSLTL GCGSYGHNSV SNNVSAVNLV
NVKRVGRRNN NLQWFKVPAK TYFEPNAIRY LADMADVERV TIVTDATMTT LGFVDKVLDV
LRRRGNNVAV QIIDQVEPEP SVKTVQAGAA QMRHFRPDTI IALGGGSPMD AAKVMWLLYE
HPEIVFSDLK QKFFDVRKRA FKFPVLGDLA KLVCIPTTSG TGAEVTPFAV ISDVEAGKKY
PLADYALTPT VAIIDPVLTH KMPRSLAADS GFDALTHATE AYVAVYANDF TDGMALQAIR
LIFDNLAQSV NGDPSDPLTQ DAREKMHNAG TIAGMAFGNA FLGIVHAMAH VVGSTYHLVH
GRTNATLLPH VIRYNGTVPT KLTSWPKYES YVAPERFQQI AAMLGLPAST PEEGVESYAL
AVEALRAKVG IPQSFQAQGV DEQEFMSRLD EVAMGAYEDQ CAPANPRMPM IDDMKDIMTA
AYYGTSLEDV RGRRERAEG