Gene Cfla_2443 details


Gene Information

Locus tag: Cfla_2443
Symbol:
ID: 9146346
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cellulomonas flavigena DSM 20109
Kingdom: Bacteria
Replicon accession: NC_014151
Strand:
Start bp: 2735655
End bp: 2739074
Gene Length: 3420 bp
Protein Length: 1139 aa
Translation table: 11
GC content: 77%
IMG OID:
Product: Aldehyde Dehydrogenase
Protein accession: YP_003637530
Protein GI: 296130280
COG category:
COG ID:
TIGRFAM ID:
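
The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes that Start bp and End bp are 1-based and inclusive, and that the final codon of the CDS is a stop codon; the record does not state either convention, although the listed lengths agree with both. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # subtract the stop codon


length = gene_length_bp(2735655, 2739074)
print(length)                     # 3420
print(protein_length_aa(length))  # 1139
```

Both results match the Gene Length and Protein Length fields of this record.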


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00000107433
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGACCGACA CCGCCCACGC GGACGCGGTC GTGGCGCTCG TGCGGCGCTG GCTCGCGGTC 
GCGGCCGACC AGCCGGTGGA CCCTGCGGCC CGGCACCTGG CGGCGCTGCT GCGCGAACCG
GGTGGGCTGG CGTTCGCGGT CGGGTTCGTC GACGGGGTCG TGCGACCCGA GGACGTGGGC
GTCGCGGCGC ACCGCCTGCG CGAGCTGTCG GCCCACCCAC CCACGTTCCT GCCGCCGGTG
CTCCGCGCGG CGGTGCGGGT CGGCGGCGCG ATCGCCCCGG CGCTGCCCGG CGTGGTCGTC
CCGGTCGCCC GGCGCGTGCT GCGGCAGATG GTCGGGCACC TCGTCGTCGA CGCCACCGAC
CGCCACCTGG GCGCGGCGAT CGCACGCCGG CGCCGCGACG GCACGCGCCT CAACGTGAAC
CTGCTCGGCG AGGCCGTGCT GGGCGCGCGG GAGGCCGAGC GTCGGCTCGC CGGCACCCGC
CGGCTCCTGG CACGGGACGA CGTCGACTAC GTGTCGGTCA AGGTGTCGTC GGTCGTGGCA
CCGCACGCAC CGTGGGCGTT CGAGGAGTCG GTCGAGGACG TCGTCGCGCG GCTCGTGCCG
CTGTACGAGC AGGCCGCCAC GTCCCCCACC CCGAAGTTCG TCAACCTCGA CATGGAGGAG
TACCGCGACC TCGACCTCAC GGTCGAGGTG TTCACGCGCC TGCTCGACCG CCCGGGCCTG
ACGGGCCTCG AGGCCGGGAT CGTGCTGCAG GCGTACCTGC CGGACGCGCT GCCCGCGATG
CAGCGCCTGC AGGAGTGGGC GGCGCGACGA CGCGCGGCCG GCGGCGCCGG GATCAAGGTG
CGGCTCGTCA AGGGCGCGAA CCTGCCGATG GAGCGCGTCG AGGCCGAGCT CCACGGCCGG
CCCCTGGCCA CGTGGGGCTC CAAGCGCGAG ACGGACGCGC ACTACAAGCG GGTCCTCGAC
TGGGCGCTGC ACCCCGAGCG CGTCGCGAAC GTGCGGCTCG GGGTGGCCGG GCACAACCTG
TTCGACGTCG CGCACGCGTG GCTGCTCGCA GGTGAGCGCG GCGTCCGGGA CGCCGTCGAG
ATCGAGATGC TGCTGGGCAT GGCGCCCGGG CAGGCGGAGG CCGTGCGTCG CGACGTGGGC
AGCCTGCTGC TGTACACACC GGTGGTCGCG CCGCAGGAGT TCGACGTCGC GATCGCCTAC
CTCGTGCGGC GGCTGGAGGA GGGCGCGTCG GACGACAACT TCATGTCCGC GGTGTTCGAC
CTCGCCGACG ACGAGGCACT GTTCGTCCGC GAGCGCGAGC GCTTCCTCGC CTCGCTCGCG
GACGTCGACG CGCCCGCACC CACGCGCCAC CGCGTCGCGG ACCGGCACGC CGCGGTGCCG
CCGTCGGCGC CCGGCGCCTT CACGAACACC CCCGACTCCG ACCCGGCGGT CCCCGAGCAC
CGCGCGACCG TCCGCGAGGT GCTGGTCCGG GTCCCGACGT CCGACCTCGG CACCGCCGGG
ATCGCCGCGG CTCGCATCGA GGACGCCACC ACGCTCGACG AGGTCCTGCA GGACGCCCGT
CGCGCCGGAG CGGCCTGGGG CGCACGCCCC GCCGCCGAGC GCGCCGCGGT CCTCGACCGC
GCGGCCGACG TGCTCGAGTC GCGCCGCGCC GACCTGCTGG AGGTCATGGC GTCCGAGGCC
GGCAAGACCG TCGACCAGGG CGACCCGGAG GTGTCCGAGG CCGTCGACTT CGCGCACTGG
TACGCCGAGC TCGCCCGCGG CCTGGACCAC GTCGACGGCG CACGCTTCGT CCCCGACGCG
CTCACGCTCG TCACTCCCCC GTGGAACTTT CCCGTGGCGA TCCCGGCGGG CTCGACGCTC
GCGGCGCTGG CGGCCGGCTC GGCCGTCGTC CTCAAGCCCG CCGGCCCGGC CGAGCGCTGC
GGCGCGGTGC TCGCCGACGC GCTGTGGGAG GCGGGTGTCC CCCGGGACGT GCTGCGCCTC
GTGCAGGTCG ACGAGGGCAC GCTCGGGCGG GACCTCGTCG CGCACCGGGC GGTGGACCGC
GTCGTCCTGA CGGGCGCGTA CGAGACGGCC GAGCTGTTCC GGCGCTTCCG GCCCGACCTG
CCGCTGCTCG CCGAGACCAG CGGGAAGAAC GCGATCGTCG TGACGCCGAG CGCCGACCTC
GACCTCGCGG TGCGGGACGT CGTCGCCTCG GCGTTCGGGC ACGCGGGGCA GAAGTGCTCG
GCCGCGTCGC TCGTGGTGCT CGTCGGGTCG GTCGCGACGT CGCGGCGGTT CCGCTCGCAG
CTGCTCGACG CGGTGTCGTC GCTGGTCGTG GGCCTGCCGC AGGTCGCCCG TGCGCAGGTC
GGGCCGCTCA TCGAGCCCGC GTCGGGCAAG CTGCTCACCG GGTTGACCGA GCTCGAGCCG
GGGCAGCGCT GGGCGCTGGC GCCGCGCCGC CTCGACGACG CGGGCCGCCT GTGGACGCCC
GGCGTCGTGA CGGGTGTGCG GCGCGGGTCG CGCACGCACC GCACCGAGTA CTTCGGGCCC
GTGCTGGGCG TGATGACGGC CGCGACGCTC GACGAGGCGA TCGACCTGGT GAACGACGTG
GACTACGGCC TGACGTCGGG GCTGCACAGC CTCGACGCCG ACGAGGTCGG GGTCTGGCTC
GACCGCGTCG AGGCCGGCAA CCTCTACGTC AACCGCGGCA CCACCGGCGC GGTCGTGCGG
CGCCAGCCCT TCGGCGGCTG GAAGCGGTCG GCGGTCGGGC CGGGTGCGAA GGCGGGCGGC
CCGAGCTACC TCCTCGGGCT CGGGTCGTGG ACGTCGGCGC CCGCGACGAC CGGCGCGAGC
GTCACCGCAC CCGCCGCGTC CGCGCTCGTC GCGGCCGCCC GCGCGGACCT GCCTCCCGCG
GACGCGGACC GCGTCGAGCG CGGTGCGCGC AGCGACGCGG CGGCGTGGCG GGACGTGTTC
GCGGCGCGGG ACGTCAGCGG GCTGGCGTGC GAGCGCAACG TGCTGCGGCA CGTCCCTGCG
GCGGACCCGG TGCTGGTCCG GCAGGCGGAC GACGCGCCGG TGGCCGACCT GCTGCGTGTC
GTCGCGGCGG CGGCGTGCGC GCGGGCACGC GTCGTGGTGT CGGTGCCCGC CGCGCTCCCG
GACCGCTGTG CCCGCGCCGT CGCGGCGCTC GGGCCCGTGC ACGTCGAGGA CGGCGCGGCC
TGGGCCGCGC GGGTCGGTGC CCTCGACGGG GGTCGGGTGC GCCTCGTGGG CGGCTCGACC
GCGACCGTCG TCGCGGCCAC CGGTGGACGC CCCGACGTCG CCGTGTGGGA CCACCCCGTC
ACCGAGGCGG GCCGGGTGGA GCTGCTGCCG TTCCTGCGCG AGCAGGCCGT GAGCGTGACG
GCCCACCGGT TCGGCACGCC GCACCCGCTC ACGGAGGCCG CGCTGCCGCT GGGCCGGTAG
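
The listing above is grouped into blocks of 10 bases, 60 per line, so the spaces and line breaks must be removed before the sequence is used. The sketch below shows one way to do that and to compute a GC fraction that can be compared with the GC content field. The helper names are hypothetical, and how the database itself calculates the 77% figure is not stated in this record.

```python
def clean_sequence(raw: str) -> str:
    """Drop the spaces and line breaks used to group the listing into blocks."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of bases that are G or C in a (possibly space-grouped) sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence as listed in this record.
first_line = "ATGACCGACA CCGCCCACGC GGACGCGGTC GTGGCGCTCG TGCGGCGCTG GCTCGCGGTC"
print(len(clean_sequence(first_line)))  # 60
print(round(gc_fraction(first_line), 2))
```

Passing the full gene sequence to `gc_fraction` gives a value to compare against the GC content field.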
 
Protein sequence
MTDTAHADAV VALVRRWLAV AADQPVDPAA RHLAALLREP GGLAFAVGFV DGVVRPEDVG 
VAAHRLRELS AHPPTFLPPV LRAAVRVGGA IAPALPGVVV PVARRVLRQM VGHLVVDATD
RHLGAAIARR RRDGTRLNVN LLGEAVLGAR EAERRLAGTR RLLARDDVDY VSVKVSSVVA
PHAPWAFEES VEDVVARLVP LYEQAATSPT PKFVNLDMEE YRDLDLTVEV FTRLLDRPGL
TGLEAGIVLQ AYLPDALPAM QRLQEWAARR RAAGGAGIKV RLVKGANLPM ERVEAELHGR
PLATWGSKRE TDAHYKRVLD WALHPERVAN VRLGVAGHNL FDVAHAWLLA GERGVRDAVE
IEMLLGMAPG QAEAVRRDVG SLLLYTPVVA PQEFDVAIAY LVRRLEEGAS DDNFMSAVFD
LADDEALFVR ERERFLASLA DVDAPAPTRH RVADRHAAVP PSAPGAFTNT PDSDPAVPEH
RATVREVLVR VPTSDLGTAG IAAARIEDAT TLDEVLQDAR RAGAAWGARP AAERAAVLDR
AADVLESRRA DLLEVMASEA GKTVDQGDPE VSEAVDFAHW YAELARGLDH VDGARFVPDA
LTLVTPPWNF PVAIPAGSTL AALAAGSAVV LKPAGPAERC GAVLADALWE AGVPRDVLRL
VQVDEGTLGR DLVAHRAVDR VVLTGAYETA ELFRRFRPDL PLLAETSGKN AIVVTPSADL
DLAVRDVVAS AFGHAGQKCS AASLVVLVGS VATSRRFRSQ LLDAVSSLVV GLPQVARAQV
GPLIEPASGK LLTGLTELEP GQRWALAPRR LDDAGRLWTP GVVTGVRRGS RTHRTEYFGP
VLGVMTAATL DEAIDLVNDV DYGLTSGLHS LDADEVGVWL DRVEAGNLYV NRGTTGAVVR
RQPFGGWKRS AVGPGAKAGG PSYLLGLGSW TSAPATTGAS VTAPAASALV AAARADLPPA
DADRVERGAR SDAAAWRDVF AARDVSGLAC ERNVLRHVPA ADPVLVRQAD DAPVADLLRV
VAAAACARAR VVVSVPAALP DRCARAVAAL GPVHVEDGAA WAARVGALDG GRVRLVGGST
ATVVAATGGR PDVAVWDHPV TEAGRVELLP FLREQAVSVT AHRFGTPHPL TEAALPLGR