Gene Cfla_3688 details


Gene Information

Locus tag: Cfla_3688
Symbol:
ID: 9147604
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cellulomonas flavigena DSM 20109
Kingdom: Bacteria
Replicon accession: NC_014151
Strand:
Start bp: 4072393
End bp: 4075749
Gene Length: 3357 bp
Protein Length: 1118 aa
Translation table: 11
GC content: 71%
IMG OID:
Product: replicative DNA helicase
Protein accession: YP_003638755
Protein GI: 296131505
COG category:
COG ID:
TIGRFAM ID:
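
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count, assuming an unspliced CDS with one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon codes for one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(4072393, 4075749)
print(length)                  # 3357
print(protein_length(length))  # 1118
```

Both values match the Gene Length and Protein Length fields listed for Cfla_3688.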


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal


Fosmid Coverage information

Num covering fosmid clones: 54
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal


Sequence

Gene sequence
GTGACCATCG AGGACCTCGA GTACGGCGCG CCGCCGGACA GCGGGCACAG CAGCGGCGGC 
GGGTTCGACC GCACCCCGCC GCAGGACCTC GACGCCGAGC GCTCGGTGCT CGGCGGCATG
ATGATCAGCA AGGACGCCAT CGCCGACGTC ATCGAGCAGA TCAAGGGCAC CGACTTCTAC
CGCCCTGCGC ACGAGGCGAT CTACGACGCG ATCCTCGACC TGTACGGCCG CGGCGAGCCC
GCCGACGCGA TCACCGTCGC CGACGAGCTC ACCAAGCGCG GCGAGATGGG CCGCGTCGGT
GGCGCCGCCT ACCTGCACAC CCTCATCGCG TCCGTCCCCA CCGCCGCGAA CGCCGGGTTC
TACGCGCGGA TCGTGCGCGA GCGCTCCATC CTGCGCAAGC TCGTCGAGGC CGGCACGCGC
ATCGTCCAGC TCGGCTACGC CACCGACGGC GGCGACGTCG ACGAGCTCGT CAACAACGCG
CAGGCCGAGG TCTACGCCGT CACCGAGCGC CGCGCGTCCG AGGACTACCT GCCGCTGTCC
GAGGTCATCG GCGGGACGGT CGACGAGATC GAGGCCGCCG GCCACCGCGG CGAGGGCATG
ATCGGTGTGC CCACGGGGTT CTCGGACCTC GACCGGCTCA CCAACGGGCT GCACCCGGGG
CAGATGATCG TCATCGCCGC GCGGCCGGCC ATCGGGAAGG CCCTCGCGCT CGACACGCCG
CTGCCCACGC CCACCGGCTG GACGACGATG GGCGAGGTCC AGGTCGGCGA CCAGCTGATC
GCCGACGACG GGACGATTAC CCGCGTCGTC GCCGCGACCG ACGTCATGAC GGACCGGCCC
TGCTACCGCG TGACGTTCGA CGACGGGTCC ACGATCGTCG CCGACGCGCA GCACCAGTGG
GCCACGCGCA CGCGTGCCGA GCGGCGGGTC GGCGCCGACG CGTCGGTGCG CACCACGGAG
GAGCTCGCCG CGACGGTGCG GTGCGCGACG GCCGACGCGC GCGTCAACCA CTCGGTCGCG
ACCACCGCGC CGCTGTCCCT GCCGGAGCGC GAGCTGCTGG TGGACCCGTA CCTGCTAGGT
GTGTGGCTGG GCGACGGGCA GTCCGCCGCG GCGCGCTTCA CGAGCGCCGA CCCCGAGATC
GCGATGCGGA TCGAGGGGCG GGGGTACGAC GCGGATGTGC TGACGTCGTC GCTCGCGACG
CTCGGTCTCG GTGCCGAACT GCACATCCCC GCGGACTACC TGCGTGCCGG TGAGGCGCAG
CGGCGCGAGC TGCTCGCCGG TCTGCTCGAC ACGGACGGGA CGGTGAACCC CACGGGGTCG
GTGCAGTTCG CCGTGACGCA CGAGCGTCTG GCGCGGGACG TGCGGGAGCT CGTGCACTCG
CTGGGTTACC GCACCGGTTG GTCGGAGAGG AACGCCCGCG GCCGTTCCGC CGCGTCGCCG
ACCTGCTTCA CGATCACGTT CACGACGGAC GACGACGTGT TCGCGCTCGA GCGCAAGAAG
CTCGTCCACA AGGAGCGCCG CCGTCGCTCG ACGCCGCGGC TGCACCAGCG CTATGTGGTG
TCGGTCGAGC CGATCGAGTC GGTGCCGGTC CGGTGCGTGG AGATCTCGCA CCAGTCGCAC
CTGTACCTTG CCGGCGAGGC GATGATCCCG ACGCACAACT CCACTGTCGG CATCGACATC
GTCAGGTCCT CCGCCATCAA GCACAACATG GCGGCCGTGG TGTTCTCCCT CGAGATGAGC
CGCAACGAGA TCGTCATGCG CCTGCTCTCC GCCGAGGCGC GCGTGCACCT GCAGAAGCTG
CGCACCGGTG CGATGGGCGA GGACGACTGG GCCAAGGTCG CCGCCACGAT GGGCCGCATC
AGCGAGGCGC CGCTGTTCAT CGACGACTCG CCGAACATGT CGCTCATGGA GATCCGTGCC
AAGTGCCGGC GCCTCAAGCA GAGGCACGAC CTCAAGCTCG TCGTCATCGA CTACCTCCAG
CTCATGACGT CGGGCAAGCG GGTCGAGTCC CGCCAGCAGG AGGTCTCGGA GTTCTCGCGT
GCGCTCAAGC TGCTCGCCAA GGAGCTCGAG GTGCCGGTCA TCGCGATCTC GCAGCTGAAC
CGTGGGCCGG AGCAGCGCAC GGACAAGCGG CCGGCGATGA GCGACCTGCG TGAGTCGGGC
TGCCTGACTG AGGACACCCG CGTGCTCCGC GCTGACACCG GGGCTGAGAC GACGCTGGGG
GAGATGTACG CGCTCGGCCA CAAGGACGTG CCGATCTGGG CGCTCGACGA CCGCCTGCAG
TACGTGCGGC GGCACCTGAC GCACGTCTTC CCGACGGGCG TGAAGCCGGT GTACCGGTTG
CGGCTCGCGT CGGGCAAGGA GGTGACAGCG ACCGCCAACC ACCCGTTCCT GAGGTACGAG
GGGTGGACGC CGTTGGGGGA GCTGGAGGTC GGGTCGCGCG TCGGGGTGCC GCGGCACGTG
CCTGGGCCGG AGATGACGGC CGACTGGTCC GACCGCGACG TCACGATGCT CGCGCGCATG
ATCGCGGGCC GGACCGCGCC TGGCGCAGCC GCGTGGGACG ACGAGGTGTG GGGCGAGCTG
GGGGAGCGGC TGCCGGCGTC GGTGTTCCAC CTGCCCAAGC CGCAGATCGC CCTGTTCTTG
AGGACACTGC TGACCGCCAA CGGGGCCGTG GTCCTCGGCG GTACCACCGG CCGCGTGAGC
CTGCACGCCG ACGACCGCCG CGTCCTCGAG GGGGTGAGCC GGCTGCTGCT GCGGTTCGGC
ATCTCGACGC GGCTGCGCAT CTCTGCTCAG GGTCCGCGGC TCGACGTCGT GGAGAAGGAC
GACCTCCGGC GCCTCCTGCA GGAGATCGGC ATCGACGGAC GGCACGCGCA CGCGGCCGAC
GAGCTGCTGG CGCGGGTCCG GACGCGGGAC CTCGAGACCG CCGCAGAGCC CGTCCGCCTG
TGGGACGACG TACGCACGGT GTTGACGGCC GCGGCCACCC GAACCCTCGG TGACGGCCGG
ACCCCTCTGG CGCAGGTGGT CGACGTACTC GACCGCGCAG ATCTCGACGT GGACGCGGTC
AACGACCTGC TGTGGGACGA GGTCGTGGCG ATCGAGCCAC TGGGTGAGCA GGCGGTGTAC
GACGCGACCG TGGTCGGCAC CCACAACTTC ATCGCCAACG GCATCGCCGT CCACAACAGC
ATCGAGCAGG ACGCCGACAT GGTGATCCTG CTGCACCGTG AGGACGCGTA CGAGAAGGAG
TCGCCGCGAG CCGGTGAGGC CGACCTGATC GTGGCGAAGC ACCGTAACGG TCCCACCGAC
ACGATCACCG TGGCGTTCCA GGGGCACTAC TCACGATTTG TCGACATGCA GATGTAG
 
Protein sequence
MTIEDLEYGA PPDSGHSSGG GFDRTPPQDL DAERSVLGGM MISKDAIADV IEQIKGTDFY 
RPAHEAIYDA ILDLYGRGEP ADAITVADEL TKRGEMGRVG GAAYLHTLIA SVPTAANAGF
YARIVRERSI LRKLVEAGTR IVQLGYATDG GDVDELVNNA QAEVYAVTER RASEDYLPLS
EVIGGTVDEI EAAGHRGEGM IGVPTGFSDL DRLTNGLHPG QMIVIAARPA IGKALALDTP
LPTPTGWTTM GEVQVGDQLI ADDGTITRVV AATDVMTDRP CYRVTFDDGS TIVADAQHQW
ATRTRAERRV GADASVRTTE ELAATVRCAT ADARVNHSVA TTAPLSLPER ELLVDPYLLG
VWLGDGQSAA ARFTSADPEI AMRIEGRGYD ADVLTSSLAT LGLGAELHIP ADYLRAGEAQ
RRELLAGLLD TDGTVNPTGS VQFAVTHERL ARDVRELVHS LGYRTGWSER NARGRSAASP
TCFTITFTTD DDVFALERKK LVHKERRRRS TPRLHQRYVV SVEPIESVPV RCVEISHQSH
LYLAGEAMIP THNSTVGIDI VRSSAIKHNM AAVVFSLEMS RNEIVMRLLS AEARVHLQKL
RTGAMGEDDW AKVAATMGRI SEAPLFIDDS PNMSLMEIRA KCRRLKQRHD LKLVVIDYLQ
LMTSGKRVES RQQEVSEFSR ALKLLAKELE VPVIAISQLN RGPEQRTDKR PAMSDLRESG
CLTEDTRVLR ADTGAETTLG EMYALGHKDV PIWALDDRLQ YVRRHLTHVF PTGVKPVYRL
RLASGKEVTA TANHPFLRYE GWTPLGELEV GSRVGVPRHV PGPEMTADWS DRDVTMLARM
IAGRTAPGAA AWDDEVWGEL GERLPASVFH LPKPQIALFL RTLLTANGAV VLGGTTGRVS
LHADDRRVLE GVSRLLLRFG ISTRLRISAQ GPRLDVVEKD DLRRLLQEIG IDGRHAHAAD
ELLARVRTRD LETAAEPVRL WDDVRTVLTA AATRTLGDGR TPLAQVVDVL DRADLDVDAV
NDLLWDEVVA IEPLGEQAVY DATVVGTHNF IANGIAVHNS IEQDADMVIL LHREDAYEKE
SPRAGEADLI VAKHRNGPTD TITVAFQGHY SRFVDMQM
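
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any downstream use. The following is a minimal sketch of that cleanup plus a GC-fraction calculation of the kind summarized in the GC content field; the helper names are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, line breaks) and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied as printed (six blocks of ten bases).
first_line = "GTGACCATCG AGGACCTCGA GTACGGCGCG CCGCCGGACA GCGGGCACAG CAGCGGCGGC"
seq = clean_sequence(first_line)
print(len(seq))                   # 60
print(f"{gc_fraction(seq):.0%}")  # GC fraction of this 60-base sample only
```

Note that the gene begins with GTG while the protein begins with M: under translation table 11, GTG is an accepted alternative start codon and is read as methionine at the initiation position.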