Gene Cfla_3297 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCfla_3297 
Symbol 
ID9147213 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCellulomonas flavigena DSM 20109 
KingdomBacteria 
Replicon accessionNC_014151 
Strand
Start bp3663295 
End bp3666135 
Gene Length2841 bp 
Protein Length946 aa 
Translation table11 
GC content73% 
IMG OID 
Producthelicase domain protein 
Protein accessionYP_003638375 
Protein GI296131125 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones91 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGACAG CGACGTACGC CCCCGGTGCC CTGGTACGGG CGAGGGGGCG CGAGTGGGTG 
GTGCTGCCGG ACAGCTCCGC GGAGTTCCTG CTGCTGCGCC CGCTCGGCGG CGGTCACGAC
GACGTAGCGG GCGTGCACAC CGCGCTCGAG CGCGTCGAGG ACGCCACGTA CCCGCTGCCG
GACCCGGACG ACCTGGGCGA CGCGACCAGC GCGGGCCTGC TGCGGACAGC TCTGCAGCTC
GGCTTCCGCT CGTCGGCCGG CCCGTTCCGC TCGCTCGCGG GCATCGCGGT GGAGCCGCGC
GCCTACCAGC TGGTGCCGTT GCTGATGGCG CTGCGGCAGG AGACGGTGCG CCTGCTGGTG
GCCGACGACG TCGGCATCGG CAAGACCGTC GAGGCGGGAC TGATCGCGGC CGAGCTGCTG
GCCCAGGGGT CCGCCAAGCG GCTCGCGGTG CTGTGCAGCC CGGCGCTCGC GGAGCAGTGG
CAGGCGGAGC TGGCGTCGAA GTTCCACATC CAGGCCGAGC TCGTCCTCAC CTCGACGGTC
CGCCGCCTGG AACGTGACCT CATGATGAAC GAGTCGCTGT TCGACCGGTA CCCGCACGTC
GTGGTCTCGA CGGACTTCAT CAAGTCCGAC CGCCACCGCG CGGAGTTCCT CAACCACTGC
CCCGACCTGG TGATCGTCGA CGAGGCGCAC ACGGCCGTCG CGGACGACAG CGCGGCGGGC
GGGCGGCAGC GGCACCGCCG CCACGAGCTG CTGACGGACC TGGCGCGCGA CCCGCGACGT
CATCTCGTCC TGGTGACGGC GACTCCGCAC TCGGGCAAGG AGGACGGCTT CCGCAACCTG
GTCGGCCTGC TGGATCCGGA GCTGGCGCAC CTTGACCTGG ACCGCCCGCA GAACCGCGAG
CGACTTGCCC GGCACTTCGT CCAGCGACGC CGCGGCGACA TCCGCCACTA CCTCGACGAG
GACACGCAGT TCCCGTCGGA CCGCGAGCTG CGGGAGGTCG CCTACGCGCT GAAGCCCGAG
TACCGGGCGC TGTTCACCAA GGTGCTCGAC TACGCCCGCG AGCAGGTGCG CAGCGCAGGC
GACGGGTCGG TGCACCAGCG CGTGCGGTGG TGGTCGGTGC TCGCGCTCCT GCGCACGCTG
GCATCGTCGC CGGCAGCCGC AGCAGCGACG CTGCGGATGC GGGCCGCAGC AGCTGAGGCC
GCGGACCTGG CCGAGGCAGA CGCACTGGGG CGCGCGAGCG TCCTCGACAC CGCGGACGAC
GAGGCCGTCG AGGCCGTCGA TGTCACCCCG GGCGCGCTGA CCCAGGACGC GGAGAGCACG
ACAGCGTCCG AGCGCTCCAG GCTGCGCGCG CTGGCGCAGA GCGCCGAGGC GCTGGAGGGC
CCGGCGACGG ACGCGAAGCT GGCGACGCTG GTCAAGGAGG TCAAGGCCCT CCTGGCCGAC
GGGTTCGACC CGATCATCTT CTGCCGGTTC ATCGACACCG CGGAGTACGT CGGCCGGCAC
CTGGCGGACG CGCTGAAGAA GACCGCGGAG ATCGCCGTGG TCACGGGGAC GCTGCCGCCG
GCGGAGCGGC AGGAGCGGAT CCGTGCGCTG ACGTCGACGG ACGCCCGGCA CGTCCTGGTC
GCAACGGACT GCCTGTCCGA GGGCGTCAAC CTCCAGGACG CGTTCCAGGC GGTCGTGCAC
TACGACCTGG CGTGGAACCC GACCCGGCAC GAGCAGCGCG AGGGACGCGT CGACCGCTTC
GGCCAGACGG CGCCGACCGT GCGTGCGGTG ACGATCTACG GCAAGGACAA CAGCATCGAC
GGCCTGGTGC TGGACGTGCT GATCCGCCGG CACCAGGCGA TCAGCAAGGC CACGGGCGTC
ACCGTGCCGG TGCCGAGCCA GTCGGACGCA TTGCTGGAAG CCCTCCTGGA AGGTCTCGTG
CTGCGCGGGG TGAACCACGA GCAGATGGAG CTCGACCTGG GCCTGGCCGA TGCCGACCGC
CGTCTGGAGC AGGAGTGGCG CTCGGCTGCG GAGACCGAGA AGGCGTCGCG CACCAAGTAC
GCGCAGGGCA CCATCCACCC CGACGCGGTC GCGCAGGAGG TCGCCGAGGT GCGTGCCGCA
CTCGGGGCCC ACGGGCAGAT CGAGCCGTTC GTCGAGGAGT CGCTGCGTGC GCTCGGCGCC
TCGGTGACAC CCACGGTTGT CGGGTTCGAG GCGATCACGG ACACCCTCGC TCCCGGTCTG
CGCGACGCGT TGCCCCCGGG CGCGACGTCA CCGCTGCGCT TCCACCGGCA GATGCCCGCA
CCCCGGCGCG ACGCGCTGCT GGTGCGGACA GACCCGAGCG TCGAGGCGGT GGCGCGCTAC
GTGCTCGACA CCGCGCTCGA CGCGCAGACG ACGGCCCGTC GCGCGCCTGC ACGCCGCGCC
GGCGTCGTCC GGACGACGCA CGTGTCGTCA CGTACGACCG CCCTGCTGCT CCGTTACCGC
TTCCACCTCG ACCTGCCGTC GGCCGATGGG CCGCGCACGC TGATCGCCGA GGACTGCGAG
GTCGTCGCGT TCCGCGGCCA CCCGGATGAC CCCGAGGTGC TGGGTGAGCC GGAGGTCGCG
GCCCTGCTGG CGGCGCCCGC GGAGGCGAAC GTGCCCGGCG ACCAGGCCCG CGACCTCATC
GCGGCTGCTC TCGGCCGCGT GGAGACCTGG GGGTCGGTGC TCGACGCCCG GGCGCAGGAG
CGCGCGGAGC GCTTGCTCGA GAGTCACCGC CGTGTCCGCG CGGGCGCCGG GGCCGCCCGG
CAGGGTCTGC GCGTCCGTGC GCAGACGCCG GTGGACGTCC TCGGTGTGTA CCTGTACCTG
CCTGTCGTCG GAGGCCGTTG A
 
Protein sequence
MTTATYAPGA LVRARGREWV VLPDSSAEFL LLRPLGGGHD DVAGVHTALE RVEDATYPLP 
DPDDLGDATS AGLLRTALQL GFRSSAGPFR SLAGIAVEPR AYQLVPLLMA LRQETVRLLV
ADDVGIGKTV EAGLIAAELL AQGSAKRLAV LCSPALAEQW QAELASKFHI QAELVLTSTV
RRLERDLMMN ESLFDRYPHV VVSTDFIKSD RHRAEFLNHC PDLVIVDEAH TAVADDSAAG
GRQRHRRHEL LTDLARDPRR HLVLVTATPH SGKEDGFRNL VGLLDPELAH LDLDRPQNRE
RLARHFVQRR RGDIRHYLDE DTQFPSDREL REVAYALKPE YRALFTKVLD YAREQVRSAG
DGSVHQRVRW WSVLALLRTL ASSPAAAAAT LRMRAAAAEA ADLAEADALG RASVLDTADD
EAVEAVDVTP GALTQDAEST TASERSRLRA LAQSAEALEG PATDAKLATL VKEVKALLAD
GFDPIIFCRF IDTAEYVGRH LADALKKTAE IAVVTGTLPP AERQERIRAL TSTDARHVLV
ATDCLSEGVN LQDAFQAVVH YDLAWNPTRH EQREGRVDRF GQTAPTVRAV TIYGKDNSID
GLVLDVLIRR HQAISKATGV TVPVPSQSDA LLEALLEGLV LRGVNHEQME LDLGLADADR
RLEQEWRSAA ETEKASRTKY AQGTIHPDAV AQEVAEVRAA LGAHGQIEPF VEESLRALGA
SVTPTVVGFE AITDTLAPGL RDALPPGATS PLRFHRQMPA PRRDALLVRT DPSVEAVARY
VLDTALDAQT TARRAPARRA GVVRTTHVSS RTTALLLRYR FHLDLPSADG PRTLIAEDCE
VVAFRGHPDD PEVLGEPEVA ALLAAPAEAN VPGDQARDLI AAALGRVETW GSVLDARAQE
RAERLLESHR RVRAGAGAAR QGLRVRAQTP VDVLGVYLYL PVVGGR