Gene Caul_4206 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4206 
Symbol 
ID5901668 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4571343 
End bp4573229 
Gene Length1887 bp 
Protein Length628 aa 
Translation table11 
GC content72% 
IMG OID641564728 
ProductATP-dependent DNA helicase RecQ 
Protein accessionYP_001685828 
Protein GI167648165 
COG category[L] Replication, recombination and repair 
COG ID[COG0514] Superfamily II DNA helicase 
TIGRFAM ID[TIGR00614] ATP-dependent DNA helicase, RecQ family
[TIGR01389] ATP-dependent DNA helicase RecQ 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTACGATC AGCTTGATTC CACGTTCGAC CGGACTAACC CCGCCTCGGG TGCTTCGCTG 
GAGGCGGCGC GCGAGGTTCT GCGTCGCACC TTCGGCCACT CGGACTTCCG AGGCTTGCAG
GCCGGGGTGA TCGGCGAGCT GCTGGCCGGG CGCAGCGCCC TGGCCGTGCT GCCGACCGGC
GGCGGCAAGA GCCTGTGCTA CCAGATCCCC GCCCTGGTGC GGCCGGGCCT GGGCCTGGTG
GTCTCGCCGC TGATCGCTCT GATGGCCGAC CAGGTGGCGG GCTTGCAGCA GGCCGGCGTC
GCGGCCGAGC GGCTGGACAG CAACACCCTG CCGGGCGAGC GGACCGAGAT CTGGCGGCGC
ATCGACGCCG GTACGCTGGA CCTGCTCTAT CTGTCGCCCG AGGGCCTGAT GCAGCCGTCG
ATGCTGGAGC GGCTGTCGCG CCTGCCCCTG TCGCTGGTCG CGGTCGACGA GGCCCATTGC
GTCAGCCAGT GGGGCCACGA CTTCCGGCCC GAATACCGCA TGCTGGGCCG CCTGGCCGAG
GTGTTCCCCA ACGCCCCGCG CCTGGCCGTC ACCGCCACCG CCGACGCCCG GACCCGCGAC
GACATCCGCG CCGAGCTGCG CCTGCAGGGC GCGGCCGAGT TCGTCGACAG CTTCGCCCGC
CACGAACTGG CCCTGAACGC CGAGCGCAAG AAGGGCAAGG GCCACGACCG GGTCATCGAG
CTGGTCACGG AACGGCCGAA CCGGGCCGGC GTGGTCTATG CCGGCAGCCG GGATTCCACC
GAGAAGCTGG CCGAAAAACT GATCGCCGAG GGCATCCCGG CGCTCGCCTA CCACGCCGGC
CTCGACAAGG GCGTGCGCGC CCGGCGGCTG GAAGAGTTCC TCGAGGCCGA CGAGGCGGTG
ATGGTCGCGA CCATCGCGTT CGGCATGGGG GTCGACAAGC CCGACGTCCG CTTCGTGATC
CACGCCGACC CGCCGGCCGC CATCGAGGCC TATTGGCAGG AGATCGGCCG AGCCGGCCGT
GATCGCCAGC CGGCCGAAGG CATCACCCTG TACAGCTCGG CGGATTTGGC CTGGGCCGTG
CGTCGCATCG AGGCCCAGAG CGCGCCCGAC GAGGTCAAGA CCGTGCAGCT GCGCAAGCTG
CGGCAGTTCT ACGCCATGCT CGAAGGCGTC ACCTGCCGCG CCGCCGCCGT CCGCCGCTAT
TTCGGCGAGG AGGGGGTCGA GCGCTGCGGG GTCTGCGACA TCTGCGTCTC GCCGCCGAAC
GGGATCGATG CGAGCCAGGC CGCCCAGAAG GCCCTGTCCG CGGCCCATCG CCTGGGCGGG
CGCTTTGGAC GCGGCCGGCT GGTCGACCAC CTGCTGGGCA AGACCAAGGA TGTCACGCCC
GCCGAGGCCC AGATGTCGAC CTTCGGCATC GGCAAGGAGT TCAGCCCCCA GGGCTGGCGC
GACCTGCTCG ACACCCTGGT GTTCGAGGGC CTGCTGCGCG AGGACCCCAA CGACGGCCGG
CCGCTGATCG GCCTGGGCGA TGGCGAGGGC GTGCGCCAGG TCTATCGCGG CGAACGCCTG
GTCTCGCTGC GCCAGCAGCC GACGCCCGCC GACAGCCCCG GCCGCTCTTC TGGAGGGGGC
GGGGCTGGCA AGACGGCCCG CAAGCGCCAA GCCTTGACCG TGCCGCTGGA GGACCAGGCT
CTGTTCGAGG CCCTGCGGTC CTGGCGGCGC GAGGAGGCGG CGCTGCAACA CGTTCCGCCC
TACGTGATCT TCCACGACGC CACCCTGGCC GAGATCGCCG CCGCGAGGCC GGCCACGGCG
GGCGCCCTGG CCAAGGCCGG CGGGGTGGGC CAAGGCAAGC TGGACCGGTA TGGCGAGGCC
GTGCTCAAAG TGGTGCGCGA CAACTAG
 
Protein sequence
MYDQLDSTFD RTNPASGASL EAAREVLRRT FGHSDFRGLQ AGVIGELLAG RSALAVLPTG 
GGKSLCYQIP ALVRPGLGLV VSPLIALMAD QVAGLQQAGV AAERLDSNTL PGERTEIWRR
IDAGTLDLLY LSPEGLMQPS MLERLSRLPL SLVAVDEAHC VSQWGHDFRP EYRMLGRLAE
VFPNAPRLAV TATADARTRD DIRAELRLQG AAEFVDSFAR HELALNAERK KGKGHDRVIE
LVTERPNRAG VVYAGSRDST EKLAEKLIAE GIPALAYHAG LDKGVRARRL EEFLEADEAV
MVATIAFGMG VDKPDVRFVI HADPPAAIEA YWQEIGRAGR DRQPAEGITL YSSADLAWAV
RRIEAQSAPD EVKTVQLRKL RQFYAMLEGV TCRAAAVRRY FGEEGVERCG VCDICVSPPN
GIDASQAAQK ALSAAHRLGG RFGRGRLVDH LLGKTKDVTP AEAQMSTFGI GKEFSPQGWR
DLLDTLVFEG LLREDPNDGR PLIGLGDGEG VRQVYRGERL VSLRQQPTPA DSPGRSSGGG
GAGKTARKRQ ALTVPLEDQA LFEALRSWRR EEAALQHVPP YVIFHDATLA EIAAARPATA
GALAKAGGVG QGKLDRYGEA VLKVVRDN