Gene Caul_1456 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1456 
Symbol 
ID5898911 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1550259 
End bp1553147 
Gene Length2889 bp 
Protein Length962 aa 
Translation table11 
GC content73% 
IMG OID641561943 
Producthelicase c2 
Protein accessionYP_001683084 
Protein GI167645421 
COG category[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG1199] Rad3-related DNA helicases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.107877 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGAGC CCTTGGCCGC CGGCAAGCCT TGGGATTGTT CTCGTTCCGT TTCGACGGCC 
GATCCGCTAA AAGGGCCTGT GACCGCACAG ATTCCCGCCC TAGACCTCGC CCCCGCCCTG
GTGGTGCTGC CCGGCCCGCG CGCCGGCTTC GCCGACGGCT CGGGCGAGGG CCGGCCGCTG
CGCACGCCGG ACGCCCGCGA CCTGTTCGAA TACGGGCCGG TGCTGGTCGC CCACGCCGCC
ATGACCGCCA AGCGGCTGAA CCTGCACGCG CCGGTCCGAA GCCGGGGCTG TTTCGACGTG
CTGGAGCTCT ATGCCTTCAC CCGCCCGGCC ACCTTCTGCG CGCCGTCGGC GGTGGGGCTG
GCCACGGCGT TGGGCCTGGC CGAGCCCAAG GGCGCGCCGG CCCAGGCCCA GACCCTGCGC
GACGCCGCCG CCGCCCTGAT CCACGAGCTG GCGATGACGC CCAGGCCGTC GCGCGAGGAA
GCCCTGGCCG TCGCCGAGAC CCTGGCCCGG GCCGGCTGGG CCTGGGGGCC GGCGGTGGTC
GGCGCCCTGC GCTCGGCCCC TGTCGGCAAC CAGTTCCGCA GCTCCGGCCT CGACGTCTGG
GCCCGTGCGC TCGAATGGGA GGACCAGGCG CCGCCCGGCG AGCCCGGCTC CAAGCCGGTC
GACCCGGAAC GGGCCAGCGA GCGGCTGAAG GAGCTGCTGC ACCGCTCGGG CCTGGACGAG
GTGCGCGAGG CCCAGGACAC CTTCGCCCGC GAGGCCGCCT ACGCTTTCGC CCCCAGGGAG
CGCGAGGGCG AGCCGCAGAT GATGCTGGCC GAGGCCGGCA CCGGGGTCGG CAAGACCCTG
GGCTATCTGG CCCCCGCCTC GCTGTGGGCC GAGGCCAACG GCCCCGCGGT CTGGATCAGC
ACCTATACCC GCGCCCTGCA GCGGCAGATC GAGCGCGAGA GCCATTCGAT CTATCCCGAC
CCCGTGGTCC GAGCCCGCAA GGCGGTGGTC CGCAAGGGCC GGGAGAACTA TCTGTGCCTG
CTGAACTTCC AGGAGCAGGT CAACACCGCC CAGCTGGGCA ATGGCGATCT GATCGGCTTG
GGGCTGGCGG CCCGCTGGGC GCGTAACAGC CGCGACGGCG ACATGACCGG CGGCGACTTT
CCGGCCTGGC TGCCGACCCT GTTCGCCATC GCCCCCTCGG TCCAGGCCGG CCCGGCCAAT
CTGGTCGACC GGCGCGGCGA GTGTATCCAT GCCGGCTGCC AGCACTATCG CGTCTGTTTC
ATCGAGAAGG CGGTGCGGGC CTCCAAGCGC GCCGACATCG TCATCGCCAA CCACGCCCTG
GTGCTCAGCC AGGCGGCCTT CGACGGCGCG CGGGCGGCGC GTGGCCTGAA GGGCGACAAC
GAGACCACCT CGCTCAAGCG CATCGTCTTC GACGAAGGCC ACCACCTGTT CGACGCCGCC
GATAGCGCCT TCTCGGCCGC GCTCTCGGGC GCAGAGGCCG CCGAGATGCG CCGCTGGCTG
CGCGGACCGG AGGGACGGGG CCGCCGGGGC CGGGGGCTGG AGACCCGCCT GCTCGACATC
ATCGGCGAGC GCGAGGGCGC GCGCGCCGCG CTGACCGCCG CCCTGCACGC CGCCGCCGCC
CTGCCGGGCG AGGGCTGGTC CGGCCGCATC GCGCCGCCCG ACGGCCAGGT CAATCCGATC
GGCCCGATCG AGAATTTCCT GGTCGCGGTC ATCGAGCAAC TGCGGGCGCG GGTCAGCGAC
CGTCCCGGCG CGGCCGAGCT GGGGCTGCAA TGCGCCGCCC GGCCGGCGGT CGAGTTGGTG
CGCGAGCGGG CCGGCGAGGC CGCCAAGGCC CTGGCCGGCA TCGAGGCCCC GCTGCTGGCC
CTGGCCCGCC ACCTGGAGGA CGTGCTCGAC GAGGAAGCCG ACACCCTGCC GACCTCGGAG
CGGGCGCGGA TCGAGGGCGC CCTGCGCGGC CTGGATCGCC GGTGCCGCAT GACCCTGCCG
GCCTGGCGCT CGATCCTGAA AGCCATCGAC GAAGACGCCG AGGACGATCC CGACTTCGTC
GACTGGTTCG AGGCGACGTT CCTGTACGGC CGGGTCGTCG ACGCCGCCTG CCGCAGGCAT
TGGGTGGACC CGACAGAACC GCTGCGCGCG GCGGTGCTGT CGCCCGCCCA CGGCGTGCTG
GTGACGAGCG CCACCCTGAC CGACCCGGCC TTGGAGGACC CGTTCGCCCT GGCCGAGATG
CGCACCGGCG CCGCGCGCCT GCCGCAGTCG CCCAAGGTGC TGCGCCTGGT CTCGCCGTTC
GACTACGCCA ACAACGCCAA GGCCTTCGTG GTCACCGACG TTGGAAAGGA CGATCCTCGT
CAGGTGTCGG CGGCGATGCG CGAGCTGTTT CTGGCGGCTG GCGGCGGCGG CCTTGGCCTG
TTCACCGCCA TCCGTCGATT GAAAGCCGTG CACGAGCGCA TCGCCGCGCC CCTGGCCGAC
CAGGGTCTGG CTCTCTATGC CCAGCACGTC GATCCGCTGG AGGCGGGGGC GCTGGTCGAC
ATCTTCCGGG CCGAGGAGGA CGCCTGCCTG CTGGGCACCG ACGCCATCCG CGACGGCATC
GACGTGCCGG GCCGGTCCCT GCGCCTGCTG GTCTTCGACC GCGTGCCCTG GCCGCGCCCC
GACGTGCTGC ACAAGGCCCG CCGCGCCCGG TTCGGCGGCA AGGGCTACGA CGACAGCACG
GCCCGCGCCC GTATCAGCCA GGCGTTCGGT CGCCTGATCC GCCGGGCCGA CGACAAGGGC
GTGTTCGTGA TGCTGGACGC CGCCGCCCCG ACGCGGCTGT TCTCCAGCTT GCCGGACGGC
GTGACCATCG AGCGCCTCGG CCTTGTCGAG GCGATCGAGG CGACCGCCGC ATTTTTGTCG
AAAGCCTGA
 
Protein sequence
MAEPLAAGKP WDCSRSVSTA DPLKGPVTAQ IPALDLAPAL VVLPGPRAGF ADGSGEGRPL 
RTPDARDLFE YGPVLVAHAA MTAKRLNLHA PVRSRGCFDV LELYAFTRPA TFCAPSAVGL
ATALGLAEPK GAPAQAQTLR DAAAALIHEL AMTPRPSREE ALAVAETLAR AGWAWGPAVV
GALRSAPVGN QFRSSGLDVW ARALEWEDQA PPGEPGSKPV DPERASERLK ELLHRSGLDE
VREAQDTFAR EAAYAFAPRE REGEPQMMLA EAGTGVGKTL GYLAPASLWA EANGPAVWIS
TYTRALQRQI ERESHSIYPD PVVRARKAVV RKGRENYLCL LNFQEQVNTA QLGNGDLIGL
GLAARWARNS RDGDMTGGDF PAWLPTLFAI APSVQAGPAN LVDRRGECIH AGCQHYRVCF
IEKAVRASKR ADIVIANHAL VLSQAAFDGA RAARGLKGDN ETTSLKRIVF DEGHHLFDAA
DSAFSAALSG AEAAEMRRWL RGPEGRGRRG RGLETRLLDI IGEREGARAA LTAALHAAAA
LPGEGWSGRI APPDGQVNPI GPIENFLVAV IEQLRARVSD RPGAAELGLQ CAARPAVELV
RERAGEAAKA LAGIEAPLLA LARHLEDVLD EEADTLPTSE RARIEGALRG LDRRCRMTLP
AWRSILKAID EDAEDDPDFV DWFEATFLYG RVVDAACRRH WVDPTEPLRA AVLSPAHGVL
VTSATLTDPA LEDPFALAEM RTGAARLPQS PKVLRLVSPF DYANNAKAFV VTDVGKDDPR
QVSAAMRELF LAAGGGGLGL FTAIRRLKAV HERIAAPLAD QGLALYAQHV DPLEAGALVD
IFRAEEDACL LGTDAIRDGI DVPGRSLRLL VFDRVPWPRP DVLHKARRAR FGGKGYDDST
ARARISQAFG RLIRRADDKG VFVMLDAAAP TRLFSSLPDG VTIERLGLVE AIEATAAFLS
KA