Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_1456 |
Symbol | |
ID | 5898911 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 1550259 |
End bp | 1553147 |
Gene Length | 2889 bp |
Protein Length | 962 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641561943 |
Product | helicase c2 |
Protein accession | YP_001683084 |
Protein GI | 167645421 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG1199] Rad3-related DNA helicases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.107877 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAGAGC CCTTGGCCGC CGGCAAGCCT TGGGATTGTT CTCGTTCCGT TTCGACGGCC GATCCGCTAA AAGGGCCTGT GACCGCACAG ATTCCCGCCC TAGACCTCGC CCCCGCCCTG GTGGTGCTGC CCGGCCCGCG CGCCGGCTTC GCCGACGGCT CGGGCGAGGG CCGGCCGCTG CGCACGCCGG ACGCCCGCGA CCTGTTCGAA TACGGGCCGG TGCTGGTCGC CCACGCCGCC ATGACCGCCA AGCGGCTGAA CCTGCACGCG CCGGTCCGAA GCCGGGGCTG TTTCGACGTG CTGGAGCTCT ATGCCTTCAC CCGCCCGGCC ACCTTCTGCG CGCCGTCGGC GGTGGGGCTG GCCACGGCGT TGGGCCTGGC CGAGCCCAAG GGCGCGCCGG CCCAGGCCCA GACCCTGCGC GACGCCGCCG CCGCCCTGAT CCACGAGCTG GCGATGACGC CCAGGCCGTC GCGCGAGGAA GCCCTGGCCG TCGCCGAGAC CCTGGCCCGG GCCGGCTGGG CCTGGGGGCC GGCGGTGGTC GGCGCCCTGC GCTCGGCCCC TGTCGGCAAC CAGTTCCGCA GCTCCGGCCT CGACGTCTGG GCCCGTGCGC TCGAATGGGA GGACCAGGCG CCGCCCGGCG AGCCCGGCTC CAAGCCGGTC GACCCGGAAC GGGCCAGCGA GCGGCTGAAG GAGCTGCTGC ACCGCTCGGG CCTGGACGAG GTGCGCGAGG CCCAGGACAC CTTCGCCCGC GAGGCCGCCT ACGCTTTCGC CCCCAGGGAG CGCGAGGGCG AGCCGCAGAT GATGCTGGCC GAGGCCGGCA CCGGGGTCGG CAAGACCCTG GGCTATCTGG CCCCCGCCTC GCTGTGGGCC GAGGCCAACG GCCCCGCGGT CTGGATCAGC ACCTATACCC GCGCCCTGCA GCGGCAGATC GAGCGCGAGA GCCATTCGAT CTATCCCGAC CCCGTGGTCC GAGCCCGCAA GGCGGTGGTC CGCAAGGGCC GGGAGAACTA TCTGTGCCTG CTGAACTTCC AGGAGCAGGT CAACACCGCC CAGCTGGGCA ATGGCGATCT GATCGGCTTG GGGCTGGCGG CCCGCTGGGC GCGTAACAGC CGCGACGGCG ACATGACCGG CGGCGACTTT CCGGCCTGGC TGCCGACCCT GTTCGCCATC GCCCCCTCGG TCCAGGCCGG CCCGGCCAAT CTGGTCGACC GGCGCGGCGA GTGTATCCAT GCCGGCTGCC AGCACTATCG CGTCTGTTTC ATCGAGAAGG CGGTGCGGGC CTCCAAGCGC GCCGACATCG TCATCGCCAA CCACGCCCTG GTGCTCAGCC AGGCGGCCTT CGACGGCGCG CGGGCGGCGC GTGGCCTGAA GGGCGACAAC GAGACCACCT CGCTCAAGCG CATCGTCTTC GACGAAGGCC ACCACCTGTT CGACGCCGCC GATAGCGCCT TCTCGGCCGC GCTCTCGGGC GCAGAGGCCG CCGAGATGCG CCGCTGGCTG CGCGGACCGG AGGGACGGGG CCGCCGGGGC CGGGGGCTGG AGACCCGCCT GCTCGACATC ATCGGCGAGC GCGAGGGCGC GCGCGCCGCG CTGACCGCCG CCCTGCACGC CGCCGCCGCC CTGCCGGGCG AGGGCTGGTC CGGCCGCATC GCGCCGCCCG ACGGCCAGGT CAATCCGATC GGCCCGATCG AGAATTTCCT GGTCGCGGTC ATCGAGCAAC TGCGGGCGCG GGTCAGCGAC CGTCCCGGCG CGGCCGAGCT GGGGCTGCAA TGCGCCGCCC GGCCGGCGGT CGAGTTGGTG CGCGAGCGGG CCGGCGAGGC CGCCAAGGCC CTGGCCGGCA TCGAGGCCCC GCTGCTGGCC CTGGCCCGCC ACCTGGAGGA CGTGCTCGAC GAGGAAGCCG ACACCCTGCC GACCTCGGAG CGGGCGCGGA TCGAGGGCGC CCTGCGCGGC CTGGATCGCC GGTGCCGCAT GACCCTGCCG GCCTGGCGCT CGATCCTGAA AGCCATCGAC GAAGACGCCG AGGACGATCC CGACTTCGTC GACTGGTTCG AGGCGACGTT CCTGTACGGC CGGGTCGTCG ACGCCGCCTG CCGCAGGCAT TGGGTGGACC CGACAGAACC GCTGCGCGCG GCGGTGCTGT CGCCCGCCCA CGGCGTGCTG GTGACGAGCG CCACCCTGAC CGACCCGGCC TTGGAGGACC CGTTCGCCCT GGCCGAGATG CGCACCGGCG CCGCGCGCCT GCCGCAGTCG CCCAAGGTGC TGCGCCTGGT CTCGCCGTTC GACTACGCCA ACAACGCCAA GGCCTTCGTG GTCACCGACG TTGGAAAGGA CGATCCTCGT CAGGTGTCGG CGGCGATGCG CGAGCTGTTT CTGGCGGCTG GCGGCGGCGG CCTTGGCCTG TTCACCGCCA TCCGTCGATT GAAAGCCGTG CACGAGCGCA TCGCCGCGCC CCTGGCCGAC CAGGGTCTGG CTCTCTATGC CCAGCACGTC GATCCGCTGG AGGCGGGGGC GCTGGTCGAC ATCTTCCGGG CCGAGGAGGA CGCCTGCCTG CTGGGCACCG ACGCCATCCG CGACGGCATC GACGTGCCGG GCCGGTCCCT GCGCCTGCTG GTCTTCGACC GCGTGCCCTG GCCGCGCCCC GACGTGCTGC ACAAGGCCCG CCGCGCCCGG TTCGGCGGCA AGGGCTACGA CGACAGCACG GCCCGCGCCC GTATCAGCCA GGCGTTCGGT CGCCTGATCC GCCGGGCCGA CGACAAGGGC GTGTTCGTGA TGCTGGACGC CGCCGCCCCG ACGCGGCTGT TCTCCAGCTT GCCGGACGGC GTGACCATCG AGCGCCTCGG CCTTGTCGAG GCGATCGAGG CGACCGCCGC ATTTTTGTCG AAAGCCTGA
|
Protein sequence | MAEPLAAGKP WDCSRSVSTA DPLKGPVTAQ IPALDLAPAL VVLPGPRAGF ADGSGEGRPL RTPDARDLFE YGPVLVAHAA MTAKRLNLHA PVRSRGCFDV LELYAFTRPA TFCAPSAVGL ATALGLAEPK GAPAQAQTLR DAAAALIHEL AMTPRPSREE ALAVAETLAR AGWAWGPAVV GALRSAPVGN QFRSSGLDVW ARALEWEDQA PPGEPGSKPV DPERASERLK ELLHRSGLDE VREAQDTFAR EAAYAFAPRE REGEPQMMLA EAGTGVGKTL GYLAPASLWA EANGPAVWIS TYTRALQRQI ERESHSIYPD PVVRARKAVV RKGRENYLCL LNFQEQVNTA QLGNGDLIGL GLAARWARNS RDGDMTGGDF PAWLPTLFAI APSVQAGPAN LVDRRGECIH AGCQHYRVCF IEKAVRASKR ADIVIANHAL VLSQAAFDGA RAARGLKGDN ETTSLKRIVF DEGHHLFDAA DSAFSAALSG AEAAEMRRWL RGPEGRGRRG RGLETRLLDI IGEREGARAA LTAALHAAAA LPGEGWSGRI APPDGQVNPI GPIENFLVAV IEQLRARVSD RPGAAELGLQ CAARPAVELV RERAGEAAKA LAGIEAPLLA LARHLEDVLD EEADTLPTSE RARIEGALRG LDRRCRMTLP AWRSILKAID EDAEDDPDFV DWFEATFLYG RVVDAACRRH WVDPTEPLRA AVLSPAHGVL VTSATLTDPA LEDPFALAEM RTGAARLPQS PKVLRLVSPF DYANNAKAFV VTDVGKDDPR QVSAAMRELF LAAGGGGLGL FTAIRRLKAV HERIAAPLAD QGLALYAQHV DPLEAGALVD IFRAEEDACL LGTDAIRDGI DVPGRSLRLL VFDRVPWPRP DVLHKARRAR FGGKGYDDST ARARISQAFG RLIRRADDKG VFVMLDAAAP TRLFSSLPDG VTIERLGLVE AIEATAAFLS KA
|
| |