Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_0783 |
Symbol | |
ID | 5711219 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | + |
Start bp | 787616 |
End bp | 790579 |
Gene Length | 2964 bp |
Protein Length | 987 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641266692 |
Product | helicase domain protein |
Protein accession | YP_001532129 |
Protein GI | 159043335 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG4581] Superfamily II RNA helicase |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.000205991 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 0.0820229 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGACACG ACGCGCGGAT CACCGCGGTC CTCGGTCCGA CCAATACCGG CAAGACCCAT TACGCCATAG ATCGGATGCT GGGCTACCGG ACGGGCGTGA TCGGGTTGCC CCTGCGGCTG CTCGCGCGGG AGGTCTATGA CAAGATCGTC GCCCTGCGCG GCCCGTCCGT GGTGGCCCTG GTGACCGGCG AGGAACGGAT CGTGCCGGAC CGGGTGCAAT ACTGGGTCTG CACGGTCGAG GCGATGCCGC AGGAGATTGG CGCCGATTTC CTCGCGGTGG ACGAGATCCA GCTTTGTGCC GATCCCGAGC GGGGCCATGT GTTCACCGAC CGCCTGCTCA GGGCACGCGG TCTGCAGGAG ACGCTGTTTC TGGGGGCCGA GACCATGCGC GGGGCGATTT CCGCGCTGGT GCCCAAGGCG CAGTTCCTGC GCCGGGAGCG GTTTTCCGAG CTGACCTATA CCGGGGCCAA GAAGATCAGC CGGATGAAGC CGCGCTCGGC CATCGTCGGG TTCTCGGTCG ACAATGTCTA TGCCATGGCG GAGCTGATCC GGCGGCAGAA GGGCGGTTGT GCCGTGGTGA TGGGCGCGCT CAGCCCGCGC ACCCGCAACG CCCAGGTGGA CCTTTATCAG AACGGCGATG TGGATTACCT GGTGGCGACG GACGCCATCG GGATGGGGCT GAACCTGGAC GTGGCCCATG TCGCCTTCTC CAGCCTGTCG AAATTCGACG GTCGCCGGAT GCGGGCGCTG GCGCCCAACG AGCTGGCCCA GATCGCCGGG CGCGCGGGGC GCTACATGAC GCCCGGCACC TTCGGGGTGA CCGGCGAGGC GCCCGAAATC ACGCCGGAGG TGGTCGCGGC CATCGAGGCC TCGCGGTTTA CCCCGATTCG CAAGCTGCAA TGGCGCAGCG CCCGGCTCGA CTTCACCGGG CCCGGGCGGC TCATCGCCTC GCTGGAGGCC GCGCCAGAGG ACGAGTGGCT CGCCCGATCC CGGGAATCGG ACGATCTGGC CGCGCTCAAG ACCCTGGCGG CAGAGGCCGA GGTCGCCGCG CGCCTGTCGG ATGCGCGCGA TCTGCGGCTT TTGTGGGATG TTTGCCGAAT CCCCGACTTC CGCGGCATCT CGGCGGGCGA ACACGCGGGC ATGCTGACGC GGATTTTCGG CTTCCTGCAT GACGAGGGCC GCGTGCCCGA GCCGTGGTTT GCGCGACAAG TCGCCCGTAT CGACCGGACA GATGGCGACA TTGACACATT GTCGAAACGC TTGGCGTATA TTCGCACGTG GACCTATGTC GCCCAACGCA AGGGGTGGGT CGAACAGGAT AGCCATTGGC GCGGCGAAAC CCGCGCTGTA GAAGACCGCT TGTCAGATGC CCTGCACACG GCACTGACCC AGAGATTTGT CGACCGCCGC ACATCTGTGC TCCTGCGGCG GTTGAAGCAG AAGGAGAGCC TGGTGGCTGA CGTAAACGAC AAGGGTGAAG TAACCGTTGA GGGCGAGCTG ATCGGCCGTC TCGAGGGGTT CCGCTTCCGC CAGGACAAGA CTGCGACGCC GGAGGAGGCC AAGACCGTGC GCGCCGCCGC TGTGGCCGCG CTCGCGCCGG AGTTTCATCT GCGCGCCGAC CGCTTCTACA ACGCGCCGGA GACCGAGATC GACTATACCG AGCAGGGTGG TCTGATGTGG GGCGAGCATG CGGTGGGCAA GCTCGTCAAG GGCGCCGACC CGCTCAAGCC CATGGTGCAG GCCTTCGTCG ATGACGAGGC CGGGCCCGAG GTCGCCCAGA AGGTCGAGCG CCGCTTGCAG CATTTCATCG ACCGCAAGAT CGCCGCCCTG TTCGAGCCGC TCATGGCCAT GAGCCGGGAC GAGGCGCTGA CCGGGCTAGT GCGCGGATTC GCCTTCCAGT TGGTCGAGGC GCTGGGCGTG CTGCCGCGCG GGACCGTGGC CAATGACGTC AAATCGCTGG ACCAGGATGC CCGCGCGCTC CTGCGCAAGC ACGGGGTGCG GTTCGGGCAG TTCACGATCT TCCTGCCCCT GTTGCTGAAG CCCGCGCCGA CCCGGTTGCG GCTGGTCCTG TCGTCGCTGG CCAAGGAGCG CGACGTGTTC CCCGAAAGCC CGCCGCCGGG CCTCGTGACG ATTCCGACGG TCGAGGGGGT CGCGCAGGAG ATCTACACCG AATCCGGGTA TCGGGCCGCC GGGGCGCGGG CGATCCGCAT CGACATGCTG GAGCGTCTGG CCGACATGCT GCGCAGCGTC GATACCCGTG GCGGGTTCGA GGCAACGCCG GACATGCTCT CGATCACGGG TATGACGTTG GAACAGTTCG CCGACCTGAT GCAGGGGCTG GGCTATGCCG CCGAGAAGGG CGAGCGGCCC AAGGTGAAGC CCGCGCCGGT CGCGGCTGCG CCGGCAGAGG TGGCGGAGAC GCCGGAGGGT GAGCCGTCCG ACGTGCCGTC AGAGACGCCC AACGAGACCC CCTCGGAAAC CCCGCCGGAA GCGCCAAGGG AAATGCCCGT CGAAGTTCCG CAGGAGGCCC CGGCAGAAGC ACCGCCCGAG GCCCCGACGG AAGACCCTGT CGAGGCCCCG ACCGAGACCC CCGAGACGCC GCCCGCGGAA ATACCCGACG CGACGGCGGA GCCTTCAGAG CCCGGCGCTG CCGAGGCCGT CAGCGACGTG GTTGCCGAGA AGGCCGAGCC GGAGGTGGAG GTGTTCTACA CCTTCCGTTG GTCCCCCCGA CCGCGCCGCG GCGAGGCCCG AGGGCGCCGC CAGGGTGCCG AGAGCGGCGG GCGGCCCAAT GCCAAGACCG GCGGCAAGCC CGGTGGCAAG CCGAGGGGCA AGGGCAAGCC CGGCGGGCGC CCCGCGCGCG AGACCGGGCC CAAGACCTAC CAGTCCCGGC CCGAGCGCAA GGACAAGATC GACCCGGACA ACCCCTTTGC CGCAGCCCTC ATGGGGCTCA AGGACAAGTC CTGA
|
Protein sequence | MRHDARITAV LGPTNTGKTH YAIDRMLGYR TGVIGLPLRL LAREVYDKIV ALRGPSVVAL VTGEERIVPD RVQYWVCTVE AMPQEIGADF LAVDEIQLCA DPERGHVFTD RLLRARGLQE TLFLGAETMR GAISALVPKA QFLRRERFSE LTYTGAKKIS RMKPRSAIVG FSVDNVYAMA ELIRRQKGGC AVVMGALSPR TRNAQVDLYQ NGDVDYLVAT DAIGMGLNLD VAHVAFSSLS KFDGRRMRAL APNELAQIAG RAGRYMTPGT FGVTGEAPEI TPEVVAAIEA SRFTPIRKLQ WRSARLDFTG PGRLIASLEA APEDEWLARS RESDDLAALK TLAAEAEVAA RLSDARDLRL LWDVCRIPDF RGISAGEHAG MLTRIFGFLH DEGRVPEPWF ARQVARIDRT DGDIDTLSKR LAYIRTWTYV AQRKGWVEQD SHWRGETRAV EDRLSDALHT ALTQRFVDRR TSVLLRRLKQ KESLVADVND KGEVTVEGEL IGRLEGFRFR QDKTATPEEA KTVRAAAVAA LAPEFHLRAD RFYNAPETEI DYTEQGGLMW GEHAVGKLVK GADPLKPMVQ AFVDDEAGPE VAQKVERRLQ HFIDRKIAAL FEPLMAMSRD EALTGLVRGF AFQLVEALGV LPRGTVANDV KSLDQDARAL LRKHGVRFGQ FTIFLPLLLK PAPTRLRLVL SSLAKERDVF PESPPPGLVT IPTVEGVAQE IYTESGYRAA GARAIRIDML ERLADMLRSV DTRGGFEATP DMLSITGMTL EQFADLMQGL GYAAEKGERP KVKPAPVAAA PAEVAETPEG EPSDVPSETP NETPSETPPE APREMPVEVP QEAPAEAPPE APTEDPVEAP TETPETPPAE IPDATAEPSE PGAAEAVSDV VAEKAEPEVE VFYTFRWSPR PRRGEARGRR QGAESGGRPN AKTGGKPGGK PRGKGKPGGR PARETGPKTY QSRPERKDKI DPDNPFAAAL MGLKDKS
|
| |