Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dret_1543 |
Symbol | |
ID | 8419372 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfohalobium retbaense DSM 5692 |
Kingdom | Bacteria |
Replicon accession | NC_013223 |
Strand | - |
Start bp | 1786758 |
End bp | 1789697 |
Gene Length | 2940 bp |
Protein Length | 979 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 645038117 |
Product | type III restriction protein res subunit |
Protein accession | YP_003198407 |
Protein GI | 258405665 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG1061] DNA or RNA helicases of superfamily II |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCAAACG ACTCAGGCTC ACATCCTATC TATTACCATA AGCTTGTCCG GGACAAGATT CCTTCCATCA TTCGGGATAA CGATCACCAA CCTACTGTCT TAAGTCTTTC GGGCCAGGAT TTGACCCAGG CAGCAAGTCA AAAGCTTTTG GAAGAGGCTT ATGAACTGTT CACAGAGGTT CAGGTGGGAG AGAAGCCCTC TGTGCTTAAG GAGTCAGCCG ATGTCCTGGA AGTTGTATTA ACCATCCTGA AGCAGTTGGG CTATAGCTTT GATGACCTGA TTTCTGAAAT GGAACTACGA CGGGAGCAAA GAGGCGGTTT TGAGCAGGGC CTTTTTCTGG AAAGCGTTGA CGGGCAGTTT CTCAGTCACA GATTACAGCA GAGCCCGAGT TATGTATGTT CTCACTTGGA AATAGACTCC TTATTGGATC TTTTTCGAGG AGAAATGGAA CGTAGTGACA AGGTTTGGAT AGCATCGGCC TTTTACTCCC CGGCAATAAC CAATATTTTG ATCAGTGAAT TTGAACGCTT TATCTGCAAA GGAGGAGAGG CCAGAGTCAT CTTGTCTACA ATGAACGGTT TTATTAAGCC CGAGTATCTG ACCCATCTGC GTGATCATGT GCCTTACTTG AATGTTAAGG TATTCCATCC CCCTGATATC CCTTTTCATA TCCAGCCCAA GCGAGATTTT CATGTCAAAG CCTATATATT CAAGCACAGA ACTGGCAAAG GCTCGGCTAT CATTGGATCT TCCAACTTAT CTCAGGGTGG GTTTTCCGAT AACATTGAAT GGAACTATTA CTCAGCCGGG GAGATCAACC TTCCTTTTGA AAATAAGCAG ACTCCATGGG AAAAGATAGT TCATGAGTTT GAATCCCTGT GGGCCAATGA GTGTGTGCAT GTTACCGATG ATTTTTTGGC CGGTTACAGG AAGCGACACA GGGATGTTTT TCAGGAAAGA GAGAAGCCGT CTGATGCGTA TGGAAGCGAG GAGCAAAGCC CAGTTCAGCC GGGCATTGGG GCAGACAGCG CTGCTGCATA CGGGAAAAGA AAGCAATCCA AAAGTGAATC AGAATCAGTG AGCCCCAACA TCGCCCAAGG TGAGGCTCTG GAAGGGCTTT TGAAACTCAG AAACAGAAAA GCCAGGTCCG GAGCGGTTAT TGCAGCCACC GGGGTGGGCA AGACCTATTT GGCTGCATTT GACTTCATTC AAAGCGGCAA AGACAAATGC CTTTTTATCG CCCACAGAGA AGACATCCTC AGAAAAGCCA AGGAAAGTTT TTCCCATGTC CTTGGCCCAG AGGGGTTGGA GATCTTCAGC GGCAGGAGCA AGGAGATCTC ACATGGGTCA AGGGCTGTTT TTGCTATGAT CCAGACCTTG GGACGGCAGG ACAATATGGA ACGTTTTCAT CCCGAGGAGT TTGACTACAT TGTCATGGAT GAGTTCCACC ATGCCATGGC TGCTACCTAT CGCAGGGTCT TGGATTATTT TCAGCCCGAT TTCCTGTTAG GTCTCACAGC TACTCCTGAG CGCATGGATG GACGGGATGT CCTCTGGCTA TGCGATTACA ATATTGCATA TGAAATGAGG CTTTTCCAGG CCATTGACAA AGAATTGCTG GCCCCTTTTC AGTACTTCGC TGTCCATGAT CCGACTGATT ATGCCCAGAT TTCCTGGAAA CGAACAGATT ATGATCAAGA AGAACTGACC AAGGCCCTGG CAAATGACAC CAGAACAACC ATTATTGCCA ATAACTTGAA AAAATTTCTC CCGTACCAGG GTAAGATAAA GGCCTTGGCA TTTTGTAGCT CAGTTGACCA TGCCCAGTAT ACAGCTGCGC GTTTGACCCA GGAGCATGAT TTTGAGGCCA TGGCTTTGGT AGGCGATTCT TCGCAGGATC AAAGAGAAGA AGCCGTGGCC AGGCTTGAGG AGGAAAATGA TCCTCTCAAG CTTATATGTT GCGTGGATAT CTTCAATGAG GGCATCGATA TCCCTAAGCT CAGCCATGTA CTGCTCCTTA GGCCTACCCA GTCTTTTACA GTCTTTCTCC AGCAACTTGG CCGAGGGCTT CGAAAGATAC AGAGCGAAGA TGTGGAAAAG CATCTTGTGG TCATAGATTT TGTCGGTAAT TTTCGAACTG CACATGTAGC GCCCTTGGCC TTGGCCGGCT ATACCTCAAT TCAAGAATTT ACCCAGGATA GTGAAGGTTC AAAGGAAAGC AAACTTGATT TAAGCAATCC GCCTAAGGGA TGTTTTGTTT CACCTGATCT GGAAGTGCAA AGAATATGGG AGAGTAAGCT GAGGGAGATT GCTCCCATGT CCAGAGCAGA GCAGTTACGA GCCCTTTATG ACGAGGTGGT ACAGGATTTA GGGCTTATTT CTCCAGGACT TTGTGAATTT TATGCCGATC CCCAAAAAGC AGACCCCCAT GCATTCATTA AGTATTTTGG AAGCTGGATT AAGACCAAAA AAGCATTTAA AGACCTGTTG GATTTTGAAC AGAACTTATT AGGAACTCAA GGCGAGTCGT TTCTGGAGTA TCTGGAAAAA GATTTGAACC CGGTCAAGTC TTATAAGATG GTTGTTCTCA AAACATTGCT CTCATTTGAG GGAGTCTCAT GGAATGTCTC TGAAATTGCT CAAGGGTTTT TAAGTTATTA CCTTAATCAC CCTGAATATC TATCAGATTA TGATGATTTG GATAGGAAAG AGAACCCAGA GGAGATTTCT CTGCAAAGAG TAGAAAGGCA TATCATGAAT ATGCCTTTGA AGTACTTGAG CAATAAAGGC AAAGACTGGT TTGTATTAGA TAGAGAAAAA AAAGTCTTTT CAGTAAAAAA TGATCTTATT GATTATTGGA GTGCAGCATT CTATAAGCAA CTTATGCTCG ATAGAGCTGA TTATGCATTA GCCAGATATT TTTATCGGAA AACACAATAG
|
Protein sequence | MPNDSGSHPI YYHKLVRDKI PSIIRDNDHQ PTVLSLSGQD LTQAASQKLL EEAYELFTEV QVGEKPSVLK ESADVLEVVL TILKQLGYSF DDLISEMELR REQRGGFEQG LFLESVDGQF LSHRLQQSPS YVCSHLEIDS LLDLFRGEME RSDKVWIASA FYSPAITNIL ISEFERFICK GGEARVILST MNGFIKPEYL THLRDHVPYL NVKVFHPPDI PFHIQPKRDF HVKAYIFKHR TGKGSAIIGS SNLSQGGFSD NIEWNYYSAG EINLPFENKQ TPWEKIVHEF ESLWANECVH VTDDFLAGYR KRHRDVFQER EKPSDAYGSE EQSPVQPGIG ADSAAAYGKR KQSKSESESV SPNIAQGEAL EGLLKLRNRK ARSGAVIAAT GVGKTYLAAF DFIQSGKDKC LFIAHREDIL RKAKESFSHV LGPEGLEIFS GRSKEISHGS RAVFAMIQTL GRQDNMERFH PEEFDYIVMD EFHHAMAATY RRVLDYFQPD FLLGLTATPE RMDGRDVLWL CDYNIAYEMR LFQAIDKELL APFQYFAVHD PTDYAQISWK RTDYDQEELT KALANDTRTT IIANNLKKFL PYQGKIKALA FCSSVDHAQY TAARLTQEHD FEAMALVGDS SQDQREEAVA RLEEENDPLK LICCVDIFNE GIDIPKLSHV LLLRPTQSFT VFLQQLGRGL RKIQSEDVEK HLVVIDFVGN FRTAHVAPLA LAGYTSIQEF TQDSEGSKES KLDLSNPPKG CFVSPDLEVQ RIWESKLREI APMSRAEQLR ALYDEVVQDL GLISPGLCEF YADPQKADPH AFIKYFGSWI KTKKAFKDLL DFEQNLLGTQ GESFLEYLEK DLNPVKSYKM VVLKTLLSFE GVSWNVSEIA QGFLSYYLNH PEYLSDYDDL DRKENPEEIS LQRVERHIMN MPLKYLSNKG KDWFVLDREK KVFSVKNDLI DYWSAAFYKQ LMLDRADYAL ARYFYRKTQ
|
| |