Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dret_2544 |
Symbol | |
ID | 8420401 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfohalobium retbaense DSM 5692 |
Kingdom | Bacteria |
Replicon accession | NC_013224 |
Strand | + |
Start bp | 35877 |
End bp | 39185 |
Gene Length | 3309 bp |
Protein Length | 1102 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 645039141 |
Product | hypothetical protein |
Protein accession | YP_003199398 |
Protein GI | 258406657 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 47 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 113 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGGTTT CAGGCCTTGT CGTCCAATTC AAGACCGCCT ATTCTTCCAA AAAAAAGAGT TGGCAGACGT TGTCCCTCGA TAAAGCCGAA CAATCTATCT TCGGTTCCGT GGGGATTGAT AAGCTGGCCT CGGGGGAAGC CGGAGTATGG CTCACACCCC GAGCACACGA CAGCGCCGGA GACACCTGGC ACATGGCGTG GTGGGACGTG GAACACCCCG ACGAACACCA CACCAGTATC GAGGCCAATA CCCGTACCGC CCAGGACCTT TTCATCCAAC TGGATGGACT CGGCCTGGCG CATGGACTGT CTGTCGTTTT ATCCGGAAAG GGGTTCCGTT TCCTCTGGCC GTTTGTAATC CCATCTGACT ACACCAAGGC CTACCGGGCC ATGATTACCG ACAAGGGCCA ATGGGTAGGA CTCGATCCCT CGCCGCACAT GGCTCCAAAC CGGTGGTTCC GCTTTCTTGG GTACCGGGGG CACCGCAAGC AAGACACCAA CCCCAAAGAT CGCCATATCC ACCTTTTGGA GCACCCTGCC CACCTCCTGG ATCTCACAGA GACAACGTAT CTGGAGTTAG TGCAAGGAAA ACCGGATCCG GCCACATTTC GCCCTTGGAT GCGGAGGCTT TTGCCCCACA CCACAGAGCC GCCACCGGAA TGGGTGGAGT TACTGAAAAA ATACAACGAC ATTCTGCGGC TACGATCGCA TATCGTGAAG CTGAATTTCC CCAAGAAGCC GAAACCTAGA GGAGTGGACT GGGCGCAAAT AGAAACCTTC CTTACCCAAA AGGGGATCCG TACCTGGGAC ATGCAGGACA ACGGGGAAAT CTTCTATCGC TTAACCGAAT GCCCCATGTG TGGCCGGAGG GATGGTAATC CCTGGATGAC GCAAGCCGGC CGGCTCAAAT GCTTCCACGC AAACACCTGC CCGGCTGGAG AAGAACACAC CGACCTGCAG GGGCAGACCT TCAAAAAAGG CTTGCCGCCG GAAAAGTGGG TCGAAGGATA CCAGGAGATA GAAGTAAGCC CTCCCGTGCA AGAAGACCAA CGGGAGAAGA CGGACGTCAA AACCGCCAGA GAACGCATCC GGGACGCTCT GCGCTCTGAC GAAGACGTAT TGATTCGGGC TGCCCCAGGG GTGGGCAAGA CACACACCAC CTTGGAAGAG ATCCTGCCTC AATGCCGGGA TCGACTCGTT TTGTTCACAG TTCCCAAAGG GGAGAACGTC GCCGAGATAT ACGAAAAGGC ATTGAGCCTG GCGCCGGAAG GCGTTGAAAT CCGCAAGATC AGGGGACGCA GGCGAGAAGA AAACGGTTCA GGAACTTTGG ATTTCAACCC TCCACCGGAG GGGATCTGCT ACAACATGGA CTATGTAGAA GAAGTGGCCA ATTGGGGCTA CTCTCCAGGG TTGATTTGCT GCACCGGGTG CGAGCACCAA AAAAATTGCC CCTACCAGGA ACAATTCAAG TCCCTCCCGA AAACCGGACT TGTCATTGCA GCGCATGAAA GCGCTGTTTC CCTGCCTAAA AAACGCCATT TTGATCTCTG GGTAATCGAT GAAAACCCTG TGGCGTCTCT TCTTCAAACC AAGACCGTTT CGCCCGGGGC TCTTTCACAA ATCAGGGCCA AACTACCCCG GAGGTCTGAA TTGCCTTTGG ATACAATCAA GGCTCAGGGC GAAGGCCTCT TGAAGTATCT CGCAGGAAAC CAACACGAAG GCCGGATCTA TGCCACTACG CCGCCAGCCG AATGGAAGAA CACGGAAAGT GTCTGGGAGC TCGGAGGAAT CGAAAGCCAT AAAACACCGT TTGCCGAAGA TTTGAGCTGC TTCGACCAAC TCGAAGAAGA AAACCTCAAG CAGTGGCAGA AGCGGCTGTA CTACAGCGAA AAGGTAAATT TTACCGCTTT GGAATGGCTC TGGACCGCCA CAGGACAACA GGCAGGGGTG GCCTATATCA AGGCCCGGGC AGACCGGAAG CACCCCATTT CCTATGTCCT GCACCAAACC AAAGCCCCAG GCATGAGGCG GGCAAACCAG GATGGCAGCG AAACCAAAAC CCGAATCGTG GCTCTGGACG GGACCGGCAA CAAACAAGAA CTGGAAGCGC TCTTCCCAAA TCGTTCCTTT GCCGAGGTGT CGGCTGATGT GGATCTTCCG GGCCGCAGGG TCCACCTTGA ATACAACCTG AGCAAAACCA CAGTCTGTGG GGCAGAGAAG TACAAAAAAA CCCCGATGGC GCCGCAGCAC GTCAAAGCCA AGCTCAAAGA GGGGCTCAAA TGCCTTCGTA CCGAGGAAAA GCGTGTGCTG TTGGTGACGT TCAAGGACGC CAAGGAAACA GTGTTGCGGG CAGCGCAGAA CCTCGATCCA GGTCGCACCT TTGAGGTGAC ACATTTTTGG GGCAACCGGG GGCTGAACTG CTTCCAGGAA TGCGACGCCG TTATCTGCTT CGGCTCCCCT CGGGTGGCAC CGCACAGCGT CAAAGACATG GCCTCCAGTC TGTTCGACGA TACGGAGCAG CAGAAGGCCT GGACCGAACA ACAGGGGCAC CGGGACGTGG TGCAGTCCAT TCACCGGATC CGGCCAATTT ACTCCAGAAA GTCCGTGATC GTGATGGGCG ATTTTTGGCC GGAGCAGTTA GGGACACCAC AATTCCGGAT TCGGGCTTAT CAAAAGAATG GCGCCTTTGA CCTGGCCTTG GAACGGCTCA AGCTCGTAGC GCAGACCTAC GGCTTTGTGA CCCGGGAGCT TGCCTGTTTA CACGGGGTGT TCTGCCGGGT GGATACCCAG AGCATAGCTA AGTGGATGGA ACTCCAGAAG CGCTTCCGGG AAAAGCTCGA AGAAAGTCCA TCTGAATTTG TGTTCTTTCC TATAAATATA TTTCTTATAG GAAACCACAC AAATAAAAAT GGACTTTTGG ACCCGATCAA GCTGAAGGAC ACTCACGCTT GGAATGACCT GGTGTCAGCC CTCGAGCTTG AGCTTGGCTT TCCGTCTCTG ACTGAGCGGC AAAGTACTGG GGCAGGAAGA CCGAGCCGTG GAGTGGGGAC TGTCAGCGCT GCACGGCGGT TTTACCACGC TCTGGGTGTG GTCGTTTTTG ATGAATCCCT CTGGTCCGGC CAGGAGCATT TTTTCGAGAT CCCCGTAGGG AAACTCAAAA GGCGGGTGCT GCCCGGGCAG GGCCTTTTTG AAGCGCGGAG GGCCAAGTCC GTACTGTTGG GATTCACCTC AGAAATCCGA CACCGGCAAC AAAAGCCACC GATAGAGGTT TTCAGAGATC AACAGTCGTC CTTGGCAGCT GTGGTGTAA
|
Protein sequence | MRVSGLVVQF KTAYSSKKKS WQTLSLDKAE QSIFGSVGID KLASGEAGVW LTPRAHDSAG DTWHMAWWDV EHPDEHHTSI EANTRTAQDL FIQLDGLGLA HGLSVVLSGK GFRFLWPFVI PSDYTKAYRA MITDKGQWVG LDPSPHMAPN RWFRFLGYRG HRKQDTNPKD RHIHLLEHPA HLLDLTETTY LELVQGKPDP ATFRPWMRRL LPHTTEPPPE WVELLKKYND ILRLRSHIVK LNFPKKPKPR GVDWAQIETF LTQKGIRTWD MQDNGEIFYR LTECPMCGRR DGNPWMTQAG RLKCFHANTC PAGEEHTDLQ GQTFKKGLPP EKWVEGYQEI EVSPPVQEDQ REKTDVKTAR ERIRDALRSD EDVLIRAAPG VGKTHTTLEE ILPQCRDRLV LFTVPKGENV AEIYEKALSL APEGVEIRKI RGRRREENGS GTLDFNPPPE GICYNMDYVE EVANWGYSPG LICCTGCEHQ KNCPYQEQFK SLPKTGLVIA AHESAVSLPK KRHFDLWVID ENPVASLLQT KTVSPGALSQ IRAKLPRRSE LPLDTIKAQG EGLLKYLAGN QHEGRIYATT PPAEWKNTES VWELGGIESH KTPFAEDLSC FDQLEEENLK QWQKRLYYSE KVNFTALEWL WTATGQQAGV AYIKARADRK HPISYVLHQT KAPGMRRANQ DGSETKTRIV ALDGTGNKQE LEALFPNRSF AEVSADVDLP GRRVHLEYNL SKTTVCGAEK YKKTPMAPQH VKAKLKEGLK CLRTEEKRVL LVTFKDAKET VLRAAQNLDP GRTFEVTHFW GNRGLNCFQE CDAVICFGSP RVAPHSVKDM ASSLFDDTEQ QKAWTEQQGH RDVVQSIHRI RPIYSRKSVI VMGDFWPEQL GTPQFRIRAY QKNGAFDLAL ERLKLVAQTY GFVTRELACL HGVFCRVDTQ SIAKWMELQK RFREKLEESP SEFVFFPINI FLIGNHTNKN GLLDPIKLKD THAWNDLVSA LELELGFPSL TERQSTGAGR PSRGVGTVSA ARRFYHALGV VVFDESLWSG QEHFFEIPVG KLKRRVLPGQ GLFEARRAKS VLLGFTSEIR HRQQKPPIEV FRDQQSSLAA VV
|
| |