Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dret_1565 |
Symbol | |
ID | 8419395 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfohalobium retbaense DSM 5692 |
Kingdom | Bacteria |
Replicon accession | NC_013223 |
Strand | - |
Start bp | 1811157 |
End bp | 1814063 |
Gene Length | 2907 bp |
Protein Length | 968 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 645038138 |
Product | Peptidase M16C associated domain protein |
Protein accession | YP_003198427 |
Protein GI | 258405685 |
COG category | [R] General function prediction only |
COG ID | [COG1026] Predicted Zn-dependent peptidases, insulinase-like |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0498145 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.0547201 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGCCT CCACAGGATT TACCTGCCTC CGGGACACCT ACGTCGACGA GATCCGGAGC CAGTGCCGAG TGTACCGCCA CGACCAGACC GGTGCGGAGG TTTTGTCCGT CGAGAACCAG GACACGAACA AGGTTTTCGG CATCAGCTTC CGGACACCGC CCAAGGATTC GACCGGCGTG GCCCATATCC TGGAGCATTC CGTTCTCTGC GGTTCCCGGA AATATCCGCT CAAGGAACCC TTTGTGGAGT TGCTCAAAGG GTCCCTGCAG ACCTTTCTCA ATGCCATGAC CTTTCCGGAC AAGACCTGTT ATCCGGTGGC CAGTCAGAAC ACCCAGGATT TTTACAACCT GATCGACGTC TATTTGGACG CGGTTTTCCA TCCCCGTATC ACGGAGAATA TTTTCCGCCA GGAAGGGTGG CATTACGACC TGGAATCCCC GGACGACACC ATGCGTCTCA AGGGCGTGGT CTACAACGAG ATGAAAGGGG CCTATTCCTC ACCGGACGGA TTGCTCTCCG AGTATTCGCA GCAGATCCTG TTTCCGGATA CGACCTACGG ACTCGATTCC GGGGGCAATC CGTCACACAT CCCGGATCTG ACTTTCGAGC AGTTCCTTGA CTTCCACCGG ACCTACTACC ATCCCTCCAA CGCCCGGATC TTTTTCTACG GCGACGACGA TCCCGAGCAG CGACTGCGCC TTATTGACGC GGCGTTGCAG GAGTACGCGG CACAAGAGGT CGATTCTGCG GTCGGCGATC AGCCTTACTG GCAGAGCCCG ACACGCGAAG AACGGTTCTA TGCCGCGGGA CCGGATTCGG ACAACAAGAC CATGCTCACC GTCAACTGGC TGCTGGGACC GGTCAGTGAT ATCAAGACCA ATCTGACACT CCAGATCCTG GAGGACATCC TGCTCGGCGC ACCCGGAGCC CCGCTACGCA AAGCGCTTTT GGACTCCGGC TACGGCGAGG ATATTGCCGG GGTGGGGCTG GAGGAAGATC TCAAACAGAT GTTCTTCTCC ACCGGACTCA AGGGGGTTGC CCCGGACAAG GCCGAGACAG TGGAAACGCT TTTGCTGGAG ACCCTCGAGC GGCTGGCCGA CGAAGGGCTC GATCCGGAAG CGGTGCAGGC CGGACTGAAC ACGGTCGAAT TCGAACTCCG GGAGAACAAC TCCGGGAGTC TGCCGCGCGG ACTGCTGGTC ATGATCCGCA GTCTGACCAC CTGGCTGCAC GACGGCGATC CTCTGGCCCT GCTCCAGTTT GATGGTCCGC TACAGGAAAT CAAGGACGAA CTGGCCGAGG GCAAGCCGGT TTTTGAAGAG AGTATCCGCC GGTATTTTCT TGATAATATG CACCGCAGTA CCCTGATCCT CAAGCCGGAT TCCGGCTTGA GCGAACGGAT GGCGGCGGAA GAGGCTGAAC GGCTCGCCGC AGCCCGGGAG GCCTTGGGTC CTGAGGGGCT GGAGCGGGCC GCGGAACAGG CCCGGGAACT GAAGAAAGAG CAGGAGCAGC CCGATCCTCC AGAGGCCTTG GCCCGTCTCC CGCGGCTGAC GCGCGAGGAT CTCGACCCCC AGATCGAACG GCTGCCGGCC TCGTTCCAGG TCATGCACGG CGTGCCCTGT CTCGGGCACG GCCTGGACTG TAACGGTATT GTCTACGTCG ATCTCGGTTT TGACATCCGG GGCGTCGCGG AGGCGGATCT CGGATTTGTC TCTTTGTTGG GGCGGGCCCT GGTGGAGACC GGAACCGCCA GCGAAGATTA TGTCCGGCTT CTGCAGCGCA TCCGGCAGCA CACCGGCGGT ATTCATGCCC AGACCGTAAC CTTGACCCAA CTGGAAAGTG ACGCTCCGCG AGCCCTGCTG TTCGTCCGCG GCAAGGTGGT GGCCTCGAAA CTGGAGCAGT TTTGGGATTT GTGCTCAGAT ATTCTCTGTC GCCCGCTGCT CGAGGACAAG GACCGCTTTC GCCAGATTGT GCTTGAGGAA AAAGCCCATC TCGAACAGGC GCTTATTCCG GCAGGACACC AACTCGTCAA TTCCCGGCTG CGGGCTTCAT TCACCCAGGC CGACCACAGC GCTGAACAGA TGGGCGGTGT GGAATACCTC TTTTTCCTGC GCCAGCTCCT CGAGCGCATC GAGACCGAGT GGGACGAGGT GGCGTCGACC CTGCGCCGGG TCTACGGCCA GGTCATCCGG CGTCAGGGAC TGGTGGCAAA TATCACCTCG GATGAAGAGC ATATCGACGC TGCCCGGCCG GGACTCTGGC AATTGGTGCA GGCTTTGCCC GAGGCGCAGG TCGAGCCGAG TCAATGGCAG GTGCCGCAAT GGGAGGGCAG CGAGGCCCTG ACCCTGCCGG CGCAAGTCAA TTATGCCGGC AAGGCGGTCA GCCTGTCGGA GCACGACCAG ACCATCACCG GAGGGGACGT GGTCGCCTGC CGCTATCTGC GCACGAGTTG GCTGTGGGAC AAGATTCGGG TTCAGGGCGG GGCCTACGGG GCGTTCAGCC TGTTGGACCG CTATTCCGGG GTCCTGTCGA TGGTCTCCTA CCGTGATCCC AATGTCACGG CCACGCTCAA GGTCTTTGAC CAGGCGGGTG ACTTCGTTCG CGGCCTGGAA CTTGATGCCG GGGAGGTGGA CAAGGCGGTC GTTGGCGCCA TCGGGGACAT GGACAAGTAT CAATTGCCGG ACGCCAAGGG CTTTCAGGCC ATGCTGCGTT TTCTGGCCGG GGAAGGGGAT GACCAGCGCC AGGAATTGCG CGACGCGATT CTGGCGACCA CGGCGGATGA GTTCCGCGGC TTTGCCGAGA AACTCGATCT CCTCGCTAGC CAGGGCCGCA TTGCCGTGCT TGGCGGAAGC GACCGGCTGG AGAATGAATC CGATGTCGGC TTTGTCCGGC TCACGAAGCT CTTGTAG
|
Protein sequence | MNASTGFTCL RDTYVDEIRS QCRVYRHDQT GAEVLSVENQ DTNKVFGISF RTPPKDSTGV AHILEHSVLC GSRKYPLKEP FVELLKGSLQ TFLNAMTFPD KTCYPVASQN TQDFYNLIDV YLDAVFHPRI TENIFRQEGW HYDLESPDDT MRLKGVVYNE MKGAYSSPDG LLSEYSQQIL FPDTTYGLDS GGNPSHIPDL TFEQFLDFHR TYYHPSNARI FFYGDDDPEQ RLRLIDAALQ EYAAQEVDSA VGDQPYWQSP TREERFYAAG PDSDNKTMLT VNWLLGPVSD IKTNLTLQIL EDILLGAPGA PLRKALLDSG YGEDIAGVGL EEDLKQMFFS TGLKGVAPDK AETVETLLLE TLERLADEGL DPEAVQAGLN TVEFELRENN SGSLPRGLLV MIRSLTTWLH DGDPLALLQF DGPLQEIKDE LAEGKPVFEE SIRRYFLDNM HRSTLILKPD SGLSERMAAE EAERLAAARE ALGPEGLERA AEQARELKKE QEQPDPPEAL ARLPRLTRED LDPQIERLPA SFQVMHGVPC LGHGLDCNGI VYVDLGFDIR GVAEADLGFV SLLGRALVET GTASEDYVRL LQRIRQHTGG IHAQTVTLTQ LESDAPRALL FVRGKVVASK LEQFWDLCSD ILCRPLLEDK DRFRQIVLEE KAHLEQALIP AGHQLVNSRL RASFTQADHS AEQMGGVEYL FFLRQLLERI ETEWDEVAST LRRVYGQVIR RQGLVANITS DEEHIDAARP GLWQLVQALP EAQVEPSQWQ VPQWEGSEAL TLPAQVNYAG KAVSLSEHDQ TITGGDVVAC RYLRTSWLWD KIRVQGGAYG AFSLLDRYSG VLSMVSYRDP NVTATLKVFD QAGDFVRGLE LDAGEVDKAV VGAIGDMDKY QLPDAKGFQA MLRFLAGEGD DQRQELRDAI LATTADEFRG FAEKLDLLAS QGRIAVLGGS DRLENESDVG FVRLTKLL
|
| |