Gene Information |
Locus tag | Dret_1633 |
Symbol | |
ID | 8419464 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfohalobium retbaense DSM 5692 |
Kingdom | Bacteria |
Replicon accession | NC_013223 |
Strand | - |
Start bp | 1881291 |
End bp | 1884386 |
Gene Length | 3096 bp |
Protein Length | 1031 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 645038207 |
Product | tetratricopeptide TPR_4 |
Protein accession | YP_003198495 |
Protein GI | 258405753 |
COG category | [K] Transcription |
COG ID | [COG2909] ATP-dependent transcriptional regulator |
TIGRFAM ID | |
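The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the protein length excludes a single terminal stop codon. Neither convention is stated in the record, although both agree with the reported values. The function names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 1881291
end_bp = 1884386
gene_length_bp = 3096
protein_length_aa = 1031


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate interval (assumed convention)."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons in a CDS, excluding one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = coding_codons(computed_length)

# Both comparisons hold for the values in this record.
print(computed_length == gene_length_bp, computed_protein == protein_length_aa)
```

Because the gene lies on the minus strand, the coding sequence reads from the end coordinate back toward the start coordinate on the replicon; the length calculation is unaffected by strand.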
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.720693 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGTGCCCGC AGGCTACCAA TCAGGCGAAA TTTGCCCGGC CGGCCCTGTC CCAGGTCGTG CCCCGGGCTC GCCTTTTTGA CCGCCTCAGG GAGCATGGGC GGTCCTCTTT TCTTTGGATT GCCGCCCCGG CAGGCTTCGG CAAGACTACC CTTGTCGCGA GCTATCTCGA GGCGCAGGAG GTCCCATCTG TCTGGTTCCG TCTGGACGGT GGGGACGGGG ATCCAAATTC CTTTTTCCTG ACTCTGGAGC AGGCGGTGCG GTTGGTGTTT CCCGATGCGC CGGCATGTGT CGGGCCTCGC TTTGAACAGC CCCTGTCTTC TTTGTCCCGG AATTTTTTTT CGCACCTCTC CCTGGTCTGC GCGCCCCAGA CCTATCTCGT CTTCGACAAT GCTCATGAAC TCGAGGACAG CCACGAATTT TGGGAAATGC TCAGCACCGG CCTTGCCTGC GCGGGTCCAG CCCTGCGGGT GGTGTTTATG AGCCGCTCCA GCCCCCCGGA TCCGTTTGTC ACTCTTCTTG CCAGTGACCG GCTCCGGGAA CTGCCGGCCG AGGAACTCCG CCTCAATCGG GAGGAAGTGG TGGAAATGGT CAGACGGGAC CCGCCCCAGG CTGGTATCGG CAGTGCACTC ATTCCCAATC TTCTTGCGGC CTGCAACGGG TGGGCGGCCT GTTTGCGGCT CGTTCTGCGC CATTTGCGGT TTGCGCCGTT ATCGGCCCAA TGGGTTGATC TGGACCGGCC GCCACCTGAA TTGGTGACCT ATTTTTCGAC AGAACTGTTC CAGGCCATCG ACCCTGATTT ACGCTGCTTT CTCCTCAATC TGGCGGTTTT TCCCTGGTTT GACGCTGACA TGGCCACACG TCTGACTGGG GTGGCGGAGG CCGCGTCCAT TTTGGAGCGG ATGGCCGGGG AGAGTTTTTT CACCTCAGTG CACCGTTCGG CCCCGGATGC GCCTTCCATC TATATTTTCC ATTCCCTGTT CCGGAATTTT CTGCGCGTCG AGGCCAGAAA GATCTTTTCT TCTGAACAGT GGCAGGACCT CTTCCAGCGC GCCGGGAGGC TGTTGGAAGA GGCTGGGGAT GCAGAAGCAG CCATCAACGC CTACAGTGCA GCCGGGGACA CCGAAGCCTG GGTGGCGACC GCCTTGCGAC ACGCTCCGTC CTTACTCGCC TCGGGGCGGC ACAAGAATCT GGAGCGGTGG CTGGCTCCCT TACCGGCGCA GTGCCAAAAG CGCTGGCCGT GGACGGTTTT CTGGCTGGCC GGCAGTCGAT TTCCGGAATC CATTGCCCAG AGCAGGGACC TCTACGCCCA GGCCCTGGAT CTGTTTCAGG ATCAGGGAGA CCACCGGGGG ATGCTTCTGG CCTGGGCGGG GCGGATGGCC TGTCTCATCC ATGGCCAGGC GCCGCGGCCC GCTGTCGTTT CGGCTTTGGA GCTGTTCGAC ACCGACCTGA CCACGTTCTA TTCCCTGGAG AAAGACGAAT GGACCCGGGC CGTGGTCGCT GCAGCGGTGT TTCTGGCCAA GGTCCTCTGT GCCCCTGAAC CGGAGACCAT CCGGTGGGGT GAGACCGCCC TCGGCTTCGC AGAGCGTCTT CAGGCAAAAG ACCTGCTCAT TCACGTCTGT TTCGGCGCTG TGTTGTATTT TTTGCCTCGC GGGGAAAACG AACGGCTGGC GGGCCTGATC ACGCAGGTAC GATTTCAGTG CTCCGTGCCA GCGTTGCCGG CGTTGCCAGC TGCAACGCTG CTGGCCACGG ACATGCTGGG ACGGATGACC CAGGCCCGGG AGGAAGAGGC CCTGGCCTTG 
GCGACACAAG GATTGCAGCT TGTGCGGCAA AGTGGGGTCA CCCTTTTTGA GGCCTTTTTT TTGGGCTATA GTGCGGCCTG TGCCATGAAT ACTGGAGATC TTGAGGCGGC CGAGCGTTTT TTGGACTCGA TGGCCAAGGC TTCGCTTGCG ACCCGGCGGT GGGACCGTTC ACTGTATTGT TTTCTCAGCG CCCGGTTGTC TCTGCTCCGG GGAGCCTATG CCGATGCTGA ACGGTGGGTT CGTCAAGTGC TCGACCGGGA GGAAGACGAG GGGCCCCTTG GCACCCTTTG TCTGAGTCTG GTGGGCGCTG CCCAGATCGC TTTTTGTCTG GGATGGAGGG AGCAGGCGGA AGAACGACTC GCCCAGGCCG GCGTTTTGGC CCGCCAGTAT CAGTTCACGC TTGCCGAATG GGCTGCGGCA ATGGCGCAGG CGTATTGGGC GCTGGCCGAG GGCGACCGTG ACAGCTGCCG CGCCGTATTG AAGCCGGCGC TCGCTCTTGG GCGGCGGAAC ATGCTGTACG TCACGCTTGT CGATGTCCCG GCCCAACTCG CCGAGGTCTG TGTGGCGGCT TTGGAGGAGG GGATGGAAGT CCCGTATGTC CAACGCCTGG TCAGACAACG GCGGCTGCTC CCGTCCCGTC CGCCTCTGCA TGTCCAGAAC TGGCCCTGGC GGTACCGGAT TTTTACCCTG GGCGTGTTTG CCGTGCAGCG AAATGAGCGG GAAATGCGTT TCGGCGGAAA AGCGCCGCAG CAAACCCTCA ATGTGTTGAA AGCCTTGATC GCCTGTGGGC GTGAACAGGT TCCCTGTCAC CTGCTCAGTG ATCTGCTCTG GGAGGAAGCG CCAGGTGACG CCGCGCACAA CGCCTTGAAG ACCCAGGTCC ACCGACTGCG CCGGCTGTTG GGCGACCCCA AGGTCCTGAC CTGGCAGGGT AAGCGGCTGA GTCTGGACCC CTCACTGGTC TGGGTGGATG TCGGGGCCCT GAGCAGCATC TTTGAGCGGT TCGGAAGCGG GGAACTGGCC CTCGCCCCAA CCGATGTCGT GCGGCTGGTG TTGGATCTCT ATAAAGGACC GTTTTTACCC GAGATCGAGG AGAGTTGGGT CTTTGAGCCA AGGCGGAGGT TTGCGGCGTT GTTCAGCCGC GTTGTGACCG CACTCGGCCG GAGACTGGAG GAGACGGGGC AATGGATGGA GGCGCTTTTT CTCTATAGTC AGGCCCAGGA ACGGGAGGCG AATGGGGGGA GGTGGGACAA GCGGATCGAG TACGTACGGG GCAAACTCAA CGGGGGAGCG CTTTGA
Protein sequence | MCPQATNQAK FARPALSQVV PRARLFDRLR EHGRSSFLWI AAPAGFGKTT LVASYLEAQE VPSVWFRLDG GDGDPNSFFL TLEQAVRLVF PDAPACVGPR FEQPLSSLSR NFFSHLSLVC APQTYLVFDN AHELEDSHEF WEMLSTGLAC AGPALRVVFM SRSSPPDPFV TLLASDRLRE LPAEELRLNR EEVVEMVRRD PPQAGIGSAL IPNLLAACNG WAACLRLVLR HLRFAPLSAQ WVDLDRPPPE LVTYFSTELF QAIDPDLRCF LLNLAVFPWF DADMATRLTG VAEAASILER MAGESFFTSV HRSAPDAPSI YIFHSLFRNF LRVEARKIFS SEQWQDLFQR AGRLLEEAGD AEAAINAYSA AGDTEAWVAT ALRHAPSLLA SGRHKNLERW LAPLPAQCQK RWPWTVFWLA GSRFPESIAQ SRDLYAQALD LFQDQGDHRG MLLAWAGRMA CLIHGQAPRP AVVSALELFD TDLTTFYSLE KDEWTRAVVA AAVFLAKVLC APEPETIRWG ETALGFAERL QAKDLLIHVC FGAVLYFLPR GENERLAGLI TQVRFQCSVP ALPALPAATL LATDMLGRMT QAREEEALAL ATQGLQLVRQ SGVTLFEAFF LGYSAACAMN TGDLEAAERF LDSMAKASLA TRRWDRSLYC FLSARLSLLR GAYADAERWV RQVLDREEDE GPLGTLCLSL VGAAQIAFCL GWREQAEERL AQAGVLARQY QFTLAEWAAA MAQAYWALAE GDRDSCRAVL KPALALGRRN MLYVTLVDVP AQLAEVCVAA LEEGMEVPYV QRLVRQRRLL PSRPPLHVQN WPWRYRIFTL GVFAVQRNER EMRFGGKAPQ QTLNVLKALI ACGREQVPCH LLSDLLWEEA PGDAAHNALK TQVHRLRRLL GDPKVLTWQG KRLSLDPSLV WVDVGALSSI FERFGSGELA LAPTDVVRLV LDLYKGPFLP EIEESWVFEP RRRFAALFSR VVTALGRRLE ETGQWMEALF LYSQAQEREA NGGRWDKRIE YVRGKLNGGA L
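The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup together with a GC-content calculation. The helper names are hypothetical, and the sample uses only the first three blocks of the gene sequence; pasting the full sequence in its place would allow a comparison with the GC content reported in the table.

```python
def clean_sequence(blocked: str) -> str:
    """Remove all whitespace from a sequence printed in blocks and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First three blocks of the gene sequence above, used as a small sample.
sample = "ATGTGCCCGC AGGCTACCAA TCAGGCGAAA"
dna = clean_sequence(sample)
print(len(dna), round(gc_percent(dna), 1))
```

The same `clean_sequence` step applies to the protein sequence, whose joined length can then be compared with the protein length given in the table.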