Gene Dret_1009 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_1009 
Symbol 
ID8418832 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp1184079 
End bp1185788 
Gene Length1710 bp 
Protein Length569 aa 
Translation table11 
GC content56% 
IMG OID645037579 
Productmethyl-accepting chemotaxis sensory transducer with Cache sensor 
Protein accessionYP_003197875 
Protein GI258405133 
COG category[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0840] Methyl-accepting chemotaxis protein 
TIGRFAM ID[TIGR01168] Gram-positive signal peptide, YSIRK family 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.103849 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.0253995 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTTTTT CGCTTTCCAT CAATAAGAAG ATGTGGGTCA TCTTTGCTCT GGTGGTCGTG 
CTTTTTGCCG CGACCACGAT TTTTTCCTCT CTGGCCTTGA ACAAAACCAA GTCCATTGCA
CTGGAGACGA CAGCCGAAGA GATGTTCAAA GGTCAGAAGC AAAAACTGAA AGTTGGAACC
CACAGCATTG CCTTGGCGCT GGGTGAGCAA CTGGAGGGTG TTTCTGATAC AACAGAACGG
ATTCAGATGA TTCGGGAGGG GGTCGACAAG ATCCGTTTTG AGAAGGACAA ATCTGGCTAC
TATTTTGTCT ACAAAAATAC GACAAATGTC GCCCTGCCGA CGAAAAAAGA GCTTCAGGGA
GAGGATCTGG GGCAAGCCAA GGACAAAAAC GGGGTCTATT TTGTTCGGGA ACTCAACGAA
CAGGCCCAAG CCGGGGGCGG GTTTGTGGAG TATGTCTTTC CCAAGCCCGG GGCCGGTGAT
CAACCCAAGC TGGCCTATGC GGAAATGATT CCCGGTACCG ACATGTGGAT TGGGACCGGG
GTGTATCTGG ACAACATCGC CGCGGCAAAG AACAATCTCA ATGCGACCAT GTCCGCGTCG
ATTTCCAAAT GGAATATTCT CCGTTACGGC GCCTCGGCAG CCATCTTCTT GGTCATTATC
GGGGTTTTGT ACCTCATCAC CCGCAGTATC GTTCGGCCCT TGAGGCAGAC CATCGATGTT
TTGAACAACA GTTCGGAGAT GATGACCGCG TCCTCGGATG AAGTTTCTTC TTCGAGCCAG
TCTTTGGCTG AAGGGGCGAA CGAGCAAGCC TCCAGCCTGG AAGAGACTTC TTCTTCGCTT
GAAGAGATGT CCTCGCAGAC CCAGCAGAAC TCCAGCAACG CCAGCCAGGC CGAACAGACC
ATGCAGCAGA CGAAGACGGC CGTGGACACC GGGGTCGAGT CCATGAGCCG CATGGGCACG
GCCATCAATG CCATCAAGCA GTCCTCGGAA GAAACGTCCA AGATTATGAA GACCATCGAC
GACATCGCCT TCCAGACCAA CCTGTTGGCC TTGAATGCTG CCGTGGAAGC CGCTCGCGCC
GGAGAGGCTG GCAAAGGTTT TGCAGTTGTG GCTGAGGAAG TCCGCAGTCT GGCGCAGCGT
TCGGCTGAGG CGGCCAAAAA CACCGCTTCA CTGATTGAAG ACGCCCAATC CAACGCCAAT
CACGGCGTTC AGGTTGCTGA CGAGGTTTCC AGCAGCCTTG AGGAGATCCA GAAGAGCGCG
GATCAGGTCG GCATTCTGGT TGCCGAGATC GCTGCGGCGA GCAAAGAACA ATCCCAAGGG
ATCGAACAGA TCAATACGGC GGTGGCCGAA ATGGACAAGG TGGTCCAGAA AAACGCTTCG
GACTCTGAGG AAACCGCCAG CGCTTCTGAG GAATTGTCTG CGCAGGCTCA GGAATTGCAA
AATGCGGTGC TCGAATTGGT GGCCCTGTTG CGCGGCGGCG ACGGAGCGTT AGGCGGAAAC
GGTGCCCAAC CAAAAAGCAG CGGTTCCCAG ACCCAACAAC GCAGCCGCTC CCGCACCGGG
AGTTCCCAAC CGCGGCAGCG CCAGGCCGTT GGCCAACACC CCTCGCGGCA GCAAAGCCGC
GCCACTGCAG CCAAGCGGCA GCAGTCCCAA CAGGATATCC GCCCCGATGA GGTGATCCCC
CTCGACGACG ATTCGTTCAA CGATTTCTAG
 
Protein sequence
MRFSLSINKK MWVIFALVVV LFAATTIFSS LALNKTKSIA LETTAEEMFK GQKQKLKVGT 
HSIALALGEQ LEGVSDTTER IQMIREGVDK IRFEKDKSGY YFVYKNTTNV ALPTKKELQG
EDLGQAKDKN GVYFVRELNE QAQAGGGFVE YVFPKPGAGD QPKLAYAEMI PGTDMWIGTG
VYLDNIAAAK NNLNATMSAS ISKWNILRYG ASAAIFLVII GVLYLITRSI VRPLRQTIDV
LNNSSEMMTA SSDEVSSSSQ SLAEGANEQA SSLEETSSSL EEMSSQTQQN SSNASQAEQT
MQQTKTAVDT GVESMSRMGT AINAIKQSSE ETSKIMKTID DIAFQTNLLA LNAAVEAARA
GEAGKGFAVV AEEVRSLAQR SAEAAKNTAS LIEDAQSNAN HGVQVADEVS SSLEEIQKSA
DQVGILVAEI AAASKEQSQG IEQINTAVAE MDKVVQKNAS DSEETASASE ELSAQAQELQ
NAVLELVALL RGGDGALGGN GAQPKSSGSQ TQQRSRSRTG SSQPRQRQAV GQHPSRQQSR
ATAAKRQQSQ QDIRPDEVIP LDDDSFNDF