Gene Dret_1985 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_1985 
Symbol 
ID8419830 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp2278772 
End bp2279899 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content58% 
IMG OID645038573 
ProductHPr kinase 
Protein accessionYP_003198847 
Protein GI258406105 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCATTCAG TAACTGAAAT CATCCAATCC ATACGCAAAA ACCATCCACC GACGCATACC 
CTGAGCTTGC AATTCGAAGA CCTCCCTTTG GCCGTGCATT GCAACAGCTT GGAACTGTAT
ACCTTTCTCG CAGAATACCT CCACACTTTC AGAGCCCCGT TTTCGTCGCG CACGACCACC
AGCATCACCG TGCACGAAGG CGATCCCGGA CTCCCGGAAT TGCCCTGGAC GGTCAAACCT
CCCGAGCCGG GGAAGGCCAA AATCAAAGAG GAATTTGCCG ACCTGCCCGA CGGGCGCCTG
ATCCGCAAGC GGCAAACCGG CGTCGTCTTT GCCCTCGCTG ACCAGGAACA TCTGGCCGTG
GGCCCGGTCA CCGACAACCC GAACCAGATC ATCAATTTCA TCAACAACCG CTTTATCCAG
CACAAGCTCT GCCGCAGTTG TCTGCTTGGC CACGCCGCCG GGATTAGCCA CAACGGCCGG
GGCATGGCTC TGGCCGGATT CTCCGGGATG GGCAAATCCA CTTTGGCCCT GCACCTTATG
AGCAGTGGCT GTACGTTTGT CTCCAACGAC CGCATCATGG TCGAGGCCGA TACGCAGCGG
CTGACCATGT ACGGCGTGGC CAAACATCCC CGCATCAACC CGGGGACGGC CCTGCACAAC
CCTGATCTCG CCGGACTCAT TCCCGAAGAA AAGCGGGAAG CGCTTGCGAA GCGCGGCGAT
CTCTGGGAAT TGGAAGACAA ATACGACGCC CCGATTGAGA CCTTTTTCGG TCCCGATCGC
TTCGTCCTGG GCGCTCCGAT GGATATATTG ATCTTGCTCA ATTGGGCCCA CGACGACCAG
CCGACCCATT TCCAGAAAGT TGACCTCCAT GAACGCACGG ATCTCTTACC GGCTTTCCAG
AAGGCTCCCG GGCTGTTCTT CTGGCCCAGT GGCAACGGCG ATTGCCGGAT TATCCGGCCT
TCGCAAAGCA ACTACCTGGG CTACCTTTCC AAATGCCACG TCTATGAGAT CAGCGGCGGC
GTCGACTTTG CCGCAGCCAC CGAATTCTGT CTCGATATTT TGAACAATGC CCCGCTGCGC
TCGTCCAGGG AGTACGCATG TCCGCAACCG GCACCCGCAT CACTGTAA
 
Protein sequence
MHSVTEIIQS IRKNHPPTHT LSLQFEDLPL AVHCNSLELY TFLAEYLHTF RAPFSSRTTT 
SITVHEGDPG LPELPWTVKP PEPGKAKIKE EFADLPDGRL IRKRQTGVVF ALADQEHLAV
GPVTDNPNQI INFINNRFIQ HKLCRSCLLG HAAGISHNGR GMALAGFSGM GKSTLALHLM
SSGCTFVSND RIMVEADTQR LTMYGVAKHP RINPGTALHN PDLAGLIPEE KREALAKRGD
LWELEDKYDA PIETFFGPDR FVLGAPMDIL ILLNWAHDDQ PTHFQKVDLH ERTDLLPAFQ
KAPGLFFWPS GNGDCRIIRP SQSNYLGYLS KCHVYEISGG VDFAAATEFC LDILNNAPLR
SSREYACPQP APASL