Gene Huta_1854 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHuta_1854 
Symbol 
ID8384145 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhabdus utahensis DSM 12940 
KingdomArchaea 
Replicon accessionNC_013158 
Strand
Start bp1860582 
End bp1861628 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content68% 
IMG OID644972923 
ProductAlcohol dehydrogenase GroES domain protein 
Protein accessionYP_003130757 
Protein GI257052924 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACATGC GCGCAGCCGT CTTTCGCGGC CCAGGCGAGA TCGCTATCGA AGACGTCCCG 
AAACCGGCAC TCGACGCGCC GACCGACGCC GTCGTCCGCG TGACCCACAC CGCGATCTGC
GGGTCGGATC TCTGGCCGTA CCGCGGGCAA GAGGACCGCG ACGTCCCTTC CCGTATCGGT
CACGAACCGA TGGGGATCGT CGAGGAAGTG GGCGAAGAAG TCCGCTCGGT TGCGCCCGGC
GATCGGGTGT TCGCGCCGTT CGTCACCAGC TGTGGCGAGT GTGAGTTCTG CCGGAAGGGA
CTCCACACTT CCTGTGTGAA CGGCGGGTTC TGGGGTGGCG AGGACGGCGG CGCACAGGGC
GAGTACGTCC GAGCGCGCCA CGCCGACGGC ACGCTCGTGC GCGTGCCCGA CCGCCATGCC
GACGACGAGG AGACGCTCCA GGCGATCCTC CCGCTGACGG ACGTCATGGG GACGGGCCAT
CACGCCGCGG TCAGTGCGGG CGTCGACGCC GGCTCGACGT GCATCGTCGT CGGCGACGGC
GCGGTTGGGC TGTGTGGCGT CCTCGCCGCC CGCCGGCTCG GTGCCGAACG GATCATCGCG
ATGGGCCACC ACGAGGACCG CCTCGCACTC GCCGAATCGT TCGGCGCGAC AGACACGATC
GCCGCGCGCG GCGAGGAGGC CGTCGAGCGT GCCCAGGAGC TCACGTACGG CGGCGCGAAC
CACGTCATGG AGTGTGTCGG ATCGACGGGT GCGATGAACA CCGCGATCGA GGTCTGTCGG
CCTGGCGGGA CCGTAGGGTA CGTGGGCGTC CCACACGGGA TCGAGGACGA CGGACTCGAC
ATCTTCGGAC TGTTCATGGA CAACATTTCG TTGAACGGGG GGATCGCGCC GGTCCGGGCC
TATGCGGACG AATTGCTCGC GGATGTCCTC GGTGGGACGC TCGACCCCTC GCCGATCTTT
ACGAAGACGG TCGACCTCGA CGGCGTGCCC GAGGGCTACC GGGCGATGGA TGAGCGCGAG
GCGATCAAGG TGTTGGTCAA GCCCTAA
 
Protein sequence
MDMRAAVFRG PGEIAIEDVP KPALDAPTDA VVRVTHTAIC GSDLWPYRGQ EDRDVPSRIG 
HEPMGIVEEV GEEVRSVAPG DRVFAPFVTS CGECEFCRKG LHTSCVNGGF WGGEDGGAQG
EYVRARHADG TLVRVPDRHA DDEETLQAIL PLTDVMGTGH HAAVSAGVDA GSTCIVVGDG
AVGLCGVLAA RRLGAERIIA MGHHEDRLAL AESFGATDTI AARGEEAVER AQELTYGGAN
HVMECVGSTG AMNTAIEVCR PGGTVGYVGV PHGIEDDGLD IFGLFMDNIS LNGGIAPVRA
YADELLADVL GGTLDPSPIF TKTVDLDGVP EGYRAMDERE AIKVLVKP