Gene Htur_4441 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_4441 
Symbol 
ID8745070 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013745 
Strand
Start bp22541 
End bp23716 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content70% 
IMG OID646514978 
ProductMandelate racemase/muconate lactonizing protein 
Protein accessionYP_003405925 
Protein GI284172543 
COG category[M] Cell wall/membrane/envelope biogenesis
[R] General function prediction only 
COG ID[COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0607658 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCACGTTA CAGACTACGA GCTCTACGCG GTGCCGCCGC GCTGGCAGTT CCTCAGACTC 
GAGACGAGCG ACGGGCGCGT CGGCTGGGGC GAGGTCTACA CCAAGTGGCA CTTCGCGGGC
GACAGCGAAC CGGCGACCCG GAGCGCGGTC GATCAGCTGA TGCACCAGTA CGTCCTCGGC
GAGGACCCGA GTCGCATCGA GTACCTCTGG CAGGCGATGT ACCGCAGCAG CTTCTACCGC
GGCGGACCGG TCCACATGAG CGCCATCGCC GGCATCGACG AGGCGCTGTG GGACCTGAAG
GGGAAGGCGG CCGGGATGCC GGTCTACGAA CTGCTCGGCG GTCCTGCACG CGACCGCGTC
CGACTCTACC AGCACGTCAG GGCTCACGGC GCCGACGACG TGGCGGATCC GGCGGCCGCG
GCCGCCGACG AGGCGCGCGA GCACGTCGAA GCGGGCTACA CCGCCGTGAA GCTGGTTCCG
ACGGGCGGAC TCGAGATCAT CGATGCGCCG GCAGCCGTCG AAGAAGCGCG CGAAATCGTC
GGCGCGGTCC GCGACGCCGT CGGCCCCGAG GTCGACGTCG CGCTGGATTT CCACGGCCGC
GCCTCGAAGG CGATGGCCCG CCGACTGGCG ACGGCGCTCG AGGAGTTCCA GCCGATGTTC
GTCGAGGAGC CGGTTACCCC CGAGCACGAC CACGCGCTGC CCCGGATCGC CGAGGGGACG
ACGATTCCGA TCGCGACGGG CGAGCGCCTC TACTCTCGGA GCGAGTTCCG GCCGATCCTC
GAGGCCGACG CGGTCGACGT CGTCCAGCCG GACGTCTCGA GCGCCGGGGG GATCACTGAG
ACGAAGAAAA TCGCCGACAT GGCCGAGACG TACGACGCCT CGATCGCGCC CCACTGCCCC
ATCGGCCCGC TGGCGCTGGC GGCCTCGCTA CACGTCGACG CGGCCGCGCC GAACGCGCTG
GTACAGGAGC AAGTGGTCGT CGACGACGAA GACGCGATGC GGTACGTCGA AAACGACGAG
ATCTTCGAAC CGGCCGACGG CTATCTGGAC CTGCCTGACG GACCGGGGCT CGGAATCGAG
ATCGACGAGA ATCGCGTCCG CGAACTCGCG GGAACGGACC TCGGCTTCGA CCGCTCGCCG
GGCCACCGCG CCGACGGCAG CGTCGGCGAG CGGTGA
 
Protein sequence
MHVTDYELYA VPPRWQFLRL ETSDGRVGWG EVYTKWHFAG DSEPATRSAV DQLMHQYVLG 
EDPSRIEYLW QAMYRSSFYR GGPVHMSAIA GIDEALWDLK GKAAGMPVYE LLGGPARDRV
RLYQHVRAHG ADDVADPAAA AADEAREHVE AGYTAVKLVP TGGLEIIDAP AAVEEAREIV
GAVRDAVGPE VDVALDFHGR ASKAMARRLA TALEEFQPMF VEEPVTPEHD HALPRIAEGT
TIPIATGERL YSRSEFRPIL EADAVDVVQP DVSSAGGITE TKKIADMAET YDASIAPHCP
IGPLALAASL HVDAAAPNAL VQEQVVVDDE DAMRYVENDE IFEPADGYLD LPDGPGLGIE
IDENRVRELA GTDLGFDRSP GHRADGSVGE R