Gene Htur_3874 details


Gene Information

Locus tag: Htur_3874
Symbol:
ID: 8744502
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haloterrigena turkmenica DSM 5511
Kingdom: Archaea
Replicon accession: NC_013744
Strand:
Start bp: 104438
End bp: 105643
Gene Length: 1206 bp
Protein Length: 401 aa
Translation table: 11
GC content: 66%
IMG OID: 646514458
Product: Mandelate racemase/muconate lactonizing protein
Protein accession: YP_003405405
Protein GI: 284167127
COG category: [M] Cell wall/membrane/envelope biogenesis; [R] General function prediction only
COG ID: [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily
TIGRFAM ID:
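
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates and a CDS that ends with a single stop codon not counted in the protein length. The function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record: Start bp 104438, End bp 105643.
length_bp = gene_length(104438, 105643)
print(length_bp, protein_length(length_bp))  # prints "1206 401"
```

Under these assumptions the result agrees with the Gene Length (1206 bp) and Protein Length (401 aa) fields.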


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0228827
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAGATAA CGAACATCAC CGTCACGAAG GTCAGTACCG ATTCCTGGGG CGAGTTCGTC 
GAGTTCCCGC TCGTCACCGT CATGAGCAAG TTCGAGGAGT ACAACAACGC CGACGGCGAC
AACCCGCAGG CCCGCCGGAA GTGGATGGGG CCGGTCGGCG ACGTCGTCGT GGAGGTCGAG
ACGGACGCGG GCATCACCGG CGTCGGCGTC GGCAACTGGG CGACGGGCTC GATCGAGACG
ATCGTCGACG AGACGCTCTC GAAGCTCGTC GTCGGCGAGG ATCCCCGCGA GCGCGAACGC
CTGTGGGACA TGATGTACCG AGCGACGATC CCCTTCGGTC GGAAGGGGGC GGCCATCGAG
GCCATCAGCG CGGTCGACCT CGCGCTCTGG GATATCGCCG GCAAGGAAGC GGAGAAGCCG
GTATACGAAC TGCTGGGCGG CCCGGTCACC GACGAGATTC CCTGTTACGC CAGCAACCTC
CACCCGGTCG ACCACGAGAA ACTCGCCCGG GAAGCCCAGA ACTACGCCGA GCAGGGCTTC
GACGCGATGA AACTGCGGTT CCGGTACGGA CCGGAAGCGG GCCGCAAGGG TATGAAGGAG
AACGAGAAGA TCGTCGAGAC GGTCCGGGAC GCCGTCGGCG ACGAGATCGC GATCGCCGGC
GACGCCTACA TGGGCTGGGA CGTCCGCTAC GCCAAGAAGA TGCTCAAGCG CCTCGAGCGC
TACGACATGG AGTGGGTCGA AGAGCCGGTC ATCCCGGACG ACATCGACGG CTACGCCGAG
GTCAGAGAGG CCTCGAACGT CCCCATCTCC GGCGGCGAAC ACGAGTTCAC CCGCTGGGGC
CACAAGGAGC TGCTCGAGCG CGAGGCCGTC GACATCCTCC AGCCCGACAT CCACCGCTGT
GGCGGGCTGA CCGAGTTGTT GAAGATCGAC TCGATGGCCA GCGCCCGCGA CGTGCCGGTG
ATCCCTCACT CCGGAACGAA CCCGACGCTG CACTTCATCG CCGCCTCGAC CAACGCGCCG
ATGGCGGAGT ACTTCCCGAT CCCGGAGTGG TACAAGGAGC GCCAGGGCGA GCAGGAGTCG
ACCTACGCCG ACGCCATCTA TGCGAATCCG CCCCAGGCCG AAGGTGGCAC CATTCCGCTG
CCCGAGACCG TCGGACTGAG CTCGGCGACC AACCCCGAGG CCCTCGAGCA CTACAGCGTG
GAGTGA
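
The gene sequence is printed in space-separated blocks of ten bases. A minimal sketch for stripping that layout and computing a GC fraction follows. The helper names are hypothetical, and the sample string is only the first line of the gene sequence, so the value it prints is not the whole-gene GC content reported in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in a non-empty sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence only, used as sample input.
first_line = "ATGGAGATAA CGAACATCAC CGTCACGAAG GTCAGTACCG ATTCCTGGGG CGAGTTCGTC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full sequence block through `clean_sequence` first gives a string whose length can be compared with the Gene Length field.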
 
Protein sequence
MEITNITVTK VSTDSWGEFV EFPLVTVMSK FEEYNNADGD NPQARRKWMG PVGDVVVEVE 
TDAGITGVGV GNWATGSIET IVDETLSKLV VGEDPRERER LWDMMYRATI PFGRKGAAIE
AISAVDLALW DIAGKEAEKP VYELLGGPVT DEIPCYASNL HPVDHEKLAR EAQNYAEQGF
DAMKLRFRYG PEAGRKGMKE NEKIVETVRD AVGDEIAIAG DAYMGWDVRY AKKMLKRLER
YDMEWVEEPV IPDDIDGYAE VREASNVPIS GGEHEFTRWG HKELLEREAV DILQPDIHRC
GGLTELLKID SMASARDVPV IPHSGTNPTL HFIAASTNAP MAEYFPIPEW YKERQGEQES
TYADAIYANP PQAEGGTIPL PETVGLSSAT NPEALEHYSV E