Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_3874 |
Symbol | |
ID | 8744502 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013744 |
Strand | - |
Start bp | 104438 |
End bp | 105643 |
Gene Length | 1206 bp |
Protein Length | 401 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 646514458 |
Product | Mandelate racemase/muconate lactonizing protein |
Protein accession | YP_003405405 |
Protein GI | 284167127 |
COG category | [M] Cell wall/membrane/envelope biogenesis [R] General function prediction only |
COG ID | [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0228827 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGATAA CGAACATCAC CGTCACGAAG GTCAGTACCG ATTCCTGGGG CGAGTTCGTC GAGTTCCCGC TCGTCACCGT CATGAGCAAG TTCGAGGAGT ACAACAACGC CGACGGCGAC AACCCGCAGG CCCGCCGGAA GTGGATGGGG CCGGTCGGCG ACGTCGTCGT GGAGGTCGAG ACGGACGCGG GCATCACCGG CGTCGGCGTC GGCAACTGGG CGACGGGCTC GATCGAGACG ATCGTCGACG AGACGCTCTC GAAGCTCGTC GTCGGCGAGG ATCCCCGCGA GCGCGAACGC CTGTGGGACA TGATGTACCG AGCGACGATC CCCTTCGGTC GGAAGGGGGC GGCCATCGAG GCCATCAGCG CGGTCGACCT CGCGCTCTGG GATATCGCCG GCAAGGAAGC GGAGAAGCCG GTATACGAAC TGCTGGGCGG CCCGGTCACC GACGAGATTC CCTGTTACGC CAGCAACCTC CACCCGGTCG ACCACGAGAA ACTCGCCCGG GAAGCCCAGA ACTACGCCGA GCAGGGCTTC GACGCGATGA AACTGCGGTT CCGGTACGGA CCGGAAGCGG GCCGCAAGGG TATGAAGGAG AACGAGAAGA TCGTCGAGAC GGTCCGGGAC GCCGTCGGCG ACGAGATCGC GATCGCCGGC GACGCCTACA TGGGCTGGGA CGTCCGCTAC GCCAAGAAGA TGCTCAAGCG CCTCGAGCGC TACGACATGG AGTGGGTCGA AGAGCCGGTC ATCCCGGACG ACATCGACGG CTACGCCGAG GTCAGAGAGG CCTCGAACGT CCCCATCTCC GGCGGCGAAC ACGAGTTCAC CCGCTGGGGC CACAAGGAGC TGCTCGAGCG CGAGGCCGTC GACATCCTCC AGCCCGACAT CCACCGCTGT GGCGGGCTGA CCGAGTTGTT GAAGATCGAC TCGATGGCCA GCGCCCGCGA CGTGCCGGTG ATCCCTCACT CCGGAACGAA CCCGACGCTG CACTTCATCG CCGCCTCGAC CAACGCGCCG ATGGCGGAGT ACTTCCCGAT CCCGGAGTGG TACAAGGAGC GCCAGGGCGA GCAGGAGTCG ACCTACGCCG ACGCCATCTA TGCGAATCCG CCCCAGGCCG AAGGTGGCAC CATTCCGCTG CCCGAGACCG TCGGACTGAG CTCGGCGACC AACCCCGAGG CCCTCGAGCA CTACAGCGTG GAGTGA
|
Protein sequence | MEITNITVTK VSTDSWGEFV EFPLVTVMSK FEEYNNADGD NPQARRKWMG PVGDVVVEVE TDAGITGVGV GNWATGSIET IVDETLSKLV VGEDPRERER LWDMMYRATI PFGRKGAAIE AISAVDLALW DIAGKEAEKP VYELLGGPVT DEIPCYASNL HPVDHEKLAR EAQNYAEQGF DAMKLRFRYG PEAGRKGMKE NEKIVETVRD AVGDEIAIAG DAYMGWDVRY AKKMLKRLER YDMEWVEEPV IPDDIDGYAE VREASNVPIS GGEHEFTRWG HKELLEREAV DILQPDIHRC GGLTELLKID SMASARDVPV IPHSGTNPTL HFIAASTNAP MAEYFPIPEW YKERQGEQES TYADAIYANP PQAEGGTIPL PETVGLSSAT NPEALEHYSV E
|
| |