Gene Tpen_1837 details

Gene Information

Locus tag: Tpen_1837
Symbol:
ID: 4602074
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermofilum pendens Hrk 5
Kingdom: Archaea
Replicon accession: NC_008698
Strand:
Start bp: 1779300
End bp: 1780370
Gene Length: 1071 bp
Protein Length: 356 aa
Translation table: 11
GC content: 59%
IMG OID: 639774610
Product: mandelate racemase/muconate lactonizing protein
Protein accession: YP_921235
Protein GI: 119720740
COG category: [M] Cell wall/membrane/envelope biogenesis; [R] General function prediction only
COG ID: [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily
TIGRFAM ID:
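
The coordinate and length fields in this record can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon. Neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1779300
END_BP = 1780370
GENE_LENGTH_BP = 1071
PROTEIN_LENGTH_AA = 356


def span_length(start: int, end: int) -> int:
    """Feature length for 1-based inclusive coordinates (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons print True for the values in this record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the record is internally consistent: 1071 bp corresponds to 357 codons, of which the last is the stop codon, leaving 356 residues.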


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.106418
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGACATAG AAGCTTGGGT GTGCGAAGTC CCGCTTAGGG AGCCCTTAAG GATTTCTGCC 
GCGACCTATC ACACCCAGCG CTCCATAGTT CTCCGCTTGA GGGACGGAGA TCTAGAGGGT
TGGGGCGAGG CTTGCCAGTC GAGGAGGGTT CTCGGAGAGA GCTTCGAGGA CGCGCTGGAG
TCTCTGAGAG CGAGCGTCGA CGCAATCAAG AGGGCGGAGT ACGACTCTTT GGAGAAGATT
CACAGGTTCA CCGAGGAGCT CAACGCTACC CCGTCGATAA AGGCCGCTGT TAACATGGCT
CTCCTGGACC TCTACGCGAA GTCGGAGGGC AGGCCTCTGT GGAGGCTTCT CGGGGGCTTC
AGGGAGGAGG TGAGGACGGA CATCACGATA GGGATCATGC GGCCGGAGGA GATGGCGGAG
AGGGCTCTGC GCTACGCTGA GAAGGGGTTC CGGATATTCA AGCTCAAGCT GGGCGAGAAC
CCGGAGGAGG ACGTTCTAAG AGTTAAGGCT GTGCGGGACG TCGTTGGAGA CGCGACGATA
AGAGTGGACG CGAACGAGGG GTGGACGCGC GAGGACGCCG TACGGGTGAT AGAGAGGATA
GCCGACTACG GCGTTGAACT CGTGGAGCAA CCGCTCAGAC ACGACGACAT AGAAGGCCTT
AGGGCCTTGA GGAGGGAAAG CCCGATACCC ATAGCTGTAG ACGAGTCCGT GAAGACGGCG
CGCGACGCGC TGCTGGTAGC CAAGAAGGAG GCGGCCGACA TCATAAACAT CAAGCTGATG
AAAAGCAGGG GGATTACGGG CGCAATCAGG ATAATCATGG TAAGCGAGGC GGCCGGGCTG
AAGAACATGG TCGGATGCTT CTCCGAGTCT AGGCTTGGCA TCACGGCTAC CGCTTACCTC
GCCCAGGCAT TCTCCAACGT GCTGTTCTAC GACTTGGACT GCGACATACT GAGCGCCGAC
CCCGTGTTCA CGGGAGGCTC TGAGCCTATT GGTGACAGAA GAAGAGCTTC CCCGAAGCCT
GGGCTGGGAA TCTCGCCGAG TAACTTCAAC GTATTGAAGA AGATCCTCTA G
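
The gene sequence is printed in blocks of ten bases separated by spaces, so the whitespace has to be removed before any computation. As a minimal sketch, the helpers below clean such a block and compute a GC percentage comparable to the GC content field of the record. The function names are hypothetical, and the demonstration uses only the first line of the sequence, so its result is not expected to equal the whole-gene value.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence; upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, used only as a short demonstration.
first_line = "GTGGACATAG AAGCTTGGGT GTGCGAAGTC CCGCTTAGGG AGCCCTTAAG GATTTCTGCC"
print(len(clean_sequence(first_line)))
print(round(gc_percent(first_line), 1))
```

To check the reported GC content, pass the complete gene sequence to `gc_percent` instead of the first line.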
 
Protein sequence
MDIEAWVCEV PLREPLRISA ATYHTQRSIV LRLRDGDLEG WGEACQSRRV LGESFEDALE 
SLRASVDAIK RAEYDSLEKI HRFTEELNAT PSIKAAVNMA LLDLYAKSEG RPLWRLLGGF
REEVRTDITI GIMRPEEMAE RALRYAEKGF RIFKLKLGEN PEEDVLRVKA VRDVVGDATI
RVDANEGWTR EDAVRVIERI ADYGVELVEQ PLRHDDIEGL RALRRESPIP IAVDESVKTA
RDALLVAKKE AADIINIKLM KSRGITGAIR IIMVSEAAGL KNMVGCFSES RLGITATAYL
AQAFSNVLFY DLDCDILSAD PVFTGGSEPI GDRRRASPKP GLGISPSNFN VLKKIL
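
The gene sequence begins with GTG while the protein sequence begins with M. This follows from translation table 11, which assigns the same amino acids as the standard code but allows additional initiation codons that are read as methionine at the start of a CDS. The sketch below illustrates this with a hand-built codon table. The set of start codons is a deliberately small subset chosen for illustration, and the function name is hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a subset of the initiation codons permitted by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading an accepted start codon as M and stopping at a stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        residue = CODON_TABLE.get(codon, "X")  # X marks codons with ambiguous bases
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First thirty bases of the gene sequence in this record.
print(translate_cds("GTGGACATAG AAGCTTGGGT GTGCGAAGTC"))  # MDIEAWVCEV
```

The printed residues match the first ten residues of the protein sequence in this record. Applied to the last twelve bases of the gene (AAGATCCTCTAG), the same function returns KIL, which matches the end of the protein.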