Gene GWCH70_3171 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_3171 
Symbol 
ID7977024 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp3199699 
End bp3200769 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content41% 
IMG OID644799956 
ProductMandelate racemase/muconate lactonizing protein 
Protein accessionYP_002951095 
Protein GI239828471 
COG category[M] Cell wall/membrane/envelope biogenesis
[R] General function prediction only 
COG ID[COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily 
TIGRFAM ID[TIGR01928] o-succinylbenzoic acid (OSB) synthetase 


Plasmid Coverage information

Num covering plasmid clones45 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAATAA AAGACGCAAC CATCGCCACA CAATCGACTC CATTAATAAA ACCTTTCAAA 
ACGGCATTGC GGACAGCTAC GCAAATTGAA AGCATCGTCG TAAAAATCAC GCTAGATAAC
GGAATTGAAG GGTATGGGGC CGCCGTCCCG ACCGAAGCCA TCACGGGAGA AACGAAACAA
GGAATCATAG GTATTTTAGA AAATGTACTC ATCCCTAAAA TCATCGGTAA AGAAATCGAA
GAAATAGCAA AAAACAGTAA AGATATTCAA ACTAGTTGCA TCGAAAACAC AAGCGCGAAA
GCAGCATTAG AAATGGCCAT GTACGATGCC CTTTGCAAAC TACTAAACAT TCCTCTCTAT
CAATTATTCG GAGGAAAGAC GAACCGTCAT GTCAACGATA TGACAATTAG CGTAAATAGT
GTAGAAGAAA TGGTCAATGA CGCAAAAAAA GTCACAGAAA AAGGCTTTTC GATTTTAAAA
ATTAAAGTAG GAAAAGAAGC CGAAAAGGAT ATCGAACGAA TCGAACGAAT TTATGAAGAA
GTGGGGCCAA ATGTTTCACT GCGCATTGAC GCTAATCAAG GATGGACAGC GAAAGAAGCT
GTGGAAATCA TTCAAACGTT AGAACGGCTC CAACTTCCGA TCGAATTTAT TGAGCAGCCC
GTTCCTAAAT ATGACATAAA AGGTCTTCAG TTTATCCGAG AACGAGTCAA CATACCGATT
ATGGCGGACG AAAGTGTATT CTCGGCTCGA GATGCACTAG AACTGATCCG TCATCACGCT
GTCGATTTGA TCAATATTAA GTTAATGAAA ACGGGAGGAT TACGGGAAGC CTACAAGATT
GCTAGTCTAG CAGAAGCGGC CGGTATCGAA TGCATGATCG GAAGCATGAT GGAACCAACT
CTTTCCGTAC TGGCAGCAAC CCATTTAGCA ATTGCCCATC CGAACATTAC AAAAGTCGAT
TTAGATGCGC CTCTATGGAT AGATGATGAT AGCAGTCGCT CGTTCTTTCA AGGAAGCGAA
ATTAACGTTC CTGATTTACC AGGGATCGGT TATGTTCCTT TATATAACTA A
 
Protein sequence
MKIKDATIAT QSTPLIKPFK TALRTATQIE SIVVKITLDN GIEGYGAAVP TEAITGETKQ 
GIIGILENVL IPKIIGKEIE EIAKNSKDIQ TSCIENTSAK AALEMAMYDA LCKLLNIPLY
QLFGGKTNRH VNDMTISVNS VEEMVNDAKK VTEKGFSILK IKVGKEAEKD IERIERIYEE
VGPNVSLRID ANQGWTAKEA VEIIQTLERL QLPIEFIEQP VPKYDIKGLQ FIRERVNIPI
MADESVFSAR DALELIRHHA VDLINIKLMK TGGLREAYKI ASLAEAAGIE CMIGSMMEPT
LSVLAATHLA IAHPNITKVD LDAPLWIDDD SSRSFFQGSE INVPDLPGIG YVPLYN