Gene GWCH70_1170 details

Gene Information

Locus tag: GWCH70_1170
Symbol:
ID: 7977646
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacillus sp. WCH70
Kingdom: Bacteria
Replicon accession: NC_012793
Strand:
Start bp: 1219094
End bp: 1219966
Gene Length: 873 bp
Protein Length: 290 aa
Translation table: 11
GC content: 45%
IMG OID: 644798123
Product: dihydrodipicolinate synthase
Protein accession: YP_002949296
Protein GI: 239826672
COG category: [E] Amino acid transport and metabolism; [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0329] Dihydrodipicolinate synthase/N-acetylneuraminate lyase
TIGRFAM ID: [TIGR00674] dihydrodipicolinate synthase; [TIGR00683] N-acetylneuraminate lyase
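
The start, end, gene length and protein length fields are related by simple arithmetic. The sketch below checks that relationship, assuming the coordinates are 1-based and inclusive (an assumption that is consistent with the 873 bp figure in this record) and that the last codon is a stop codon. The function name `cds_lengths` is hypothetical.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS."""
    # Assumption: coordinates are 1-based and inclusive, so both ends count.
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # The final codon is the stop codon and contributes no amino acid.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


print(cds_lengths(1219094, 1219966))  # (873, 290)
```

Both values agree with the Gene Length and Protein Length fields of the record.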


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000026647
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGATCCAAT TTGGCAAAAT CGTGACAGCG ATGGTAACGC CGTTTGATCA TAAAGGAAAT 
ATCGATTTTG CAAAAACGAC CAAGCTTGTC GATTATTTGC TCGAAAACGG AACAGATTCC
CTCGTTGTTG CCGGAACGAC AGGCGAATCG CCAACATTAA CGACGGAAGA AAAAGTTGCT
TTATTCCGTC ACGTTGTTTC GGTTGTGAAC GGGAGAGTTC CAGTTATTGC TGGGACTGGA
AGCAACAATA CACACGCATC GATTGAGTTG ACGAAGAAAG CGGAAGAAGC TGGCGTCGAC
GCGGTAATGT TAGTAGCGCC GTATTATAAT AAACCGAATC AAGAAGGGTT ATATCAACAC
TTCAAAGCGA TTGCCGAAAG CACATCGCTT CCGGTGATGC TCTATAATAT TCCTGGACGT
TCTGTTGTGA ACATGTCTGT TGACACGGTT GTTCGTTTAT CGGAAATTCC AAACATCGTT
GCTATAAAAG ATGCGAGCGG CAATTTAGAT ACGATGACGG AAATAATTGC CCGGACGAGA
GAGGATTTTC TGCTTTACAG CGGTGACGAT AACATCACCC TTCCGGTATT GGCGATTGGC
GGTGCCGGCG TTGTGTCTGT TGCCTCTCAT ATTATTGGCA ATGAAATGCA ACAAATGATT
GCTGCCTTCG AAGCAGGGGA ACTTGCCAAA GCGGCAAAAC TGCATCAAAA GCTGTTGCCA
ATTATGAAAG GGTTATTTGC AGCGCCAAAT CCTGTACCGG TAAAAACGGC GCTGCAGTTA
AAAGGATTAG ACGTTGGTTC TGTTCGTCTG CCGCTTGTCC CGCTTACCGA ACAAGAGCGC
ATCGAGCTAA TGAATTTATT AAATACATTA TAA
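
The GC content field could be recomputed from the gene sequence with a sketch like the one below. The function name `gc_percent` is hypothetical, and the sketch assumes the sequence contains only A, C, G and T once the spaces and line breaks used for display are ignored. The record does not say how its 45% figure was rounded, so the full-sequence result is not asserted here.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C among the A/C/G/T bases in seq."""
    # Keep only nucleotide letters; this drops the spaces and newlines
    # that the record uses to group the sequence into blocks of ten.
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotides found")
    gc_count = sum(1 for b in bases if b in "GC")
    return 100.0 * gc_count / len(bases)


# First two blocks of the gene sequence: 6 of 20 bases are G or C.
print(gc_percent("TTGATCCAAT TTGGCAAAAT"))  # 30.0
```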
 
Protein sequence
MIQFGKIVTA MVTPFDHKGN IDFAKTTKLV DYLLENGTDS LVVAGTTGES PTLTTEEKVA 
LFRHVVSVVN GRVPVIAGTG SNNTHASIEL TKKAEEAGVD AVMLVAPYYN KPNQEGLYQH
FKAIAESTSL PVMLYNIPGR SVVNMSVDTV VRLSEIPNIV AIKDASGNLD TMTEIIARTR
EDFLLYSGDD NITLPVLAIG GAGVVSVASH IIGNEMQQMI AAFEAGELAK AAKLHQKLLP
IMKGLFAAPN PVPVKTALQL KGLDVGSVRL PLVPLTEQER IELMNLLNTL
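
The gene sequence begins with TTG while the protein begins with M. That is expected under translation table 11, where TTG is one of the alternative initiation codons and is read as methionine at the start of a CDS. The sketch below illustrates this; it is not the database's own pipeline. The names `translate_cds`, `CODON_TABLE` and `START_CODONS` are hypothetical, the amino acid string follows the NCBI layout with bases ordered T, C, A, G, and the start codon set is taken from the NCBI definition of table 11.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
# Table 11 uses the same codon assignments as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Initiation codons listed for NCBI translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading a table 11 start codon as M, up to the first stop."""
    seq = "".join(dna.split()).upper()  # drop display spaces and newlines
    usable = len(seq) - len(seq) % 3    # ignore any trailing partial codon
    protein = []
    for pos in range(0, usable, 3):
        codon = seq[pos:pos + 3]
        amino_acid = CODON_TABLE[codon]  # raises KeyError on non-ACGT input
        if amino_acid == "*":
            break
        if pos == 0 and codon in START_CODONS:
            amino_acid = "M"
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence give the first ten residues.
print(translate_cds("TTGATCCAAT TTGGCAAAAT CGTGACAGCG"))  # MIQFGKIVTA
```

Passing the complete gene sequence to `translate_cds` should give the 290-residue protein listed above, since the final TAA codon ends translation.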