Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_1170 |
Symbol | |
ID | 7977646 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 1219094 |
End bp | 1219966 |
Gene Length | 873 bp |
Protein Length | 290 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 644798123 |
Product | dihydrodipicolinate synthase |
Protein accession | YP_002949296 |
Protein GI | 239826672 |
COG category | [E] Amino acid transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0329] Dihydrodipicolinate synthase/N-acetylneuraminate lyase |
TIGRFAM ID | [TIGR00674] dihydrodipicolinate synthase [TIGR00683] N-acetylneuraminate lyase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0000026647 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGATCCAAT TTGGCAAAAT CGTGACAGCG ATGGTAACGC CGTTTGATCA TAAAGGAAAT ATCGATTTTG CAAAAACGAC CAAGCTTGTC GATTATTTGC TCGAAAACGG AACAGATTCC CTCGTTGTTG CCGGAACGAC AGGCGAATCG CCAACATTAA CGACGGAAGA AAAAGTTGCT TTATTCCGTC ACGTTGTTTC GGTTGTGAAC GGGAGAGTTC CAGTTATTGC TGGGACTGGA AGCAACAATA CACACGCATC GATTGAGTTG ACGAAGAAAG CGGAAGAAGC TGGCGTCGAC GCGGTAATGT TAGTAGCGCC GTATTATAAT AAACCGAATC AAGAAGGGTT ATATCAACAC TTCAAAGCGA TTGCCGAAAG CACATCGCTT CCGGTGATGC TCTATAATAT TCCTGGACGT TCTGTTGTGA ACATGTCTGT TGACACGGTT GTTCGTTTAT CGGAAATTCC AAACATCGTT GCTATAAAAG ATGCGAGCGG CAATTTAGAT ACGATGACGG AAATAATTGC CCGGACGAGA GAGGATTTTC TGCTTTACAG CGGTGACGAT AACATCACCC TTCCGGTATT GGCGATTGGC GGTGCCGGCG TTGTGTCTGT TGCCTCTCAT ATTATTGGCA ATGAAATGCA ACAAATGATT GCTGCCTTCG AAGCAGGGGA ACTTGCCAAA GCGGCAAAAC TGCATCAAAA GCTGTTGCCA ATTATGAAAG GGTTATTTGC AGCGCCAAAT CCTGTACCGG TAAAAACGGC GCTGCAGTTA AAAGGATTAG ACGTTGGTTC TGTTCGTCTG CCGCTTGTCC CGCTTACCGA ACAAGAGCGC ATCGAGCTAA TGAATTTATT AAATACATTA TAA
|
Protein sequence | MIQFGKIVTA MVTPFDHKGN IDFAKTTKLV DYLLENGTDS LVVAGTTGES PTLTTEEKVA LFRHVVSVVN GRVPVIAGTG SNNTHASIEL TKKAEEAGVD AVMLVAPYYN KPNQEGLYQH FKAIAESTSL PVMLYNIPGR SVVNMSVDTV VRLSEIPNIV AIKDASGNLD TMTEIIARTR EDFLLYSGDD NITLPVLAIG GAGVVSVASH IIGNEMQQMI AAFEAGELAK AAKLHQKLLP IMKGLFAAPN PVPVKTALQL KGLDVGSVRL PLVPLTEQER IELMNLLNTL
|
| |