Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_1165 |
Symbol | |
ID | 7977642 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 1213906 |
End bp | 1214808 |
Gene Length | 903 bp |
Protein Length | 300 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 644798118 |
Product | dipicolinate synthase subunit A |
Protein accession | YP_002949291 |
Protein GI | 239826667 |
COG category | [C] Energy production and conversion [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG1052] Lactate dehydrogenase and related dehydrogenases |
TIGRFAM ID | [TIGR02853] dipicolinic acid synthetase, A subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.00398415 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGCTAA CAGGCATGCA TATTGCCATT ATAGGTGGGG ACGCAAGGCA ATTAGAAGTC ATACGCAAAT TAATTGAGCT AGATGCGAAA TTGTCTTTAG TTGGCTTTGA TCAATTAGCG CATCATTTTA CGGGAGCGAC AAAGTTGAGA ATAGATGAAG TCGATTTCGC CGATTTAGAT GCGATTATTT TGCCGGTTCA CGGAACAACT CCGGATGGAA AAGTAAATAC CGTGTTTTCG CATGAACCAA TTCCCTTTAC AGAAGAAATG ATGTTAAAAA CACCGAAGCA TTGCACGGTC TATTCCGGGA TTAGTAATAG CTACTTGGAC AATTTAATGA AAACAATCGA CCGCAAATAT GTTCAATTAT TTGAACGCGA CGATGTAGCG ATTTATAACT CGATTCCAAC CGCAGAGGGA ACGATCATGA TTGTCATTCA GCATACCGAT TTTACGATTC ATGGTTCTCG CGTAGCGGTC CTTGGGCTTG GGCGCGTCGG CATGACGGTT GCACGTACAT TTGCTGCACT TGGTGCAAAA GTGAAAGTAG GGGCACGCCG CTCGGAGCAT CTGGCCCGCA TTACAGAAAT GGGTCTTACC CCGTTTCATC TCAATGATTT AGAAAAAGAA GTGCAGGATA TCGATATTTG CATTAATACC GTTCCGCATT TAATCGTAAC GGCAAGTGTC ATCGCAAAAA TGCCGGCGCA TACGCTTATC GTTGATTTAG CTTCAAGGCC TGGCGGCACC GATTTTCGTT ACGCCGAAAA GCGTGGAGTA AAAGCGATCC TCGCACCAGG GCTGCCGGGA GTTGTTGCGC CAAAAACAGC AGGGCAAATT ATCGCAAATG TTCTTTCACA ACTATTATAT ACAGATTTAG AAAAAAGAAA GGGGAATGTG TAA
|
Protein sequence | MMLTGMHIAI IGGDARQLEV IRKLIELDAK LSLVGFDQLA HHFTGATKLR IDEVDFADLD AIILPVHGTT PDGKVNTVFS HEPIPFTEEM MLKTPKHCTV YSGISNSYLD NLMKTIDRKY VQLFERDDVA IYNSIPTAEG TIMIVIQHTD FTIHGSRVAV LGLGRVGMTV ARTFAALGAK VKVGARRSEH LARITEMGLT PFHLNDLEKE VQDIDICINT VPHLIVTASV IAKMPAHTLI VDLASRPGGT DFRYAEKRGV KAILAPGLPG VVAPKTAGQI IANVLSQLLY TDLEKRKGNV
|
| |