Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_2591 |
Symbol | |
ID | 7976352 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | - |
Start bp | 2623158 |
End bp | 2624573 |
Gene Length | 1416 bp |
Protein Length | 471 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 644799392 |
Product | isopropylmalate isomerase large subunit |
Protein accession | YP_002950552 |
Protein GI | 239827928 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0065] 3-isopropylmalate dehydratase large subunit |
TIGRFAM ID | [TIGR00170] 3-isopropylmalate dehydratase, large subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.00933091 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAAACCGA AAACGATTAT TGAAAAGATT TGGGAAAATC ATGTCGTATA TCGCGAAGAT GGAAAACCAG ATTTGCTCTA TATCGACTTA CATTTAGTGC ACGAAGTCAC CTCTCCGCAA GCGTTTGAAG GATTGCGGCA AAAGGGGCGG AAAGTGCGCC GCCCAGATTT AACATTTGCG ACGATGGATC ATAATGTTCC AACAGTAAAC CGTTTGGTGA TTGAGGACGA AGTGGCAAGA AATCAAATTG CCGCGTTGGA ACGAAACTGT CGGGAATTTT CGATTCCGCT AGCTGATTTG CGTAGTGAAG AACAAGGAAT TGTTCATGTG ATTGGCCCAG AGCTTGGTTT AACACAACCT GGAAAAACGA TTGTTTGCGG GGACAGCCAT ACGTCGACAC ATGGAGCATT TGGAGCGCTT GCGTTTGGAA TCGGAACGAG CGAAGTGGAA CATGTGCTTG CGACACAAAC GTTATGGCAG CATAAACCGA AAACGCTGCA AATTCGCATT AACGGAAAAT TAGGGGAAGG GGTTACCGCC AAAGACGTCA TTTTGGCCAT TATCGGCCGC TACGGGGTTG ATGTTGGAAC AGGATATATT ATCGAATTTA CTGGAGAAGT CATTCGTAAC ATGTCGATGG AAGAGAGAAT GACGATTTGC AACATGTCGA TTGAAGCAGG CGCACGTGCT GGTTTAGTAA GTCCAGATGA AACGACGTTT GCTTATTTGC GAGGGCGCAA ATATGCGCCG AAGGGAGAAG AGTTTGAAAA AGCGGTAGAA CGCTGGCGGG CTCTTGCGAC AGATGAAGGC GCGGAATACG ACAAAACGAT TGAAATAGAT GCGTCGACCA TTGCCCCAAT GGTGACATGG GGGACGAACC CATCGATGAG CACATCGATC GAAGGAACAG TGCCGTATCC AGAGGATTTT TCAAGCGAAA CGGAACAAAA GGCAGTACGA CGCGCTTTGG AATATATGGG ATTAGAACCT GGCACTCCGA TTACAGAAAT TCCAGTGCAG CACGTTTTCA TCGGTTCCTG TACGAACTCT CGTATTAGCG ACTTGCGTGA AGCCGCGAAA ATTGTCAAAG GAAAGAAAGT GGCAAAAGGT GTGCGTGCGC TTGTTGTCCC TGGATCGCAA CAAGTGAAAA AACAAGCGGA AGAAGAAGGA TTAGCACAAA TTTTTATCGA TGCGGGATTC GAATGGCGTG ATTCCGGCTG CAGCGCATGT TTAGGAATGA ATCCAGACAT TATACCGGAA GGAGAACATT GCGCTTCCAC ATCGAATCGA AATTTTGAGG GACGACAAGG AAAAGGGGCA AGAACGCACT TAGTTAGCCC GGCGATGGCT GCCGCTGCTG CAATTTATGG TCATTTTGTC GATGTTCGAA AATTGCAGAA AGAACCGGTT CATTAA
|
Protein sequence | MKPKTIIEKI WENHVVYRED GKPDLLYIDL HLVHEVTSPQ AFEGLRQKGR KVRRPDLTFA TMDHNVPTVN RLVIEDEVAR NQIAALERNC REFSIPLADL RSEEQGIVHV IGPELGLTQP GKTIVCGDSH TSTHGAFGAL AFGIGTSEVE HVLATQTLWQ HKPKTLQIRI NGKLGEGVTA KDVILAIIGR YGVDVGTGYI IEFTGEVIRN MSMEERMTIC NMSIEAGARA GLVSPDETTF AYLRGRKYAP KGEEFEKAVE RWRALATDEG AEYDKTIEID ASTIAPMVTW GTNPSMSTSI EGTVPYPEDF SSETEQKAVR RALEYMGLEP GTPITEIPVQ HVFIGSCTNS RISDLREAAK IVKGKKVAKG VRALVVPGSQ QVKKQAEEEG LAQIFIDAGF EWRDSGCSAC LGMNPDIIPE GEHCASTSNR NFEGRQGKGA RTHLVSPAMA AAAAIYGHFV DVRKLQKEPV H
|
| |