Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BCAH820_2002 |
Symbol | hom1 |
ID | 7190940 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus cereus AH820 |
Kingdom | Bacteria |
Replicon accession | NC_011773 |
Strand | + |
Start bp | 1907494 |
End bp | 1908789 |
Gene Length | 1296 bp |
Protein Length | 431 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 643555414 |
Product | homoserine dehydrogenase |
Protein accession | YP_002450953 |
Protein GI | 218903119 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0460] Homoserine dehydrogenase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 4.97211e-42 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAATAACG TTATTCATGT AGGGGTGTTA GGATTAGGTA CGGTCGGAAG TGGTGTTGTC CATATTTTGA AAGAACATTA TAAAAAAATT GCACTTGATA CAGGGCATGA AGTGAAGGTG AAGACAGTCG TTGTACGTGA TTTAGAAAAA GAACGTGATG TTTGTATCGA TGGAATCGTA GTAACAAGTC ATGTTGATGA AGTTCTAAAT GATTCAAATA TTGATATTGT AGTAGAGGTA ATGGGCGGAA TTGAAGAAGC GAAGCAGCAT ATTGTTAAGG CTTTACGAAA TAAGAAACAT GTCGTGACAG CAAATAAAGA TTTAATGGCT GTATACGGTG CAGAGCTTTT GCAACTGGCG AACGATAATG ATTGTGATTT ATGTTATGAA GCAAGTGTAG CCGGTGGTAT TCCAGTGTTA AGAGGACTAA CAGACGGATT AGCTTCAGAT CAAATTGAAA AAATAATGGG AATCGTAAAT GGAACAACAA ATTATATGTT AACAAAGATG AGTCAAAAGG GATGGTCGTA TGAAGAGGCT TTACAAGAAG CGCAAAAATT AGGTTTCGCA GAATCAGATC CGACAGCGGA TGTAGATGGA TTAGATGCAG CGAGAAAAGT AGCAATCCTT GCAAATTTAG GTTTTTCGAT GAATGTTTCT TTGGATGATG TGCAAGTAAG AGGGATTCGA AAGGTAGAAA AAGAAGATTT ACAAATGGCT GAAAAGTTAG GGTTTACTAT GAAGTTAATT GGTAAAGCAG AGAAACAGGG ATCAGCTATT CATTTAAGTG TAGAACCGAC ACTTTTACCA AGTCATCATC CATTGTCAAA TGTAAATAAT GAATTTAATG CAGTGTATGT TCACGGGCAA GCGGTAGGAG AAGTGATGTT TTACGGACCT GGAGCAGGTA AATTGCCGAC TGGTTCTGCA GTAGTAAGTG ATATTATTTC AATCGTTAAA AATATGAATC AAGTTCCAAA AAATAAAAGT GTGTTAAAAG AACCAGAGCC ATACGAATTA CAAGGGGATG AAGAAGTCGT TTCGAAATAT TTCTTACGTA TTTCATTACG AGATGAGCCA GGGATGTTGC AAAAAATAAC AGAATGTTTC GTTAATTATT CTGTAAGTTT AAAAGAAGTA ATTCAATTAC CTTTAAATCG TGAACTTGCA GAAGTCGTTG TTGTGACACA TCAAACTTCA AAGTATCAAT TCGAACGAGT TTTAGGGGCA ATAGAAGATG TCGCAAGTGA AATAAACAGT TACTACATTA TTGAGGAGGA AAAACAATAT GTATAA
|
Protein sequence | MNNVIHVGVL GLGTVGSGVV HILKEHYKKI ALDTGHEVKV KTVVVRDLEK ERDVCIDGIV VTSHVDEVLN DSNIDIVVEV MGGIEEAKQH IVKALRNKKH VVTANKDLMA VYGAELLQLA NDNDCDLCYE ASVAGGIPVL RGLTDGLASD QIEKIMGIVN GTTNYMLTKM SQKGWSYEEA LQEAQKLGFA ESDPTADVDG LDAARKVAIL ANLGFSMNVS LDDVQVRGIR KVEKEDLQMA EKLGFTMKLI GKAEKQGSAI HLSVEPTLLP SHHPLSNVNN EFNAVYVHGQ AVGEVMFYGP GAGKLPTGSA VVSDIISIVK NMNQVPKNKS VLKEPEPYEL QGDEEVVSKY FLRISLRDEP GMLQKITECF VNYSVSLKEV IQLPLNRELA EVVVVTHQTS KYQFERVLGA IEDVASEINS YYIIEEEKQY V
|
| |