Gene Information |
Locus tag | GYMC61_2159 |
Symbol | |
ID | 8526023 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. Y412MC61 |
Kingdom | Bacteria |
Replicon accession | NC_013411 |
Strand | + |
Start bp | 2179030 |
End bp | 2180685 |
Gene Length | 1656 bp |
Protein Length | 551 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | |
Product | urocanate hydratase |
Protein accession | YP_003253256 |
Protein GI | 261419574 |
COG category | |
COG ID | |
TIGRFAM ID | |
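
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is an illustration only: the function names `gene_length` and `protein_length` are hypothetical, and it assumes that Start bp and End bp are 1-based, inclusive positions and that the gene length counts the terminal stop codon (both assumptions agree with the values in this record).

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length(2179030, 2180685)
print(length, protein_length(length))  # prints "1656 551"
```

Both results match the Gene Length (1656 bp) and Protein Length (551 aa) fields of the record.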
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAGAAA AACGGACCGT ACGCCCGTTT GCGGGAACAG AGCGGCGGGC GAAAGGATGG ATTCAAGAAG CGGCGTTGCG CATGTTAAAC AACAATTTGC ATCCCGATGT CGCCGAGCGG CCGGATGAGT TGATCGTCTA CGGCGGCATC GGCAAGGCGG CGCGCAACTG GGAATGTTAC GAGGCGATTG TGGACACCCT TCTTCGTTTA GAAAACGATG AAACGTTGCT CATTCAATCT GGCAAGCCGG TGGCGGTGTT TCGCACGCAT CCGGACGCCC CTCGCGTGCT GATCGCCAAC TCCAACCTCG TGCCCGCATG GGCGACGTGG GACCATTTTC ACGAACTTGA CAAAAAAGGG TTGATCATGT ACGGACAAAT GACGGCCGGG AGCTGGATTT ACATCGGCAG CCAAGGAATC GTCCAAGGGA CATATGAAAC GTTTGCCGAA GTGGCGCGCC AGCACTTTGG CGGCACGCTG GCCGGGACGA TCACGCTAAC CGCCGGCCTT GGCGGCATGG GCGGGGCGCA GCCGCTCGCC GTGACGATGA ACGGCGGCGT CTGCCTCGCC ATCGAAGTCG ATCCGGCCCG CATCCAGCGC CGCATTGACA CGAATTACTT GGATACGATG ACCGACAGCC TAGACGCGGC GCTCGAGATG GCGAAACAAG CGAAGGAAGA GAAAAAAGCG CTGTCGATCG GCCTTGTCGG CAATGCGGCT GAAGTGTTGC CGCGTCTCGT CGAAACGGGC TTTGTTCCGG ATGTCTTGAC CGATCAAACG TCCGCCCACG ATCCGTTAAA CGGCTACATC CCGGCTGGCC TTACGCTTGA TGAGGCCGCC GAACTCAGGG CGCGCGATCC GAAGCAGTAC ATCGCCCGTG CGAAACAGTC GATCGCCGCG CATGTTCGAG CGATGCTGGC GATGCAAAAG CAAGGGGCGG TGACGTTTGA TTACGGCAAC AACATCCGCC AAGTGGCAAA AGACGAAGGG GTGGACGACG CCTTTTCCTT CCCAGGTTTT GTGCCGGCCT ACATCCGTCC GCTCTTTTGC GAAGGAAAAG GGCCGTTCCG CTGGGTGGCA TTATCCGGCG ACCCGGAAGA CATTTATAAA ACCGATGAAG TCATTTTGCG TGAATTCAGC GACAATGAGC GTCTTTGCCA TTGGATTCGC ATGGCGCAAA AACGCATTAA GTTCCAAGGG CTGCCGGCGC GCATTTGTTG GCTCGGCTAC GGCGAGCGGG CGAAATTTGG CAAAATCATC AACGACATGG TGGCCAAAGG CGAGCTGAAA GCGCCGATCG TCATCGGCCG CGATCATTTG GATTCGGGCT CCGTCGCTTC GCCGAACCGG GAGACGGAAG GAATGAAAGA CGGAAGCGAC GCCATCGCCG ACTGGCCGAT TTTAAACGCG CTGTTGAATG CGGTTGGGGG CGCGAGCTGG GTGTCGGTTC ACCACGGTGG CGGCGTCGGC ATGGGCTACT CGATTCACGC CGGCATGGTC ATTGTCGCCG ACGGCACGAA AGAGGCGGAA AAACGGTTGG AACGGGTGTT GACGACCGAC CCGGGGCTTG GTGTGGTCCG CCACGCCGAT GCCGGTTATG AGCTCGCCAT CCGGACGGCG AAAGAAAAAG GCATTGATAT GCCGATGCTC AAGTAG
|
Protein sequence | MAEKRTVRPF AGTERRAKGW IQEAALRMLN NNLHPDVAER PDELIVYGGI GKAARNWECY EAIVDTLLRL ENDETLLIQS GKPVAVFRTH PDAPRVLIAN SNLVPAWATW DHFHELDKKG LIMYGQMTAG SWIYIGSQGI VQGTYETFAE VARQHFGGTL AGTITLTAGL GGMGGAQPLA VTMNGGVCLA IEVDPARIQR RIDTNYLDTM TDSLDAALEM AKQAKEEKKA LSIGLVGNAA EVLPRLVETG FVPDVLTDQT SAHDPLNGYI PAGLTLDEAA ELRARDPKQY IARAKQSIAA HVRAMLAMQK QGAVTFDYGN NIRQVAKDEG VDDAFSFPGF VPAYIRPLFC EGKGPFRWVA LSGDPEDIYK TDEVILREFS DNERLCHWIR MAQKRIKFQG LPARICWLGY GERAKFGKII NDMVAKGELK APIVIGRDHL DSGSVASPNR ETEGMKDGSD AIADWPILNA LLNAVGGASW VSVHHGGGVG MGYSIHAGMV IVADGTKEAE KRLERVLTTD PGLGVVRHAD AGYELAIRTA KEKGIDMPML K
|
| |
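
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of that step, assuming the sequence text has been copied out of the record as a plain string; the helper names (`join_blocks`, `gc_fraction`, `ends_with_stop`) are hypothetical and not part of any database tool. The stop codons used are those of translation table 11, the table listed in the record.

```python
# Stop codons of translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def join_blocks(spaced: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case the result."""
    return "".join(spaced.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the length is a multiple of 3 and the final codon is a stop codon."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# Demonstration on the first two blocks of the gene sequence only.
head = join_blocks("ATGGCAGAAA AACGGACCGT")
print(head, gc_fraction(head))  # prints "ATGGCAGAAAAACGGACCGT 0.5"
```

Applying `join_blocks` to the full Gene sequence field and passing the result to `gc_fraction` gives a value that can be compared against the GC content field; `ends_with_stop` offers a quick sanity check that the copied text is a complete coding sequence.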