Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GYMC61_0474 |
Symbol | aroB |
ID | 8524280 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. Y412MC61 |
Kingdom | Bacteria |
Replicon accession | NC_013411 |
Strand | + |
Start bp | 476933 |
End bp | 478033 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | |
Product | 3-dehydroquinate synthase |
Protein accession | YP_003251638 |
Protein GI | 261417956 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCGAAC GGACGATTGA AACGGCGACG AAGCGCTATC CGCTTCTTTT GGGCGATGGG GCGGCCCGCG TGCTGCCAAG CTTGCTTCGG TCGCTCTCCT GTCCGCCAGG GACAAAACTT TTCATCGTTA CCGACGATAC TGTGGCGCCT CTTTATTTGG ATGAGGTGCG GGCGTTGCTT GCCGCCGCTG AGTATGACGT GTACGCCTAC GTCATTCCAA GCGGGGAGGC GGCCAAGTCA TTTGATCATT ATTACGCTTG CCAGACAGCG GCATTGCAGT GCGGCCTCGA CCGCCGTTCG GTTATCATTG CGCTTGGCGG CGGCGTTGTC GGCGATTTGG CTGGGTTTGT CGCCGCCACC TATATGCGCG GCATCCGTTA CATTCAAATG CCGACGACAC TCTTGGCCCA TGACAGCGCC GTCGGCGGCA AAGTCGCCAT CAACCATCCG CTCGGCAAAA ATATGATCGG CGCCTTCCAT CAGCCGGAAG CAGTTGTGTA TGACACCTCC TTTTTGCGCA CGCTGCCCGA GCGCGAGCTT CGGTCCGGGT TTGCCGAGGT GATCAAACAT GCGCTGATCC GCGACCGCCG CTTTTACGAC TGGCTGCGCG CGGAAATCAA GACGCTCGCC GACTTGCGCG GCGAGAAACT CGCCTATTGC ATTGAAAAGG GCATTGACAT TAAGGCGTCC GTCGTGCGTG AGGATGAAAA AGAAACCGGG GTGCGCGCCC ATTTGAATTT CGGCCATACG CTCGGCCATG CGCTGGAAAG CGAGTTAGGC TACGGCGCGC TCACTCACGG GGAGGCGGTT GCGGTTGGCA TGCTGTTTGC CGTCTTTGTC AGCGAACGGT TTTACGGCCG GTCGTTCGCT GAGCATCGAT TGGCCGACTG GTTCGCCGGA TACGGCTTCC CGGTGTCGCT GCCGACAACG GTTCAGACGC GCCGCCTGCT TGAGAAGATG AAAGGCGACA AAAAAGCGTA CGCCGGAACA GTGCGGATGG TGCTTCTCTG TGAGATCGGC GACGTGGAAG TGGTGGAACT CGAAGACGAC AACCTGCTCA CGTGGCTGGA CGAGTTTTCC AGACAGGGGG GAAAAGGATG A
|
Protein sequence | MIERTIETAT KRYPLLLGDG AARVLPSLLR SLSCPPGTKL FIVTDDTVAP LYLDEVRALL AAAEYDVYAY VIPSGEAAKS FDHYYACQTA ALQCGLDRRS VIIALGGGVV GDLAGFVAAT YMRGIRYIQM PTTLLAHDSA VGGKVAINHP LGKNMIGAFH QPEAVVYDTS FLRTLPEREL RSGFAEVIKH ALIRDRRFYD WLRAEIKTLA DLRGEKLAYC IEKGIDIKAS VVREDEKETG VRAHLNFGHT LGHALESELG YGALTHGEAV AVGMLFAVFV SERFYGRSFA EHRLADWFAG YGFPVSLPTT VQTRRLLEKM KGDKKAYAGT VRMVLLCEIG DVEVVELEDD NLLTWLDEFS RQGGKG
|
| |