Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BAS0228 |
Symbol | |
ID | 2849863 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus anthracis str. Sterne |
Kingdom | Bacteria |
Replicon accession | NC_005945 |
Strand | + |
Start bp | 228946 |
End bp | 230118 |
Gene Length | 1173 bp |
Protein Length | 390 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 637503433 |
Product | homogentisate 1,2-dioxygenase |
Protein accession | YP_026513 |
Protein GI | 49183261 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3508] Homogentisate 1,2-dioxygenase |
TIGRFAM ID | [TIGR01015] homogentisate 1,2-dioxygenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTTTATC GTCACATGGG GGAGCTACCT CATAAACGAC ATGTACAATT CCGTAAAAAA GATGGATCGC TTTATCGTGA ACAGGTAATG GGAACAAAAG GTTTTTCTGG TACGCAATCT ATTTTGTATC ATCATTATAT GCCAACGGAA GTAGGGCATG CAGCATTATC ACATTCTTGT CAGTTGCAGT ATGAAGAAGA TGTTGTTCTT TCTCATCGTC ACTTTCGCAC GAAAGAAAAT AAAAAAAGTG GTGATGCGGT AAGTGGCAGA AACTTTATAC TTGGAAATGA AGATTTATTA ATCGGAGTAG TGACTCCGAC AGAAAAAATG AATTATTTCT ACCGTAATGG TGATGGCGAT GAAATGTTGT TTGTCCATTA CGGAACAGGA AAAATTGAAA CAATGTTCGG AACGATTCAC TATCGAAAAG GTGATTATGT AACAATCCCA ATTGGAACGA TTTATCGTGT TATTCCAGAT GAAGGAGAGA CTAAGTTTCT TGTTGTAGAG GCAAATAGTC AAATTACAAC ACCGCGTCGC TATCGTAATG AATATGGACA ATTGTTAGAG CATAGCCCGT TTTGTGAGAG AGATATTCGT GGCCCGGAAA AATTAGAGAC GTATGATGAA AAAGGTGAGT TTGTCGTAAT GACAAAGTCA CGAGGATATA TGCATAAACA TGTTTTAGGA CACCATCCGT TAGATGTAGT TGGATGGGAT GGTTATTTAT ATCCCTGGGT CTTTAATGTA GAGGATTTTG AGCCAATCAC AGGTCGTATT CATCAGCCAC CTCCAGTACA TCAAACGTTC GAGGGTCACA ATTTCGTTAT TTGTTCTTTC GTACCACGTT TATATGACTA TCATCCAGAA TCTATTCCGG CACCGTATTA TCATAGTAAC GTGAATAGCG ATGAAGTACT GTACTATGTA GAAGGTAACT TTATGAGCCG AAAAGGTGTG GAGGAAGGGT CTATTACACT TCATCCGAGC GGCATTCCTC ACGGTCCACA TCCTGGGAAA ACAGAGGCGA GTATAGGGAA AAAAGAAACG CTTGAATTAG CTGTTATGAT AGATACATTC CGTCCGCTTC GTATTGTAAA ACAAGCACAT GAAACAGAAG ATGAAAAATA TATGTATAGC TGGATTGAAG AGGGATCATA TACTGTGAAA TAA
|
Protein sequence | MFYRHMGELP HKRHVQFRKK DGSLYREQVM GTKGFSGTQS ILYHHYMPTE VGHAALSHSC QLQYEEDVVL SHRHFRTKEN KKSGDAVSGR NFILGNEDLL IGVVTPTEKM NYFYRNGDGD EMLFVHYGTG KIETMFGTIH YRKGDYVTIP IGTIYRVIPD EGETKFLVVE ANSQITTPRR YRNEYGQLLE HSPFCERDIR GPEKLETYDE KGEFVVMTKS RGYMHKHVLG HHPLDVVGWD GYLYPWVFNV EDFEPITGRI HQPPPVHQTF EGHNFVICSF VPRLYDYHPE SIPAPYYHSN VNSDEVLYYV EGNFMSRKGV EEGSITLHPS GIPHGPHPGK TEASIGKKET LELAVMIDTF RPLRIVKQAH ETEDEKYMYS WIEEGSYTVK
|
| |