Gene Information |
Locus tag | Bxe_C0994 |
Symbol | |
ID | 4010507 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia xenovorans LB400 |
Kingdom | Bacteria |
Replicon accession | NC_007953 |
Strand | + |
Start bp | 1029205 |
End bp | 1030569 |
Gene Length | 1365 bp |
Protein Length | 454 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637953597 |
Product | homogentisate 1,2-dioxygenase |
Protein accession | YP_556217 |
Protein GI | 91781010 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3508] Homogentisate 1,2-dioxygenase |
TIGRFAM ID | [TIGR01015] homogentisate 1,2-dioxygenase |
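The length fields above can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 1029205
END_BP = 1030569


def gene_length(start_bp, end_bp):
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(length_bp):
    """Residues encoded by an unspliced CDS whose last codon is a stop (assumed)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length)                  # 1365
    print(protein_length(length))  # 454
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.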
| |
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATTCGA AATTGAGCGT TGAACGCGAA GTCCGGGACA CGGCGACGCG CACGAGGCTG ACGAATGAGC TGACCTATCA AACGGGGTTC GGCAACGAGT TCTCAACTGA GGCAATCAGC GGCGCATTGC CGATTGGACA GAACAATCCG CAAAAACCCG CATTCGGACT GTACGCGGAA CAATTGTCGG GCACGTCATT TACCACGGCC CGCGCGACGA ATCTCAGAAC CTGGATGTAT CGGATACGAC CTTCGGTGCT GCAAGGGACG TATCGGCTTA TTCGCAAGGA CTCCAACGCT CTGGCGCCGC TGGACGTTGC ACCGCTGCCT GAAGCGCAGC GATGGAACCC GCAACCGCTG CCGGATACGC CGGTCGATTT CGTAGACGGC TTAAACACGC TCGCGATCAG CGGCGACCCC GCTACCCTTT CCGGCGCCGC CGTGCATTTG TATGCGGCAA CCATCGACAT GGACCGCAAG GCGTTCGTGA ATACCGATGG CGAGATGCTT ATAATCCCGC AGCAGGGAAC GCTATGCCTC ATTACCGAAT TAGGCATGCT AACCGCCAAA CCGCGCGAGA TCGCGGTGAT CCCGCGGGGC CTGAAGTTCG CGGTTCATTT GCCGGACGGC CCAAGCCGCG GCTATGTCTG TGAGAATCAT GGGAGTGCTT TCAGGCTGCC TGATCTCGGC CTGATCGGCG CGAGCGGGCT GGCCAACCGC CGCGATTTCC TGACTCCGGT CGCGCGATTC GAAGACAGTG ATGCCGCACA CGAACTCATT GCCAAGACAG GCGGCCAACT GTGGTCGACG ACGCTCGATC ATTCGCCATT CGATGTGGTC GCCTGGCATG GCAACGTCGC GCCTTACAAA TACGACCTCG ATCGTTTCCA GTCGATCGGC TCAATCTCAT TCGACCACCC CGATCCGTCC ATTTATACCG TGCTGACTTC GCCGTCCGAC ACGCCCGGCA CGGCGAACGC GGATTTCTGC GCGTTGACGA CGCGCTGGGC GGTGGCGGAA CACACTTTTC GTCCGCCTTA TTTTCACCGC AACGTAATGA GCGAATTCAT GGGCCTGATC TGCGGCGCGC ACGACGCCAA AGCAGGCGGG TTTGTTCCTG GCGGAGCAAG CATCCATAAC GCGTTCACCC CGCACGGGCC CGACGCCGAA ACCTACCGCC GCGCGCGCGA TACCACTCTC GAACCTCACA AGATCACGGA ATCGATCGCG TTCATGTTTG AAACGCGGCT GCCCCTGCGC CTGACCGCCT GGGCCGTGCA AACGCCACAA CGACAACACG ACTACCAGCA ATGCTGGTCG GCGCTCGCTA AACAGTTCGA TGGAGACCGC CATGCAACTG AATGA
|
Protein sequence | MNSKLSVERE VRDTATRTRL TNELTYQTGF GNEFSTEAIS GALPIGQNNP QKPAFGLYAE QLSGTSFTTA RATNLRTWMY RIRPSVLQGT YRLIRKDSNA LAPLDVAPLP EAQRWNPQPL PDTPVDFVDG LNTLAISGDP ATLSGAAVHL YAATIDMDRK AFVNTDGEML IIPQQGTLCL ITELGMLTAK PREIAVIPRG LKFAVHLPDG PSRGYVCENH GSAFRLPDLG LIGASGLANR RDFLTPVARF EDSDAAHELI AKTGGQLWST TLDHSPFDVV AWHGNVAPYK YDLDRFQSIG SISFDHPDPS IYTVLTSPSD TPGTANADFC ALTTRWAVAE HTFRPPYFHR NVMSEFMGLI CGAHDAKAGG FVPGGASIHN AFTPHGPDAE TYRRARDTTL EPHKITESIA FMFETRLPLR LTAWAVQTPQ RQHDYQQCWS ALAKQFDGDR HATE
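The relationship between the two sequences above can be illustrated with a minimal translation sketch. It assumes the gene sequence is given 5' to 3' on the coding strand and uses the standard codon-to-amino-acid assignments, which translation table 11 shares; the special handling of alternative start codons in table 11 is not implemented, and is not needed here because the gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
# Standard codon assignments, with bases ordered T, C, A, G.
# Translation table 11 uses the same assignments; it differs only
# in which codons may act as start codons (not handled here).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":  # stop codon: end of the protein
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence in the record.
    print(translate("ATGAATTCGA AATTGAGCGT TGAACGCGAA"))  # MNSKLSVERE
```

Applied to the first 30 bases of the gene sequence, the sketch yields the first ten residues of the listed protein sequence.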