Gene Information |
Locus tag | BBta_3528 |
Symbol | hpcB |
ID | 5151207 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bradyrhizobium sp. BTAi1 |
Kingdom | Bacteria |
Replicon accession | NC_009485 |
Strand | + |
Start bp | 3680921 |
End bp | 3681838 |
Gene Length | 918 bp |
Protein Length | 305 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640558380 |
Product | 3,4-dihydroxyphenylacetate 2,3-dioxygenase |
Protein accession | YP_001239527 |
Protein GI | 148254942 |
COG category | [S] Function unknown |
COG ID | [COG3384] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR02298] 3,4-dihydroxyphenylacetate 2,3-dioxygenase |
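
The coordinate and length fields above can be cross-checked against each other. The sketch below is a minimal illustration, not part of the original record: it assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3680921
end_bp = 3681838
gene_length_bp = 918
protein_length_aa = 305


def span_length(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons minus one terminal stop codon (assumed present)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = expected_protein_length(computed_length)

print("length matches record:", computed_length == gene_length_bp)
print("protein length matches record:", computed_protein == protein_length_aa)
```

Under these assumptions both checks agree with the record: 3681838 - 3680921 + 1 = 918 bp, and 918 / 3 - 1 = 305 aa.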
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0958832 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCAAGC TCGCACTGGC GGCGAAGATC ACACACGTGC CGTCGATGTA TCTCAGCGAG CTCGATGGTC CGGCCAAGGG CAAGCGCCAG TCGGCGATCG ATGGCCATAT CGAGATCGGC CGGCGTTGCC GCGAGGCGGG CGTCGACACG ATCGTCGTGT TCGACACCCA CTGGCTGGTC AATGCGGGCT ACCACATCAA CTGCGCGCCG CATTTCCAGG GACTCTACAC CTCCAACGAG CTGCCGCACT TCATTGCCAA CATGACCTAT GAGTATGACG GCAACCCGGA GCTTGGCCGC ATCCTGGCCG AGGGGGCCAA TGCCCATGGC ATCGCCACCA TGGCCCATGC GGCCACGTCG CTGGCGCTCG AATACGGGAC CCTGGTGCCG ATGCGCTACA TGAATGCCGA TCGCCATTTC AAGGTGGTGT CGGTATCGGC GCTCTGCACC TCGCATTACC TCGCCGACAG CGCCAAGCTC GGCTGGGCAT TCCGCCGTGC GGTCGAGGAT CATTACGATG GCACGGTGGC CTTCTTCGCG TCGGGCTCCC TGAGCCACCG CTTCGCGCAG AACGGCACCG CCGACGAATA TCGCGACAAG ATGTTCAGCC CGTTCCTCGA GCGTCTCGAC CACGAGGTGA TCCAGATGTG GGAGCAGGGT GAATGGCCCG ACTTCTGCGA CATGCTGCCG GAATATGCCA GCAAGGGTCA TGGCGAGGGC TTCATGCACG ACACCGCGAT GCTGCTCGGT GCGCTCGGCT GGAACCATTA CGACCGCGGC GTCGAGGTGG TCACGCCGTT CTTTCCCTCG TCGGGGACCG GTCAGATCAA CGCGGTGTTT CCGGTGTCGC CGATGCCGCA GGCGCTCGCG GACATCGGCC GGCCGCGCGC CGCGGCGCTT GCGGGGGGAC GGCTGTAG
|
Protein sequence | MGKLALAAKI THVPSMYLSE LDGPAKGKRQ SAIDGHIEIG RRCREAGVDT IVVFDTHWLV NAGYHINCAP HFQGLYTSNE LPHFIANMTY EYDGNPELGR ILAEGANAHG IATMAHAATS LALEYGTLVP MRYMNADRHF KVVSVSALCT SHYLADSAKL GWAFRRAVED HYDGTVAFFA SGSLSHRFAQ NGTADEYRDK MFSPFLERLD HEVIQMWEQG EWPDFCDMLP EYASKGHGEG FMHDTAMLLG ALGWNHYDRG VEVVTPFFPS SGTGQINAVF PVSPMPQALA DIGRPRAAAL AGGRL
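
The gene and protein sequences above can be checked against each other by translating the nucleotide sequence. The sketch below is a minimal illustration and not the tool that produced this record. It builds the codon table from the compact NCBI-style amino-acid string; translation table 11 uses the same amino-acid assignments as the standard code and differs in its permitted start codons, which this sketch does not handle (the gene here starts with ATG, so that does not matter for this record). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical. Only the first three and last two 10-base blocks are included for brevity; the full Gene sequence field can be passed in the same way.

```python
# Codon table built from the compact NCBI-style layout (base order T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate in frame 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 and last 18 bases of the Gene sequence field in this record.
first_blocks = "ATGGGCAAGC TCGCACTGGC GGCGAAGATC"
last_blocks = "GCGGGGGGAC GGCTGTAG"

print(translate(first_blocks))  # MGKLALAAKI
print(translate(last_blocks))   # AGGRL
```

The two printed fragments correspond to the first ten and last five residues of the Protein sequence field. Calling `gc_fraction` on the full gene sequence gives a value that can be compared with the GC content field.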