Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BTH_I0874 |
Symbol | |
ID | 3846980 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007651 |
Strand | + |
Start bp | 998117 |
End bp | 999307 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 637840547 |
Product | HemY protein N-terminus family protein |
Protein accession | YP_441430 |
Protein GI | 83718445 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG3071] Uncharacterized enzyme of heme biosynthesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGCTGC GTGGAATCAT TTGGCTCGCC GTGCTGTTCG CGATCGCCGC GGCGCTCGCG ACGGTCGGGC GCTTCGATGC CGGCCAGGTG CTGATCGTCT ATCCGCCGTA TCGCATCGAC GTGTCGCTGA ACTTCTTCGT GCTCGCGATC ATCGTCGCGT TCATCGCGCT GTACGCGCTG ATGCGGATCG TGCGCAACGT GTGGCGGATG CCGCAGCGCG TGGCCGCGTA TCGCGCGCGG ATGCGCAACG AGCGCGCGCA TGCTTCGTTG CGCGATGCGC TCGCGAACCT CTACGCGGGC CGCTTCTCGC GCGCGGAGAA GGCCGCGCGC GATGCGCTCG CGGTCGACGC GAACCAGGCC GCGGCGAGCC TCATCGCCGC GGCCGCGACG CACCGGATGC ACGAGTACGC GCGGCGCGAC GAATGGCTCG CGAAGGTGAA CGGCCAGGAA TGGCAGGACG CGCGCCTGCT CGCGACGGCC GACATGCGCG CGGACGGGCG CGACGCCGAG GGCGCGCTCG CCGCGCTCGC CGAGATGCAG GCGTCGGGCG GCAAGCGGAT CCACGCGCAG CAGATCGCGC TGCGCGCGCA GCAGCAGAAC AAGAACTGGA GCGAGGTGCT GAAGATCGCG AAGGCGCTCG AAAAGCGCGA GGCGCTGCAC CCGGCCGCGG CCGTGCGGCT GCGCCAGCAA GCGGCCGAGC ATCTGCTGCG CGATCGCCGG CACGACGCGG ACGCGCTTCT CGAAGTGTGG CAGACGCTGT CGGCCACCGA GCGGCAGTCG CCGCGCCTCG CGGATCTCGC CGCCGAACTG CTGATCGCGC TCGAGCGCCG TCAGGAAGCG CGGCGGATCG TCGAGGACGC GCTCGCGCAC AACTGGAACG CGCGCCTGCT GCGCCGCTAT CCGGATACGG CGGGCGCTGA CGCGCTGCCG CTGATCCAGA AGGCCGAGGG CTGGCGACGC GAGCGTCCGG ACGACGCGGA CCTGCTGTTC GCGCTCGGCC GCTTGTGCCA GCAGCAGCAA CTGTGGGGCA AGGCGCAATC GTTCCTCGAA TCGGCGCTGA AGCTTGCCGA CGACGAGCCG CTCAAGATTC GCGCGCATCG CGCGCTCGCG CGCCTGTTCG AGCACCTCGG CGAAACCGAC AAGGCCGCGC AGCACTATCG CGAAAGCGCG CTCGCGATCA CGGTCGTGTG A
|
Protein sequence | MTLRGIIWLA VLFAIAAALA TVGRFDAGQV LIVYPPYRID VSLNFFVLAI IVAFIALYAL MRIVRNVWRM PQRVAAYRAR MRNERAHASL RDALANLYAG RFSRAEKAAR DALAVDANQA AASLIAAAAT HRMHEYARRD EWLAKVNGQE WQDARLLATA DMRADGRDAE GALAALAEMQ ASGGKRIHAQ QIALRAQQQN KNWSEVLKIA KALEKREALH PAAAVRLRQQ AAEHLLRDRR HDADALLEVW QTLSATERQS PRLADLAAEL LIALERRQEA RRIVEDALAH NWNARLLRRY PDTAGADALP LIQKAEGWRR ERPDDADLLF ALGRLCQQQQ LWGKAQSFLE SALKLADDEP LKIRAHRALA RLFEHLGETD KAAQHYRESA LAITVV
|
| |