Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1106A_1080 |
Symbol | |
ID | 4903062 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009076 |
Strand | + |
Start bp | 1061631 |
End bp | 1062821 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640134309 |
Product | putative hemY protein |
Protein accession | YP_001065359 |
Protein GI | 126451474 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG3071] Uncharacterized enzyme of heme biosynthesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGCTGC GTGGAATCAT TTGGCTCGCC GTGCTGTTCG CGATCGCCGC GGCGCTCGCG ACGGTCGGAC GCTTCGATAC CGGCCAGGTG CTGATCGTCT ATCCGCCGTA TCGCATCGAC GTGTCGCTGA ACTTCTTCGT GCTCGCGATC ATCGTCGCGT TCATCGTGCT GTACGCACTG ATGCGGATCG TGCGCAACGT CTGGCGGATG CCGCAGCGCG TGGCCGCGTA TCGCGCGCGG ATGCGCAACG AGCGTGCGCA GGCCGCGTTG CGCGACGCGC TCGCGAACCT GTACGCGGGC CGCTTCTCGC GCGCGGAGAA AGCCGCGCGC GACGCGCTCG CGGTCGACGC GAACCAGTCG GCCGCGAGCC TCGTCGCCGC GGCCGCGACG CACCGGATGC ACGAGTATGC GCGGCGCGAC GAGTGGCTCG CGAAGGTGAG CGGGCAGGAA TGGCAGGACG CGCGCCTGCT CGCGACGGCC GACATGCGCG CGGACGGCCG CGACGCGGAG GGCGCGCTCG CCGCGCTCGC CGAGATGCAG GCGTCGGGCG GCAAGCGGAT TCACGCGCAG CAGATCGCGC TGCGCGCGCA GCAGCAGAAC AAGAACTGGG CCGAGGTGCT GAAGATCGCG AAGGCGCTCG AAAAGCGCGA GGCGCTGCAT CCCGCGGCGG CCGTGCGCCT GCGCCAGCAG GCCGCCGAGC ATTTGCTGCG CGATCGCCGG CACGACGCCG ATGCGCTGCT CGAGGTGTGG CAGTCGCTGT CGGCCGCCGA GCGGCAGTCG CCGCGCCTCG CGGATCTCGC CGCCGAGCTG CTGATCGCGC TCGAGCGCCG GCAGGAAGCG CGGCGCATCG TCGAGGACGC GCTCGCGCAC AACTGGAACG CGCGTCTGCT GCGCCGCTAT CCGGATACGG CGGGTGCCGA CGCGCTGCCG CTGATCCAGA AGGCCGAGGG CTGGCGTCGC GAGCGGCCGG ACGACGCGGA CCTGCTGTTC GCGCTCGGCC GCCTGTGCCA GCAGCAGCAA CTGTGGGGCA AGGCGCAGTC GTTCCTCGAA TCGGCGCTGA AGCTGGCCGA CGACGAGCCG CTCAGGATTC GCGCGCATCG TGCGCTCGCG CGCCTGTTCG AGCATCTGGG CGAGACCGAC AAGGCCGCGC AGCACTATCG CGAAAGCGCG TTGGCGATCA CGGTCGTGTG A
|
Protein sequence | MTLRGIIWLA VLFAIAAALA TVGRFDTGQV LIVYPPYRID VSLNFFVLAI IVAFIVLYAL MRIVRNVWRM PQRVAAYRAR MRNERAQAAL RDALANLYAG RFSRAEKAAR DALAVDANQS AASLVAAAAT HRMHEYARRD EWLAKVSGQE WQDARLLATA DMRADGRDAE GALAALAEMQ ASGGKRIHAQ QIALRAQQQN KNWAEVLKIA KALEKREALH PAAAVRLRQQ AAEHLLRDRR HDADALLEVW QSLSAAERQS PRLADLAAEL LIALERRQEA RRIVEDALAH NWNARLLRRY PDTAGADALP LIQKAEGWRR ERPDDADLLF ALGRLCQQQQ LWGKAQSFLE SALKLADDEP LRIRAHRALA RLFEHLGETD KAAQHYRESA LAITVV
|
| |