Gene Information |
Locus tag | SbBS512_E4261 |
Symbol | hemY |
ID | 6270103 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 3983409 |
End bp | 3984605 |
Gene Length | 1197 bp |
Protein Length | 398 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641728075 |
Product | putative protoheme IX biogenesis protein |
Protein accession | YP_001882495 |
Protein GI | 187730910 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG3071] Uncharacterized enzyme of heme biosynthesis |
TIGRFAM ID | [TIGR00540] hemY protein |
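The coordinate and length fields above are consistent with one another, which a short calculation can confirm. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon of the gene is a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical and are not part of the source database.

```python
# Values copied from the Gene Information table above.
START_BP = 3983409
END_BP = 3984605


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the last codon is a stop codon (assumption)."""
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length_bp = gene_length(START_BP, END_BP)
    print("Gene length (bp):", length_bp)
    print("Protein length (aa):", protein_length(length_bp))
```

Under these assumptions the calculation gives 1197 bp and 398 aa, matching the Gene Length and Protein Length fields in the record.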
Plasmid Coverage information |
Num covering plasmid clones | 40 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGCTAAAAG TGTTATTACT CTTTGTTTTG CTGATTGCGG GGATAGTGGT TGGCCCGATG ATTGCCGGTC ATCAGGGTTA TGTGCTGATC CAGACCGACA ACTACAATAT CGAAACCAGC GTCACGGGCC TGGCGATCAT ATTGATTCTG GCGATGGTAG TGCTGTTTGC CATTGAGTGG CTACTGCGGC GGATCTTCCG CACTGGCGCG CACACCCGTG GGTGGTTTGT CGGGCGTAAG CGTCGCCGTG CACGTAAGCA GACCGAACAG GCGCTGCTGA AACTGGCGGA AGGCGATTAT CAGCAAGTTG AAAAGCTGAT GGCGAAAAAT GCCGATCACG CTGAACAACC GGTGGTGAAC TATCTACTGG CTGCCGAAGC CGCGCAACAA CGTGGTGATG AAGCGCGCGC CAACCAACAT CTGGAACGCG CAGCGGAGCT GGCCGGCAAT GACACCATTC CGGTAGAAAT CACCCGTGTA CGTCTGCAAC TGGCCCGTAA TGAAAACCAT GCTGCACGCC ACGGCGTGGA TAAGCTGCTG GAAGTTACGC CACGCCATCC GGAAGTATTA CGTCTGGCGG AACAGGCGTA TATCCGCACA GGTGCATGGA GTTCTCTGCT GGATATCATC CCATCAATGG CGAAAGCCCA TGTTGGTGAT GAAGAACATC GTGCAATGCT GGAACAACAG GCATGGATTG GCCTGATGGA TCAGGCGCGT GCCGATAACG GTAGTGAAGG TTTGCGTAAC TGGTGGAAAA ACCAAAGCCG GAAAACGCGT CATCAAGTAG CGTTGCAGGT GGCAATGGCG GAACATCTTA TTGAATGTGA CGATCATGAT ACTGCCCAGC AAATTATCAT CGATGGCCTG AAACGCCAGT ACGACGATCG CCTACTGCTG CCGATTCCTC GACTGAAAAC CAACAATCCG GAACAGCTGG AAAAAGTGCT GCGCCAGCAG ATCAAAAACG TCGGCGATCG CCCGCTGTTG TGGAGCACAC TGGGCCAGTC ACTGATGAAG CACGGAGAAT GGCAGGAAGC ATCGCTCGCC TTCCGCGCAG CGCTGAAACA ACGTCCGGAC GCCTACGATT ACGCATGGCT TGCCGACGCG CTGGACAGAC TGCACAAGCC GGAAGAAGCT GCAGCGATGC GTCGCGACGG TTTGATGTTA ACGTTGCAGA ACAACCCACC ACAGTAG
Protein sequence | MLKVLLLFVL LIAGIVVGPM IAGHQGYVLI QTDNYNIETS VTGLAIILIL AMVVLFAIEW LLRRIFRTGA HTRGWFVGRK RRRARKQTEQ ALLKLAEGDY QQVEKLMAKN ADHAEQPVVN YLLAAEAAQQ RGDEARANQH LERAAELAGN DTIPVEITRV RLQLARNENH AARHGVDKLL EVTPRHPEVL RLAEQAYIRT GAWSSLLDII PSMAKAHVGD EEHRAMLEQQ AWIGLMDQAR ADNGSEGLRN WWKNQSRKTR HQVALQVAMA EHLIECDDHD TAQQIIIDGL KRQYDDRLLL PIPRLKTNNP EQLEKVLRQQ IKNVGDRPLL WSTLGQSLMK HGEWQEASLA FRAALKQRPD AYDYAWLADA LDRLHKPEEA AAMRRDGLML TLQNNPPQ
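The gene and protein sequences above are listed in space-separated blocks of ten characters. As a minimal sketch, the Python below shows one way to strip that formatting, translate the nucleotide sequence, and compute GC content so the result can be compared with the Protein sequence and GC content fields. It assumes the gene sequence is already given in coding orientation (it starts with ATG and ends with TAG, even though the record lists the minus strand), and it uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard genetic code. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
# Codon table built in the usual TCAG ordering. The amino-acid assignments
# of translation table 11 are the same as those of the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between ten-character blocks and uppercase."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    # First two blocks of the gene sequence from the record above.
    print(translate("ATGCTAAAAG TGTTATTACT"))
```

Passing the full Gene sequence string to `translate` and `gc_percent` allows a direct comparison with the Protein sequence and the reported GC content; the first two blocks alone translate to MLKVLL, which matches the start of the listed protein.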