Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | B21_03627 |
Symbol | hemX |
ID | 8116321 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | - |
Start bp | 3876208 |
End bp | 3877413 |
Gene Length | 1206 bp |
Protein Length | 401 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 644849791 |
Product | hypothetical protein |
Protein accession | YP_003001364 |
Protein GI | 251787060 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG2959] Uncharacterized enzyme of heme biosynthesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.234561 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGGAAC AAGAAAAAAC CTCCGCCGTG GTTGAAGAGA CCAGGGAGGC CGTGGACACC ACGTCACAAC CTGTCGCAAC AGAAAAAAAG AGTAAGAACA ATACCGCATT GATTCTCAGC GCGGTGGCTA TCGCTATTGC TCTGGCGGCG GGCGTCGGTT TGTATGGCTG GGGTAAACAA CAGGCCGTCA ATCAGACTGC CACCAGCGAT GCCCTGGCTA ATCAACTGAC GGCACTGCAA AAAGCCCAGG AGAGCCAAAA AGCTGAGCTG GAAGGCATTA TCAAGCAACA AGCTGTACAG CTTGAGCAGG CGAATCGTCA GCAAGAAACG CTGGCAAAAC AGCTGGATGA AGTCCAACAA AAGGTCGCGA CTATTTCCGG CAGCGATGCT AAAACCTGGC TGCTGGCTCA GGCTGATTTC CTGGTGAAAC TCGCCGGACG GAAGCTGTGG AGCGATCAGG ACGTCACGAC CGCTGCAGCG TTGCTGAAAA GTGCAGACGC CAGCCTGGCG GATATGAATG ACCCGAGTCT GATTACCGTT CGCCGGGCAA TTACCGATGA TATCGCCAGC CTTTCTGCGG TATCGCAGGT GGATTATGAC GGCATTATCC TTAAGCTTAA TCAGCTTTCA AATCAGGTAG ATAACCTGCG TCTGGCTGAT AATGACAGCG ATGGTTCGCC GATGGATTCC GACGGTGAAG AGCTTTCCAG TTCCATCAGC GAATGGCGTA TCAATCTGCA AAAAAGCTGG CAGAACTTTA TGGACAACTT CATTACGATT CGCCGTCGTG ATGACACCGC CGTACCGCTG TTAGCGCCAA ATCAGGATAT CTATCTGCGC GAAAATATTC GCTCTCGCCT GCTGGTCGCT GCACAAGCTG TACCGCGTCA TCAGGAAGAG ACTTATCGCC AGGCGCTGGA GAACGTCTCC ACCTGGGTAC GTGCTTACTA CGATACTGAT GATGCCACCA CCAAAGCGTT CCTCGACGAG GTGGACCAGT TAAGCCAGCA AAATATCTCG ATGGATCTCC CGGAAACCCT GCAAAGCCAG GCGATGCTGG AAAAACTGAT GCAGACTCGC GTGCGTAACC TGCTGGCACA ACCGGCAGCG GGGGCAACGG AAGCTAAACC TGCACCTGCA CCTGCACCTG CACCTGCACC TGCACCGCAA GCTGATACTC CGGCAGCCGC GCCGCAAGGA GAATAA
|
Protein sequence | MTEQEKTSAV VEETREAVDT TSQPVATEKK SKNNTALILS AVAIAIALAA GVGLYGWGKQ QAVNQTATSD ALANQLTALQ KAQESQKAEL EGIIKQQAVQ LEQANRQQET LAKQLDEVQQ KVATISGSDA KTWLLAQADF LVKLAGRKLW SDQDVTTAAA LLKSADASLA DMNDPSLITV RRAITDDIAS LSAVSQVDYD GIILKLNQLS NQVDNLRLAD NDSDGSPMDS DGEELSSSIS EWRINLQKSW QNFMDNFITI RRRDDTAVPL LAPNQDIYLR ENIRSRLLVA AQAVPRHQEE TYRQALENVS TWVRAYYDTD DATTKAFLDE VDQLSQQNIS MDLPETLQSQ AMLEKLMQTR VRNLLAQPAA GATEAKPAPA PAPAPAPAPQ ADTPAAAPQG E
|
| |