Gene B21_03626 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_03626 
SymbolhemY 
ID8113560 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp3875009 
End bp3876205 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content55% 
IMG OID644849790 
Producthypothetical protein 
Protein accessionYP_003001363 
Protein GI251787059 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG3071] Uncharacterized enzyme of heme biosynthesis 
TIGRFAM ID[TIGR00540] hemY protein 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.274284 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTAAAAG TGTTATTGCT CTTTGTTTTG CTGATTGCGG GGATCGTGGT TGGCCCGATG 
ATTGCCGGTC ATCAGGGGTA CGTATTGATT CAGACCGACA ACTACAATAT CGAAACCAGC
GTCACTGGCC TGGCGATTAT TTTGATTCTG GCGATGGTAG TGCTGTTTGC CATTGAGTGG
CTACTGCGGC GGATCTTCCG CACAGGCGCA CACACCCGTG GGTGGTTTGT CGGACGTAAG
CGTCGCCGTG CCCGTAAGCA GACCGAACAG GCGCTGCTGA AACTAGCGGA AGGCGATTAT
CAGCAAGTTG AAAAGCTGAT GGCGAAAAAT GCCGATCACG CTGAACAACC GGTGGTGAAC
TATCTGCTGG CTGCCGAAGC CGCGCAACAA CGCGGTGATG AAGCACGCGC CAACCAACAT
CTGGAACGCG CAGCGGAGCT GGCCGGCAAC GACACAATTC CGGTAGAAAT CACCCGTGTA
CGTCTGCAAC TGGCCCGTAA TGAAAACCAT GCTGCACGCC ACGGCGTGGA TAAGCTGCTG
GAAGTTACGC CACGCCATCC GGAAGTATTA CGTCTGGCGG AACAGGCGTA TATCCGCACT
GGTGCATGGA GTTCGCTGCT GGATATCATC CCATCAATGG CGAAAGCCCA CGTTGGCGAT
GAAGAACATC GTGCAATGCT GGAACAACAG GCATGGATTG GCCTGATGGA TCAGGCGCGT
GCCGATAACG GTAGCGAAGG CTTGCGTAAC TGGTGGAAAA ACCAAAGCCG GAAAACGCGT
CATCAGGTAG CGTTGCAGGT GGCAATGGCG GAACATCTTA TTGAATGTGA CGATCATGAT
ACTGCCCAGC AAATTATCAT CGACGGCCTG AAACGCCAGT ATGACGATCG CCTACTGCTG
CCGATCCCGC GTCTGAAAAC CAATAACCCG GAGCAGCTGG AAAAAGTGCT GCGCCAACAG
ATCAAAAACG TGGGCGATCG CCCGTTGTTG TGGAGCACAC TGGGTCAGTC GCTGATGAAG
CACGGCGAAT GGCAGGAAGC ATCGCTCGCC TTCCGCGCGG CGCTGAAACA ACGTCCGGAC
GCCTACGATT ACGCATGGCT TGCCGACGCG CTGGACAGAC TGCACAAACC AGAAGAAGCT
GCAGCGATGC GCCGCGACGG TTTGATGTTA ACGTTGCAGA ACAACCCGCC ACAGTAG
 
Protein sequence
MLKVLLLFVL LIAGIVVGPM IAGHQGYVLI QTDNYNIETS VTGLAIILIL AMVVLFAIEW 
LLRRIFRTGA HTRGWFVGRK RRRARKQTEQ ALLKLAEGDY QQVEKLMAKN ADHAEQPVVN
YLLAAEAAQQ RGDEARANQH LERAAELAGN DTIPVEITRV RLQLARNENH AARHGVDKLL
EVTPRHPEVL RLAEQAYIRT GAWSSLLDII PSMAKAHVGD EEHRAMLEQQ AWIGLMDQAR
ADNGSEGLRN WWKNQSRKTR HQVALQVAMA EHLIECDDHD TAQQIIIDGL KRQYDDRLLL
PIPRLKTNNP EQLEKVLRQQ IKNVGDRPLL WSTLGQSLMK HGEWQEASLA FRAALKQRPD
AYDYAWLADA LDRLHKPEEA AAMRRDGLML TLQNNPPQ