Gene EcSMS35_4167 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4167 
SymbolhemY 
ID6145480 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4266317 
End bp4267513 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content56% 
IMG OID641618990 
Productputative protoheme IX biogenesis protein 
Protein accessionYP_001746118 
Protein GI170682558 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG3071] Uncharacterized enzyme of heme biosynthesis 
TIGRFAM ID[TIGR00540] hemY protein 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.130048 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value0.307244 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTAAAAG TGTTATTGCT CTTTGTTTTG CTGATTGCGG GGATCGTGGT TGGCCCGATG 
ATTGCCGGTC ATCAGGGGTA CGTATTGATT CAGACCGACA ACTACAATAT CGAAACCAGC
GTCACTGGCC TCGCGATTAT TTTGATTCTG GCGATGGTAG TGCTGTTTGC CATTGAGTGG
CTACTGCGGC GGATCTTCCG CACAGGCGCG CACACCCGTG GGTGGTTTGT CGGACGTAAG
CGTCGCCGTG CCCGTAAGCA GACCGAACAG GCGCTGCTGA AACTAGCGGA AGGCGATTAT
CAGCAAGTTG AAAAGCTGAT GGCGAAAAAT GCCGATCACG CTGAACAACC GGTGGTGAAC
TATCTGCTGG CTGCCGAAGC CGCGCAACAA CGCGGTGATG AAGCACGCGC CAACCAACAT
CTGGAACGCG CAGCGGAGCT GGCCGGCAAC GACACAATTC CGGTAGAAAT CACCCGTGTA
CGTCTGCAAC TGGCCCGTAA TGAAAACCAT GCTGCACGCC ACGGCGTGGA TAAGCTGCTG
GAAGTTACGC CACGCCATCC GGAAGTATTA CGTCTGGCGG AACAGGCGTA TATCCGCACT
GGTGCATGGA GTTCGCTGCT GGATATCATC CCATCAATGG CGAAAGCCCA CGTTGGCGAT
GAAGAACATC GTGCAATGCT GGAACAACAG GCATGGATTG GCCTGATGGA TCAGGCGCGT
GCCGATAACG GTAGCGAAGG TTTGCGTAAC TGGTGGAAAA ACCAAAGCCG GAAAACGCGT
CATCAGGTGG CGTTACAGGT GGCAATGGCG GAACACCTTA TTGAGTGTGA TGACCATGAC
ACCGCCCAGC AAATTATCAT CGACGGCCTG AAACGCCAGT ATGACGATCG CCTGCTGCTG
CCGATCCCGC GTCTGAAAAC CAATAACCCG GAACAGCTGG AAAAAGTGCT GCGCCAACAG
ATCAAAAACG TGGGCGATCG CCCGTTGTTG TGGAGCACAC TGGGTCAGTC GCTGATGAAG
CACGGCGAAT GGCAGGAAGC ATCGCTCGCC TTCCGCGCGG CGCTGAAACA ACGTCCGGAC
GCCTACGATT ACGCATGGCT TGCCGACGCG CTGGACAGAC TGCACAAACC GGAAGAAGCC
GCAGCGATGC GTCGCGATGG CTTGATGCTG ACCTTACAGA ATAACCCGTC ACAGTAG
 
Protein sequence
MLKVLLLFVL LIAGIVVGPM IAGHQGYVLI QTDNYNIETS VTGLAIILIL AMVVLFAIEW 
LLRRIFRTGA HTRGWFVGRK RRRARKQTEQ ALLKLAEGDY QQVEKLMAKN ADHAEQPVVN
YLLAAEAAQQ RGDEARANQH LERAAELAGN DTIPVEITRV RLQLARNENH AARHGVDKLL
EVTPRHPEVL RLAEQAYIRT GAWSSLLDII PSMAKAHVGD EEHRAMLEQQ AWIGLMDQAR
ADNGSEGLRN WWKNQSRKTR HQVALQVAMA EHLIECDDHD TAQQIIIDGL KRQYDDRLLL
PIPRLKTNNP EQLEKVLRQQ IKNVGDRPLL WSTLGQSLMK HGEWQEASLA FRAALKQRPD
AYDYAWLADA LDRLHKPEEA AAMRRDGLML TLQNNPSQ