Gene EcE24377A_4317 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcE24377A_4317 
SymbolhemY 
ID5585971 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli E24377A 
KingdomBacteria 
Replicon accessionNC_009801 
Strand
Start bp4306958 
End bp4308154 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content55% 
IMG OID640927934 
Productputative protoheme IX biogenesis protein 
Protein accessionYP_001465283 
Protein GI157157643 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG3071] Uncharacterized enzyme of heme biosynthesis 
TIGRFAM ID[TIGR00540] hemY protein 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.00516083 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTAAAAG TGTTATTACT CTTTGTTTTG CTGATTGCGG GGATAGTGGT TGGCCCGATG 
ATTGCCGGTC ATCAGGGTTA CGTGCTGATC CAGACTGACA ACTACAATAT CGAAACCAGC
GTCACGGGCC TGGCGATTAT TTTGATTCTG GCGATGGTAG TGCTGTTTGC CATTGAGTGG
CTACTGCGGC GGATCTTCCG CACAGGCGCA CACACCCGTG GGTGGTTTGT CGGACGTAAG
CGTCGCCGTG CCCGTAAGCA GACCGAACAG GCGCTGCTGA AACTGGCGGA AGGCGATTAT
CAGCAAGTTG AAAAGCTGAT GGCGAAAAAT GCCGATCACG CGGAACAACC GGTGGTGAAC
TATCTACTGG CTGCCGAAGC CGCGCAACAA CGTGGTGATG AAGCACGCGC CAACCAACAT
CTGGAACGCG CAGCAGAGCT GGCCGGCAAT GACACCATTC CGGTAGAAAT CACCCGTGTA
CGTCTGCAAC TGGCCCGTAA TGAAAACCAT GCTGCACGCC ACGGCGTGGA TAAGCTGCTG
GAAGTTACGC CACGCCATCC GGAAGTATTA CGTCTGGCGG AACAGGCGTA TATCCGCACA
GGTGCATGGA GTTCTCTGCT GGATATCATC CCATCAATGG CGAAAGCCCA TGTTGGTGAT
GAAGAACATC GTGCAATGCT GGAACAACAG GCATGGATTG GCCTGATGGA TCAGGCGCGT
GCCGATAACG GTAGTGAAGG TTTGCGTAAC TGGTGGAAAA ACCAAAGCCG GAAAACGCGT
CATCAGGTAG CGTTGCAGGT GGCAATGGCG GAACATCTTA TTGAATGTGA CGATCATGAT
ACTGCCCAGC AAATTATCAT CGATGGCCTG AAACGCCAGT ACGACGATCG CCTACTGCTG
CCGATTCCTC GACTGAAAAC CAACAATCCG GAACAGCTGG AAAAAGTGCT GCGCCAGCAG
ATCAAAAACG TCGGCGATCG CCCGCTGTTG TGGAGCACAC TGGGCCAGTC ACTGATGAAG
CACGGAGAAT GGCAGGAAGC ATCGCTCGCC TTCCGCGCGG CGCTGAAACA ACGTCCGGAC
GCCTACGATT ACGCATGGCT TGCCGACGCG CTGGACAGAC TGCACAAGCC GGAAGAAGCT
GCAGCGATGC GTCGCGACGG TTTGATGTTA ACGTTGCAGA ACAACCCACC ACAGTAG
 
Protein sequence
MLKVLLLFVL LIAGIVVGPM IAGHQGYVLI QTDNYNIETS VTGLAIILIL AMVVLFAIEW 
LLRRIFRTGA HTRGWFVGRK RRRARKQTEQ ALLKLAEGDY QQVEKLMAKN ADHAEQPVVN
YLLAAEAAQQ RGDEARANQH LERAAELAGN DTIPVEITRV RLQLARNENH AARHGVDKLL
EVTPRHPEVL RLAEQAYIRT GAWSSLLDII PSMAKAHVGD EEHRAMLEQQ AWIGLMDQAR
ADNGSEGLRN WWKNQSRKTR HQVALQVAMA EHLIECDDHD TAQQIIIDGL KRQYDDRLLL
PIPRLKTNNP EQLEKVLRQQ IKNVGDRPLL WSTLGQSLMK HGEWQEASLA FRAALKQRPD
AYDYAWLADA LDRLHKPEEA AAMRRDGLML TLQNNPPQ