Gene ECD_03677 details


Gene Information

Locus tag: ECD_03677
Symbol: hemY
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21(DE3)
Kingdom: Bacteria
Replicon accession: CP001509
Strand:
Start bp: 3876932
End bp: 3878128
Gene Length: 1197 bp
Protein Length: 398 aa
Translation table: 11
GC content: 55%
IMG OID:
Product: predicted protoheme IX synthesis protein
Protein accession: ACT45470
Protein GI: 253979800
COG category:
COG ID:
TIGRFAM ID:
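
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are inclusive, 1-based coordinates and that the gene length includes the terminal stop codon. The function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive, 1-based start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Number of residues encoded by a CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # subtract the stop codon


length_bp = gene_length(3876932, 3878128)
print(length_bp, protein_length(length_bp))  # 1197 398
```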


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.28898
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTAAAAG TGTTATTGCT CTTTGTTTTG CTGATTGCGG GGATCGTGGT TGGCCCGATG 
ATTGCCGGTC ATCAGGGGTA CGTATTGATT CAGACCGACA ACTACAATAT CGAAACCAGC
GTCACTGGCC TGGCGATTAT TTTGATTCTG GCGATGGTAG TGCTGTTTGC CATTGAGTGG
CTACTGCGGC GGATCTTCCG CACAGGCGCA CACACCCGTG GGTGGTTTGT CGGACGTAAG
CGTCGCCGTG CCCGTAAGCA GACCGAACAG GCGCTGCTGA AACTAGCGGA AGGCGATTAT
CAGCAAGTTG AAAAGCTGAT GGCGAAAAAT GCCGATCACG CTGAACAACC GGTGGTGAAC
TATCTGCTGG CTGCCGAAGC CGCGCAACAA CGCGGTGATG AAGCACGCGC CAACCAACAT
CTGGAACGCG CAGCGGAGCT GGCCGGCAAC GACACAATTC CGGTAGAAAT CACCCGTGTA
CGTCTGCAAC TGGCCCGTAA TGAAAACCAT GCTGCACGCC ACGGCGTGGA TAAGCTGCTG
GAAGTTACGC CACGCCATCC GGAAGTATTA CGTCTGGCGG AACAGGCGTA TATCCGCACT
GGTGCATGGA GTTCGCTGCT GGATATCATC CCATCAATGG CGAAAGCCCA CGTTGGCGAT
GAAGAACATC GTGCAATGCT GGAACAACAG GCATGGATTG GCCTGATGGA TCAGGCGCGT
GCCGATAACG GTAGCGAAGG CTTGCGTAAC TGGTGGAAAA ACCAAAGCCG GAAAACGCGT
CATCAGGTAG CGTTGCAGGT GGCAATGGCG GAACATCTTA TTGAATGTGA CGATCATGAT
ACTGCCCAGC AAATTATCAT CGACGGCCTG AAACGCCAGT ATGACGATCG CCTACTGCTG
CCGATCCCGC GTCTGAAAAC CAATAACCCG GAGCAGCTGG AAAAAGTGCT GCGCCAACAG
ATCAAAAACG TGGGCGATCG CCCGTTGTTG TGGAGCACAC TGGGTCAGTC GCTGATGAAG
CACGGCGAAT GGCAGGAAGC ATCGCTCGCC TTCCGCGCGG CGCTGAAACA ACGTCCGGAC
GCCTACGATT ACGCATGGCT TGCCGACGCG CTGGACAGAC TGCACAAACC AGAAGAAGCT
GCAGCGATGC GCCGCGACGG TTTGATGTTA ACGTTGCAGA ACAACCCGCC ACAGTAG
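
The gene sequence is displayed in space-separated blocks of 10 bases, so the spacing has to be removed before computing the length or the GC content. A minimal sketch, assuming the sequence contains only the unambiguous bases A, C, G and T; the helper names are hypothetical, and only the first displayed line is used as sample input here.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an unambiguous DNA sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


first_line = "ATGCTAAAAG TGTTATTGCT CTTTGTTTTG CTGATTGCGG GGATCGTGGT TGGCCCGATG"
seq = clean_sequence(first_line)
print(len(seq))  # 60 (six blocks of 10 bases)
print(round(gc_percent(seq), 1))
```

Applying the same two functions to all 20 lines of the gene sequence gives values that can be compared with the Gene Length and GC content fields.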
 
Protein sequence
MLKVLLLFVL LIAGIVVGPM IAGHQGYVLI QTDNYNIETS VTGLAIILIL AMVVLFAIEW 
LLRRIFRTGA HTRGWFVGRK RRRARKQTEQ ALLKLAEGDY QQVEKLMAKN ADHAEQPVVN
YLLAAEAAQQ RGDEARANQH LERAAELAGN DTIPVEITRV RLQLARNENH AARHGVDKLL
EVTPRHPEVL RLAEQAYIRT GAWSSLLDII PSMAKAHVGD EEHRAMLEQQ AWIGLMDQAR
ADNGSEGLRN WWKNQSRKTR HQVALQVAMA EHLIECDDHD TAQQIIIDGL KRQYDDRLLL
PIPRLKTNNP EQLEKVLRQQ IKNVGDRPLL WSTLGQSLMK HGEWQEASLA FRAALKQRPD
AYDYAWLADA LDRLHKPEEA AAMRRDGLML TLQNNPPQ
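
The protein sequence corresponds to the gene sequence translated with translation table 11. The sketch below builds the codon table from the standard T, C, A, G ordering and translates up to the first stop codon. It does not handle the alternative initiation codons that table 11 allows, which is not a limitation for this gene because it starts with ATG. The names are illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (same assignments in tables 1 and 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop block spacing and line breaks
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence give the first ten residues.
print(translate("ATGCTAAAAG TGTTATTGCT CTTTGTTTTG"))  # MLKVLLLFVL
```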