Gene Ent638_3990 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEnt638_3990 
Symbol 
ID5110451 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEnterobacter sp. 638 
KingdomBacteria 
Replicon accessionNC_009436 
Strand
Start bp4324182 
End bp4325381 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content56% 
IMG OID640494204 
Productputative protoheme IX biogenesis protein 
Protein accessionYP_001178696 
Protein GI146313622 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG3071] Uncharacterized enzyme of heme biosynthesis 
TIGRFAM ID[TIGR00540] hemY protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00160322 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.165471 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTAAAG TTCTCTTCCT CTTCTTATTG TTGATCGCCG GGATCGTACT GGGGCCAATG 
CTTGCGGGCC ATCAAGGCTA CGTGCTGATC CAGACCGATA ACTACAATAT CGAAACGAGC
GTCACGGGCC TGGTGATCAT CTTGATCCTC ACCATGGTGG CGCTGTTCGC CATCGAATGG
ATCCTGCGCC GTATTTTCCG CACGGGCGCA CATACCCGCA GCTGGTTTGT TGGCCGCAAA
CGCCGTCGTG CACGCAAGCA GACCGAACAG GCGCTGCTGA AACTGGCTGA AGGCGATTAT
CAGCAAGTTG AAAAGCTGAT GACCAAAAAC GCCGATCACG CTGAGCAGCC GGTGGTTAAC
TATCTGCTAG CCGCAGAAGC CGCCCAGCAG CGCGGCGATG AAATGCGTGC CAATCAGCAT
CTTGAGCGCG CGTCCGAACT GGCTTCTAAC GACCAGATTC CAGTTGAAAT TACACGCGTG
CGTCTGCAAC TGGCGCGAGG TGAAAACCAC GCAGCGCGTC ACGGTGTTGA CCGTCTGCTG
GAAATCACGC CACACCATCC GGAAGTGCTG CGTCTGGCAG AGCAGGCTTA TATCCGCACC
GGCGCCTGGG GTTCATTGCT GGATATTATT CCTTCTATGG CAAAAGCCGA CGTGGGTGAT
GATGAACACC GTGATGCGCT GCAGCGTCAG GCGTGGATTG GCCTGATGGA TCAGGCGCGG
GCCGATCTGG GTAGCGACGG TCTGAAAACC TGGTGGAAGA ATCAGAGCCG TAAAACGCGC
CAGCAAGTTC CATTGCAGGT GGCGATGGCA GAACATCTCA TCGAATGTGA CGATCATGAC
ACCGCGCAGG CGATCATTCT TGATGGCTTG AAGCGTCAGT ATGACGATCG TCTGGTGATG
GTGATCCCGC GTCTCAAGAC CAACAATCCT GAGCAGATGG AAAAAATGTT ACGCCAGCAG
ATCAAGACGG TGGGCGATCG TCCGCTGCTA TGGAGCACGC TGGGTCAGTC GCTGATGAAG
CACGGCGAAT GGCAGGAGGC GAGCCTCGCT TTCCGCGCTG CGTTGAAACA GCGCCCGGAT
GCGTTTGATT ATGCATGGCT TGCCGACTCG CTGGACAAAC AGCACAAGCC AGAAGAAGCC
GCGGCGATGC GTCGTGATGG CCTGCTGCTC ACCTTGCAGA ATAACGGCAG TCAGGTGTAA
 
Protein sequence
MLKVLFLFLL LIAGIVLGPM LAGHQGYVLI QTDNYNIETS VTGLVIILIL TMVALFAIEW 
ILRRIFRTGA HTRSWFVGRK RRRARKQTEQ ALLKLAEGDY QQVEKLMTKN ADHAEQPVVN
YLLAAEAAQQ RGDEMRANQH LERASELASN DQIPVEITRV RLQLARGENH AARHGVDRLL
EITPHHPEVL RLAEQAYIRT GAWGSLLDII PSMAKADVGD DEHRDALQRQ AWIGLMDQAR
ADLGSDGLKT WWKNQSRKTR QQVPLQVAMA EHLIECDDHD TAQAIILDGL KRQYDDRLVM
VIPRLKTNNP EQMEKMLRQQ IKTVGDRPLL WSTLGQSLMK HGEWQEASLA FRAALKQRPD
AFDYAWLADS LDKQHKPEEA AAMRRDGLLL TLQNNGSQV