Gene Phep_3090 details


Gene Information

Locus tag: Phep_3090
Symbol:
ID: 8254208
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 3689650
End bp: 3691146
Gene Length: 1497 bp
Protein Length: 498 aa
Translation table: 11
GC content: 42%
IMG OID: 644936744
Product: siroheme synthase
Protein accession: YP_003093349
Protein GI: 255532977
COG category: [H] Coenzyme transport and metabolism
              [R] General function prediction only
COG ID: [COG0730] Predicted permeases
        [COG1648] Siroheme synthase (precorrin-2 oxidase/ferrochelatase domain)
TIGRFAM ID: [TIGR01470] siroheme synthase, N-terminal domain
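
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the coordinates are 1-based and inclusive and that the final codon is a stop codon; neither is stated in the record, although both are consistent with the listed values. The function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 3689650
END_BP = 3691146
GENE_LENGTH_BP = 1497
PROTEIN_LENGTH_AA = 498


def span_length(start: int, end: int) -> int:
    """Length of a feature, ASSUMING both coordinates are inclusive."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one, ASSUMING the last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


computed_length = span_length(START_BP, END_BP)
computed_protein = expected_protein_length(computed_length)

# Both comparisons hold for this record: 1497 bp and 498 aa.
print(computed_length == GENE_LENGTH_BP, computed_protein == PROTEIN_LENGTH_AA)
```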


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000482671
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.0254521
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAAGGCA ACCAGTTATT TCCTGTCTTC ATCAAACTGA ATACCATTCA TACCGTATTG 
GTAGGCGCTG GTCCTATCGG TTTAGAAAAA CTTTCTGCAT TGCTGGATAA CAGTCCGGAG
GCAAAGGTTA CTGTTATTGG AATTGATGTT TTACCGCAAT TGCAAACACT TGCTGATGGT
TATGAAGGAG TAAATGTATT GCAAAAACCA TTTGAACCTG CAGATTTAAA CGGGGCAGAC
CTGGTAATTG CAGCTACGAA TAATGAACTG CTGAACCAGG AGATCAGGAT ACAGGCCAGT
GAGCGCAATT TGCTGGTCAA CTTTGCCGAT AAGCCGGAAC TCTGCGATTT TTATCTTGGC
TCGGTTGTTA AAAAAGGGGA TTTAAAAATA GCCATTTCTA CAAACGGTAA ATCGCCCACT
ATGGCCAAAC GTTTAAAAGA AGTACTAAAC AATGGCTTGC CAGGCGAATT GAACGATACC
CTGCAAAACA TGCAGGCACT TAGAAATACA TTGAATGGCG ATTTTGCTGC AAAAGTAAAA
AAACTAAATG AAGTGACATC GGCCCTGGTT GAAGGAAAAG CTTCGGTTGA AGGAGTGGTT
GTAAGTGATG GAAAGCCTAA ATTGGGTAAA ATGAAATGGC TGATATGGCT GGCCATCGTA
TTTGCTTTTG CTATAGTGGT GGCAGCTTTT TGGCATAAGG AACCCGAATT TCAGGCTTTT
GTAGCCAATA TTAACCCGAT GTTTTACTGG TTTTTGCTGG GTGGTTTCAT CTTTGCGATG
ATAGATGGCG CCATAGGGAT GTCGTATGGT GTAACCTCTA CTTCCTTTTC GCTGGCGATG
GGTGTGCCAC CTGCATCAGC AAGTATGGGA GTACATTTGT CAGAAATTTT AAGCAATGGC
ATTGCCGGCT GGATGCATTA CCGTTTTGGA AATGTGAACT GGAAACTGTT TAAGCTCTTG
CTGATACCAG GTATTATTGG TGCAGTAACG GGTGCTTACC TGTTGTCCTC ACTGGAACAT
TACAGCCATT ATACCAAACC AATAGTTTCT TTGTATACTT TAATATTGGG TTTTGTAATT
TTGTCTAAAG CCTATCAGGC TAACCGTAAA AGTGCAGTCA GAAAAAATAA GATTAAAAAG
ATTGGTTTGC TGGGGCTTTT TGGTGGTTTT ATTGATGCTG TTGGTGGTGG TGGCTGGGGT
TCAATTGTAT TGTCAAGCTT AATCGCGGGT GGAAGAAATG CACGTTTCTC TTTAGGTACC
GTAAAAATCA CCCGCTTTTT TATAGCACTG ATGAGCTCAC TTACTTTTAT CACCATGCTG
AATGGTGCAC ATTGGGAGGC TGTTGCAGGT TTGGTGATTG GAAGTGCCCT GGCCTCGCCC
ATAGCGGCAA AGGTTTCCAA CAGGATCTCT GCTAAAACAA TTATGGTATC GGTAGGTATC
CTGGTGGTTC TGGTGAGTCT GCGCAGCATC GTCAAATTTA TTCTCGAATT AGTATAG
 
Protein sequence
MEGNQLFPVF IKLNTIHTVL VGAGPIGLEK LSALLDNSPE AKVTVIGIDV LPQLQTLADG 
YEGVNVLQKP FEPADLNGAD LVIAATNNEL LNQEIRIQAS ERNLLVNFAD KPELCDFYLG
SVVKKGDLKI AISTNGKSPT MAKRLKEVLN NGLPGELNDT LQNMQALRNT LNGDFAAKVK
KLNEVTSALV EGKASVEGVV VSDGKPKLGK MKWLIWLAIV FAFAIVVAAF WHKEPEFQAF
VANINPMFYW FLLGGFIFAM IDGAIGMSYG VTSTSFSLAM GVPPASASMG VHLSEILSNG
IAGWMHYRFG NVNWKLFKLL LIPGIIGAVT GAYLLSSLEH YSHYTKPIVS LYTLILGFVI
LSKAYQANRK SAVRKNKIKK IGLLGLFGGF IDAVGGGGWG SIVLSSLIAG GRNARFSLGT
VKITRFFIAL MSSLTFITML NGAHWEAVAG LVIGSALASP IAAKVSNRIS AKTIMVSVGI
LVVLVSLRSI VKFILELV
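
The sequences above are printed in blocks of ten characters, so the spaces and line breaks have to be removed before any analysis. The following is a minimal sketch of how the gene sequence could be cleaned, its GC fraction computed, and its translation compared with the listed protein. It uses the amino acid assignments of the standard genetic code, which translation table 11 shares; the alternative start codons that table 11 allows are not handled, which does not matter here because the gene begins with ATG. All function and variable names are hypothetical, and only the first line of each sequence is used as sample input.

```python
# Codon table built from the conventional TCAG ordering of the standard code.
# Translation table 11 uses the same amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove all whitespace from a blocked sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate complete codons until the first stop codon.

    Raises KeyError for codons containing characters other than A, C, G, T.
    """
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence and the matching start of the protein.
first_gene_line = clean_sequence(
    "ATGGAAGGCA ACCAGTTATT TCCTGTCTTC ATCAAACTGA ATACCATTCA TACCGTATTG"
)
protein_start = clean_sequence("MEGNQLFPVF IKLNTIHTVL")

print(translate(first_gene_line) == protein_start)
```

Passing the full gene sequence to `gc_content` and `translate` in the same way would allow the GC content and protein length fields of the record to be checked against the sequence itself.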