Gene Dtpsy_2052 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtpsy_2052 
Symbol 
ID7385045 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidovorax ebreus TPSY 
KingdomBacteria 
Replicon accessionNC_011992 
Strand
Start bp2198040 
End bp2199098 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content69% 
IMG OID643655371 
Productferrochelatase 
Protein accessionYP_002553508 
Protein GI222111244 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0276] Protoheme ferro-lyase (ferrochelatase) 
TIGRFAM ID[TIGR00109] ferrochelatase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.0268125 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCGCCT CGCCTTCTCA CCATGCCGCA CAGACGGCCT CAGCCTCTTC GATCCCGACC 
GCCGCAGCAC GCGAAAGCAC CGCCATTCTG CTGAGCAACC TGGGCACGCC AGACGCCCCC
ACCGCACCAG CGCTGCGCCG CTATCTGGCG CAGTTCCTGG GCGACCCGCG GGTGGTGGAA
ATTCCGCGGG CGCTGTGGCT GCCTCTGCTG TACGGGATCA TCCTGCGCAC GCGGCCGGCC
AAGTCGGCCG CCAAGTACGC CAGCATCTGG ACGGAAGAAG GCTCCCCGCT CGCCGTATGG
ACGGCCAAGC AAGCGGTGAT GCTGCGCGGC TGGCTGGGCG AGGCCGGCCA GCCGGTGCGT
GTACTGCCCG CCATGCGCTA CGGTCAGCCG GCCATTGGTG CGCAGCTGCA GGTGCTGCAA
GACGCCGGCG TGCGCCGCGT GCTGGTGCTG CCGCTGTACC CGCAGTACTC GGCCACGACC
ACCGCCAGCG TGTTCGACGA CGTGGCCGCG TGGGTGCGCC GCTCGCGTTG CTTTCCCGAG
CTGCGCTTCG TGAACGACTA CCACGACCAC CCGGACTACA TTGCCGCGCT GGCCCAGTCG
GTGCGCGCGC ACTGGCAGCG CGAGGGCGGC CCGGCGGACA AGCTGGTGAT GAGCTTTCAC
GGCATTCCCG AACGCAATGT GCGATTGGGT GACCCCTACG CGGAGCAATG CCGCACCACG
GCCCGGCTGC TCGCGCAGGC GCTGGGGCTG GGCGAAGACC GGTACCTGCT GACCTTCCAG
TCGCGCTTCG GCAAGGCCCG CTGGCTGGAG CCCTACACCG AACCCTCGCT GGTGGCACTG
GCACAGGGCG GCACACGCCG TGTGGACGTG ATGTGCCCAG GCTTCACCAG CGACTGCCTG
GAAACGCTGG AGGAGATCAA CCAGGAGGCA CGCGAGGCCT TCCTGCACGC AGGCGGCCAG
GACTTCCGCT ACATCCCCTG CCTGAACGAC AACCCCGCCT GGATCACTGC CTTGAGCCGC
ATCGCACAAC AGCACCTGGC GGGCTGGAGC CAGCCATGA
 
Protein sequence
MPASPSHHAA QTASASSIPT AAARESTAIL LSNLGTPDAP TAPALRRYLA QFLGDPRVVE 
IPRALWLPLL YGIILRTRPA KSAAKYASIW TEEGSPLAVW TAKQAVMLRG WLGEAGQPVR
VLPAMRYGQP AIGAQLQVLQ DAGVRRVLVL PLYPQYSATT TASVFDDVAA WVRRSRCFPE
LRFVNDYHDH PDYIAALAQS VRAHWQREGG PADKLVMSFH GIPERNVRLG DPYAEQCRTT
ARLLAQALGL GEDRYLLTFQ SRFGKARWLE PYTEPSLVAL AQGGTRRVDV MCPGFTSDCL
ETLEEINQEA REAFLHAGGQ DFRYIPCLND NPAWITALSR IAQQHLAGWS QP