Gene Bind_3476 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBind_3476 
Symbol 
ID6201099 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBeijerinckia indica subsp. indica ATCC 9039 
KingdomBacteria 
Replicon accessionNC_010581 
Strand
Start bp3951741 
End bp3952802 
Gene Length1062 bp 
Protein Length353 aa 
Translation table11 
GC content61% 
IMG OID641707430 
Productferrochelatase 
Protein accessionYP_001834522 
Protein GI182680376 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0276] Protoheme ferro-lyase (ferrochelatase) 
TIGRFAM ID[TIGR00109] ferrochelatase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.328207 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.394804 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACCCC TTGCCGCTTC TACGACGAAA GCGCGTCTGA CGGACCGAAT CGGTGTTCTG 
CTTGTCAATC TCGGAACGCC GGATGCGACC GATTACTGGT CCATGCGCCG TTATCTCCGC
GAATTCCTGT CCGACCGCCG GGTGGTCGAA CTGCCGCGCT GGGTCTGGTG GCCAATCCTG
CATGGGATTA TCCTCACCAC GCGCCCGCAT CGGAGCGGAC GCCGCTATGC CGGGATCTGG
GACAAGGAGC GTAACGAGAG CCCGCTCAAA ACCATCACCC GCGCGCAAGG GGAACTCCTG
GCTCGTTCGA TTGGCGCCTC ATCCGCGCCA GGGGACGGAA AAGCGCAAGA TATCGTGATT
GATTTCGCGA TGCGATATGG CAATCCTTCG ATCGCTTCGG GTCTCGATCG CCTGTTGCAA
CAGAATTGCG GCCGCATTCT TGTCGTGCCG CTCTATCCGC AATATGCAGC GGCGACCACG
GCGAGCGTCG CGGATAAGGT TTTCGAGATT TTGCGGGCGC GGCGCTGGCA GCCGGCCTTG
CGGATTGCAC CCCCCTATTA TGCCGAGCCC TTCTATATAG AAGCCTTGGC GCGTTCCGTG
CGGCGCGGTC TCGCCGGGCT TGATTTCGAG CCGGACGCGA TCCTTTTGTC TTTTCACGGC
ATTCCGAAAT CCTCCGTCGA CAAGGGCGAT CCCTATTATG AGCATTGCCT GATCACGGCG
GATTTGTTGC GGCAAGCCCT GGGTCTCGAC GAGGCGCATT GCGTGACGAC TTTTCAATCC
CGCTTCGGCC GCGCCGAATG GATCGGACCC GCGACCGATG CGACGGTGAA AGCTTTGGCC
GCGCGAGGTG TCAAAAAACT GGCTGTCGTC ATGCCAGGTT TTGCCGCCGA TTGCCTGGAA
ACAATCGAGG AAGTGGGCGG GGAGATCCGC GCCTTATTCC TGGGCGCGGG GGGCGAGGAT
TATGCCGCGC TCCCCTGCCT CAACGCGAGC GAGGAGGGCA TGGAGGTTAT TGCCCGGCTG
GTTCGGCGTG AATTGCAAGG CTGGCTGACC TCAACAGATT AG
 
Protein sequence
MKPLAASTTK ARLTDRIGVL LVNLGTPDAT DYWSMRRYLR EFLSDRRVVE LPRWVWWPIL 
HGIILTTRPH RSGRRYAGIW DKERNESPLK TITRAQGELL ARSIGASSAP GDGKAQDIVI
DFAMRYGNPS IASGLDRLLQ QNCGRILVVP LYPQYAAATT ASVADKVFEI LRARRWQPAL
RIAPPYYAEP FYIEALARSV RRGLAGLDFE PDAILLSFHG IPKSSVDKGD PYYEHCLITA
DLLRQALGLD EAHCVTTFQS RFGRAEWIGP ATDATVKALA ARGVKKLAVV MPGFAADCLE
TIEEVGGEIR ALFLGAGGED YAALPCLNAS EEGMEVIARL VRRELQGWLT STD