Gene Bind_1613 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBind_1613 
Symbol 
ID6200339 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBeijerinckia indica subsp. indica ATCC 9039 
KingdomBacteria 
Replicon accessionNC_010581 
Strand
Start bp1820462 
End bp1821736 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content55% 
IMG OID641705603 
Productdi-haem cytochrome c peroxidase 
Protein accessionYP_001832733 
Protein GI182678587 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1858] Cytochrome c peroxidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.681731 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCATTATT GTCTCTCTGC CTTTGCTGGT CTGATTTTAT TTATGGCTGT TGGAGCCATT 
ATTGACAGTT TTGGGCAAGA CAGTTTGGCG CAAGAAAATC CGCAAGGAGG AATGACCAGA
GCGACGGCCT ATCAACAGGC GAGGCAATTG TCGGAGATCG GCCGGCAGAT GTTTTCCGAT
CCCAGACTGT CGGGCTCGGG TAAATTGTCC TGCGCTTCCT GCCATGATCC CCAGAATGCT
TTTTCGCCCT CCAATGACCT TGCTGTGCAA ATGGGCGGCA AAACCCTCGA TCGAGCCGCG
TTTCGCGCGG CTCCGACGCT CACCTATAAA CAGGTGACCC CGCCGTCCAC GGAACATTAT
TATGAATCCG ACGAGGAAGC TGACGGCAGC ATAGACAATG GTCCAACGGG CGGCTTGACC
TGGGATGGCC GCGTGGATCG GCGCTCGGAC CAAGCCTTGA TCCCTCTTTT ATCGCCGCTT
GAAATGGCGA ATGAAGATCG TGCGGCGCTG TCCGATACGA TCGAAAAAAT TTACGGGCAA
GCCTTGCGCG CTGTGGCAGG CGGATTGGTG TCGCAAGACA AGCAATGGCC CCTCGAAGCG
GCGATGAAGG CGCTTGACGC CTTTCAGCAG GAGGGCGCGC TCTTCTATCC CTTCTCAAGC
AAATATGATG CTTTTCTGCG CGGCAAGGCG GAATTGAGCG AGCAGGAAAA ACACGGGCTC
GAAATTTTCA CGGCAGAGGA CAAGGGTAAT TGCGCAAGCT GCCATGTCAG CGCGCCGAGC
AAGACGGGCA CCCCGCCCTT CTTCACTGAT TATGGGTTGA TTGCGATCGG CGTGCCGCGC
AATCGCGACA TAGCGCAAAA TCAAGATCCG GCTTTTTTCG ATCTCGGTCT TTGTGGTCCC
GAGCGACAGG ATTTCCGTGA CAGGCCGGAT TATTGCGGCC TGTTCAAGAC GCCGACCCTC
CGCAATGTCG CCCGCAGAAA AGTCTTCTTC CACAATGGTG TCGTGAAGAG CTTGCGCGAG
GCCGTCGCCT TCTATTTCGA ACGCGATACG AAGCCGGAGA AATATTATCC CCACGGCGCC
GATGGAACGA TTGCCAAATA TGACGATCTG CCCGCTGCTT ACCATGACAA TGTCAATAAT
GAGCCCCCCT TTGGTAAACA GGCCGGAGAC CCTTCGACGG TCTCGGAAGC CGACATTGAT
GCGGTGGTCG CCTTTCTCAA AACCCTGGAC GATGGCTTCT CGAGCCAGGA ACCCGCTGCG
GTGAAGTATC CGTAA
 
Protein sequence
MHYCLSAFAG LILFMAVGAI IDSFGQDSLA QENPQGGMTR ATAYQQARQL SEIGRQMFSD 
PRLSGSGKLS CASCHDPQNA FSPSNDLAVQ MGGKTLDRAA FRAAPTLTYK QVTPPSTEHY
YESDEEADGS IDNGPTGGLT WDGRVDRRSD QALIPLLSPL EMANEDRAAL SDTIEKIYGQ
ALRAVAGGLV SQDKQWPLEA AMKALDAFQQ EGALFYPFSS KYDAFLRGKA ELSEQEKHGL
EIFTAEDKGN CASCHVSAPS KTGTPPFFTD YGLIAIGVPR NRDIAQNQDP AFFDLGLCGP
ERQDFRDRPD YCGLFKTPTL RNVARRKVFF HNGVVKSLRE AVAFYFERDT KPEKYYPHGA
DGTIAKYDDL PAAYHDNVNN EPPFGKQAGD PSTVSEADID AVVAFLKTLD DGFSSQEPAA
VKYP