Gene Mpe_A1503 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A1503 
Symbol 
ID4784101 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp1618205 
End bp1619914 
Gene Length1710 bp 
Protein Length569 aa 
Translation table11 
GC content70% 
IMG OID640090070 
Productputative nitrite/sulfite reductase 
Protein accessionYP_001020700 
Protein GI124266696 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0155] Sulfite reductase, beta subunit (hemoprotein) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0107318 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.425168 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTACCAAT ACACCCCCTT CGACCGCGCC TTCGTGCACC AGCGCGCAGC GCAGTTTCGC 
GACCAGCTCG AGCGCAACCG CGCCGGCACG CTGGGTGACG ACGAGTTCCG CCCCCTGCGC
CTGCAGAACG GCTGGTACAT CCAGCGCCAT GCGCCGATGC TGCGCGTGGC GGTCCCCTAC
GGCGAACTCA GCAGCCGCCA GCTGCGCCAG CTGGCGCGCA TTGCCCGCGA GTTCGACCGT
GGCTATGCGC ACTTCACCAC GCGCCAGAAC GTCCAGTACA ACTGGATCCC GCTCGACCGG
AGCGCCGACG TGATGGACCT GCTGGCCGAC GTCGACATGC ACGGCATCCA GACCAGCGGC
AACTGCATCC GCAACACCAC CAGCGATGCG CTGGCCGGCG TGGCACCGGA CGAGATCGTC
GACCCGCGGC CCTACTGCGA GATCCTGCGG CAGTGGACCA CGCTGCACCC GGAGTTCGCC
TTCCTGCCGC GCAAGTTCAA GATCGCCGTC ACCGGCGCCA CCGAGGATCG CGCCGCCACC
GCCTGGCACG ACATCGGCCT GCACCTGCAC AAGAACGACG CCGGCGAGGT GGGCTTCCGC
GTGCTGGTCG GCGGCGGCAT GGGGCGCACG CCGATCCCCG GCGTGGTGAT CCGCGAGTTC
CTGCCCTGGC ACCAGATCCT CGTCTTCATC GAGGCGATCG TGCGCGTCTA CAACCGCTAC
GGCCGGCGCG ACAACATGTA CAAGGCGCGC ATCAAGATCC TGGTCAAAGC CGAGGGCGAG
CGCTTCATCG AACAGGTGGG CAAGGAGTTC GAGGCCATCC TGAGCCGCGA CGTCGATGGC
GACGCGCAAC TGATCCCTGA GTCCGAGCTG GACCGCGTGT CCGCCTGCTT CGTGCTGCCC
GAGGGCGTCG TCGCCCACGC GAGCGCCGGG GACGGTGCCC CGGCCGATGC CCCCGTGGCC
TACCGCAAGT GGCTGGAGCG CAACGTGCAC GGCCACCGGC TCGCCGGCTA TCGCGCGGTC
ACGCTGTCCG TCAAGCGCGC CGGCCAGGCG CCCGGCGACG CCACCGACAC GCAGCTCGAC
CTGGCCGCCG ACCTGGCCGA CCGCTACTCG CACGGCGAGA CGCGGGTCAC GCACGACCAG
AACCTGTTGT TGCCCTGGGT GCGCGAGGAA GACCTGTACG CGCTGTGGCG GGCTGCGCGC
GACGCCGCCT ACGCCACGCC CAACATCGGG CTGCTGAGCG ACATGATCGC CTGCCCCGGC
GGCGACTTCT GCGGCCTGGC CAACGCGCGC TCGATCCCGG TGGCCGAGCA GATCACCGAG
CGCTTTGCCG ACATCGACGA GCTCTACGAC ATCGGCGACA TCGACCTGCA CATCAGCGGC
TGCATCAACT CCTGCGGCCA CCACCACAGC GGCCACATCG GCATCCTCGG CGTCGACAAG
GACGGCGCCG AGTGGTACCA GGTCACGCTG GGCGGCTCCG ACGGCTCGGC CTTGAGTGGC
GGCCTGGCCT CGGCGGTGCC GGGCAAGGTG ATCGGCCCGT CGTTCGCGGC CGATGAGGTG
GCCGACGCCG TTGAGGCGGT GATCGAGACC TACCGCGGCC AGCGCGCCGC GAACGAGCGC
TTCATCGACA CCGTGCGGCG CGTCGGCCTC GAGCCCTTCA AGACCGCCGC CAACGCGGTG
CGCCGCAGCA CGGCGAAGGT GGCCGCATGA
 
Protein sequence
MYQYTPFDRA FVHQRAAQFR DQLERNRAGT LGDDEFRPLR LQNGWYIQRH APMLRVAVPY 
GELSSRQLRQ LARIAREFDR GYAHFTTRQN VQYNWIPLDR SADVMDLLAD VDMHGIQTSG
NCIRNTTSDA LAGVAPDEIV DPRPYCEILR QWTTLHPEFA FLPRKFKIAV TGATEDRAAT
AWHDIGLHLH KNDAGEVGFR VLVGGGMGRT PIPGVVIREF LPWHQILVFI EAIVRVYNRY
GRRDNMYKAR IKILVKAEGE RFIEQVGKEF EAILSRDVDG DAQLIPESEL DRVSACFVLP
EGVVAHASAG DGAPADAPVA YRKWLERNVH GHRLAGYRAV TLSVKRAGQA PGDATDTQLD
LAADLADRYS HGETRVTHDQ NLLLPWVREE DLYALWRAAR DAAYATPNIG LLSDMIACPG
GDFCGLANAR SIPVAEQITE RFADIDELYD IGDIDLHISG CINSCGHHHS GHIGILGVDK
DGAEWYQVTL GGSDGSALSG GLASAVPGKV IGPSFAADEV ADAVEAVIET YRGQRAANER
FIDTVRRVGL EPFKTAANAV RRSTAKVAA