Gene MCA2059 details

Gene Information

Locus tag: MCA2059
Symbol:
ID: 3103857
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylococcus capsulatus str. Bath
Kingdom: Bacteria
Replicon accession: NC_002977
Strand:
Start bp: 2214069
End bp: 2215715
Gene Length: 1647 bp
Protein Length: 548 aa
Translation table: 11
GC content: 63%
IMG OID: 637171213
Product: nitrite/sulfite reductase protein
Protein accession: YP_114490
Protein GI: 53803616
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0155] Sulfite reductase, beta subunit (hemoprotein)
TIGRFAM ID:
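
The coordinate and length fields above can be cross-checked with a few lines of arithmetic. The sketch below is illustrative only: it assumes the start and end coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 2214069
end_bp = 2215715

# Assumption: coordinates are 1-based and inclusive on the replicon.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1647 548
```

Both results agree with the listed Gene Length (1647 bp) and Protein Length (548 aa).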


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.155547
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGTATCAAT ACGACCAATA TGACCAGCGG CTGGTCGACG AACGGGTTGC CCAGTTCCGC 
AGCCAGACCC GCCGATTTCT CCAGGGCGAA CTGAGCGAGG ACCAGTTCCG TCCCTTGCGA
CTGATGAACG GGCTGTACCT CCAGCGTCAT GCCCCCATGC TGCGCGTCGC CATTCCTTAC
GGCGTGCTCT CGTCCCGGCA GCTGCACAAG CTGGCCGAGA TCGCCCGCCG TTACGACAAA
TCCGTCGCTC ATTTCACCAC CCGCCAGAAC ATCCAGTTCA ACTGGCCACG CCTGGAAGAC
GTGCCGGACA TCCTGGCGGA GTTGGCGACG GTCCAAATGC ACGCCATCCA GACCAGCGGC
AACTGCATCC GCAACGTCAC CAGCGACCCG CTGGCCGGCG TCTGTCCCGA CGAGATCGAA
GACCCCAGGC CCTATTGCGA AATCATCCGG CAGTGGTCGA CCCTGCATCC GGAGTTCAGC
TACCTGCCGC GCAAGTTCAA GATCGCCGTC AGCGGTGCGA GGAAAGACCG CGCCGCGACC
CAGGTACACG ACATCGGCTT GCAGATGGTG GAAAACGAGG CAGGCGAAAC CGGCTTCGAG
GTCCTGGTCG GCGGCGGCCT CGGACGCACC CCGATCATCG GCCAGACCAT CCGCCCCTTC
CTCGCCAAAA CGGATCTCCT CTCCTACCTG GAAGCCATCC TGCGCGTATA CAACCGCTTC
GGGCGGCGCG ACAACAAATA CAAGGCGCGC ATCAAAATCC TGCTCAAAGA GACCGGCATT
GAAGATTTCA CCCGTCGGGT CGAGGCCGAA TGGGTGCAAA TCCGCCAGCA ATTGGTGCTG
GACACCGGCG AGATCGAGCG TGTAAAGAGG CATTTCACGG CGCCCGAGTA CAAGACCGTG
GCCGGTCCCG GCCTGGACGC CCAAGCAGCG GCCGACCCGC GCTTCGCCGT CTGGCTGAGG
AACAACACCA CGCCGCACAA GATCCCCGGT TACCGGGCGG TATTCCTTTC GCTGAAAGCC
CGCGGCGTAC CGCCGGGTGA CATCGCCGAC ACCCAGTTGG ATGCCGTGGC CGACCTCGCG
GAGCGTTACA GCTTCGGCGA GGTCCGCGCC ACCCACACCC AGAACCTGGT GTTCGCGGAT
GTGCATCAGG ACGACCTGCA CGAACTCTGG CAGCGCCTGG ACTCCATCGG CTTGGCGACG
CCCAACATCG GCAAGGCGAC TGACATGATC TGCTGCCCCG GTCTGGATTA TTGCTCGCTG
GCCAATGCCA GTTCGATCTC GGTGGCGGAA GACATCTATT CGCGCATCGA CGATCTCGAC
TATCTGCACG ACCTCGGCGA CCTGCGGATC AACATCTCCG GCTGCATGAA TGGCTGCGCC
CATCAAAGCG TCGGCCACAT CGGCATTCTG GGCGTCGACA AGAAAGGCGA GGAATGGTAC
CAGCTCACGC TCGGCGGCAG CTCCAGCAAT GACGCCTCCC TCGGCGAGCG CCTCGGGCCC
GCCATCGACA AGGCCCACGT GGCCGAAGCC GTGGAAACCA TTCTGCAAAC CTACATCGAG
CTACGTCAGG ACGGGGAATC CTTCCTCGAT ACCGTGCGGC GTCTCGGCAT CAACCCGTTT
CAGGAGCGCG TCTATGCCCA TCATTAA
 
Protein sequence
MYQYDQYDQR LVDERVAQFR SQTRRFLQGE LSEDQFRPLR LMNGLYLQRH APMLRVAIPY 
GVLSSRQLHK LAEIARRYDK SVAHFTTRQN IQFNWPRLED VPDILAELAT VQMHAIQTSG
NCIRNVTSDP LAGVCPDEIE DPRPYCEIIR QWSTLHPEFS YLPRKFKIAV SGARKDRAAT
QVHDIGLQMV ENEAGETGFE VLVGGGLGRT PIIGQTIRPF LAKTDLLSYL EAILRVYNRF
GRRDNKYKAR IKILLKETGI EDFTRRVEAE WVQIRQQLVL DTGEIERVKR HFTAPEYKTV
AGPGLDAQAA ADPRFAVWLR NNTTPHKIPG YRAVFLSLKA RGVPPGDIAD TQLDAVADLA
ERYSFGEVRA THTQNLVFAD VHQDDLHELW QRLDSIGLAT PNIGKATDMI CCPGLDYCSL
ANASSISVAE DIYSRIDDLD YLHDLGDLRI NISGCMNGCA HQSVGHIGIL GVDKKGEEWY
QLTLGGSSSN DASLGERLGP AIDKAHVAEA VETILQTYIE LRQDGESFLD TVRRLGINPF
QERVYAHH
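
The sequences in this record are printed in blocks of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch of that step, assuming plain Python with no external libraries. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the input is a short toy string in the same layout, not the MCA2059 sequence.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# Toy input laid out like the listing above (illustrative only).
example = "ATGTATCAAT ACGAC\nTAA"
dna = clean_sequence(example)

# A complete CDS should have a length divisible by three.
in_frame = len(dna) % 3 == 0
print(len(dna), in_frame, round(gc_percent(dna), 1))
```

Pasting the full gene sequence into `example` would allow the same checks against the Gene Length and GC content fields.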