Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | MCA1504 |
Symbol | |
ID | 3102610 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylococcus capsulatus str. Bath |
Kingdom | Bacteria |
Replicon accession | NC_002977 |
Strand | + |
Start bp | 1602141 |
End bp | 1603400 |
Gene Length | 1260 bp |
Protein Length | 419 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637170679 |
Product | sulfite oxidase SoxC, putative |
Protein accession | YP_113961 |
Protein GI | 53804403 |
COG category | [R] General function prediction only |
COG ID | [COG2041] Sulfite oxidase and related enzymes |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.545192 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGCCTT TCATCGAACC GGAAACCCTT CCTTCCGCAA TCCCCAGTTC GGGGCCGGAC CGCCGGCGCT TCCTGAAAGC GGGGCTGGCC GTCACCGGCG CCGCCCTGAC CGGTGGCGTC CGGGCGGCGC CGCCGCCTTG GATGACGCGG CCCGGCGCCC CGCTTTCCAA TTACGGCCAG CCCTCGCCGC ACGAACGCGC CGTCATCCGC TGGGTCGCGG CCAATCCCGA TGCGCCGGGG AACGGCATTT CCTGGACACC GCTGGAACGT CTGGAAGGCA TCATCACCCC CAGCGGCTTG CACTTCGAGC GCCACCACAA CGGCGTGCCG CAAATCGATC CCGCCGTCCA TCGCCTGGTC GTGCACGGAC TGGTTGTCAA GTCCTCGAGT TTCGGCATCG ACGACCTCCT GCGCTACCCG CAGACCTCGC GCCAGTGTTT CGTCGAATGC GGTGGCAACG GCAATGCCGG CTGGCACCTG GAGCCGATGC AAGCCCCGGC CGGTAACGTC CACGGCCTTG CTTCCTGCAG CGAATGGACT GGCGTACCGC TGGCTACCGT GTTGGAGGAA TGTGGCCTGC AACCGAACGC CAAATGGCTG ATCGCGGAAG GCGCCGATGC CGCGGCGATG AACGTCAGCA TTCCCCTGGA AAAGGCGCTG GACGATGCCC TGCTCGCCCT GTACCAGAAC GGCGAGCGCC TGCGGCCGGA GAACGGTTAT CCACTGCGGC TCATCCTGCC CGGCTGGGAA GGTGTCACCA ACGTCAAATG GTTGCACCGC CTGCAGCTTG CGGAGCAGCC CGCGATGGCC CGTAACGAAA CCGCGAAATA CACCGAGCTG CTGCCCTCCG GCCAGGCCCG GCAGTTCAGT TTCGTCATGG AGGCCAAGTC GCTCATCACT CGTCCCTCCG CCGGCCAGTC CTTGCCCGGC CCCGGCTTGC ACCCGATCTC CGGGCTGGCC TGGAGCGGCC GGGGCGCGAT CCGACGGGTG GAAGTTTCGG CCGATGGCGG CAAGACCTGG CAGGACGCGG CGCTCGACCC GCCCGTGCTG CCCAAGTGCT TCACCCGCTT CCGCCTGCCC TGGCGCTGGG ACGGCTCGCC TGCCGTACTC AAGAGCCGGG CCACCGACGA AACCGGCTAT GTCCAGCCCG AACGCCAGAC CCTGATCGCC GAGCGCGGGC GCCACGGCTA CTTCCACTAC AATGCGATCG TATCCTGGGC CGTCGCCGCC GATGGGAGCG TCAGCCATGT CTATGCGTGA
|
Protein sequence | MKPFIEPETL PSAIPSSGPD RRRFLKAGLA VTGAALTGGV RAAPPPWMTR PGAPLSNYGQ PSPHERAVIR WVAANPDAPG NGISWTPLER LEGIITPSGL HFERHHNGVP QIDPAVHRLV VHGLVVKSSS FGIDDLLRYP QTSRQCFVEC GGNGNAGWHL EPMQAPAGNV HGLASCSEWT GVPLATVLEE CGLQPNAKWL IAEGADAAAM NVSIPLEKAL DDALLALYQN GERLRPENGY PLRLILPGWE GVTNVKWLHR LQLAEQPAMA RNETAKYTEL LPSGQARQFS FVMEAKSLIT RPSAGQSLPG PGLHPISGLA WSGRGAIRRV EVSADGGKTW QDAALDPPVL PKCFTRFRLP WRWDGSPAVL KSRATDETGY VQPERQTLIA ERGRHGYFHY NAIVSWAVAA DGSVSHVYA
|
| |