Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | MCA0553 |
Symbol | |
ID | 3103098 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylococcus capsulatus str. Bath |
Kingdom | Bacteria |
Replicon accession | NC_002977 |
Strand | + |
Start bp | 579565 |
End bp | 582750 |
Gene Length | 3186 bp |
Protein Length | 1061 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637169768 |
Product | discoidin domain-containing protein |
Protein accession | YP_113072 |
Protein GI | 53805205 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3387] Glucoamylase and related glycosyl hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.326441 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTGAAG CATTTCGCAG TCCCGCCTGG TCCTGGGGAC TGGTGTTGTG GTGGGGTTTG GCCGCGGGTG GCGGGACCGA GGTGCTGGAC GATTTCGAGA CCCTGTCCGG CTGGACGGCG GAAGGTTCGC CCGGCACGCG CTTCGAACTG AAGCAGGACG GTGGGTTCGA GGGCAGGGGG CTCAGGCTGG ACTTCGAATT CGCCGGCGGT GCCGGTTACG TCATCCTGCG CAAGCATTTC GAGCTTCCCT TGCCGGAAAA CTACGCGTTC AGTTTCCGCG CCAAGGGCGC GGGGCCGGCC AACACCCTGG AGTTCAAGCT GCTGGACCCG GGCGCGCAGA ACGTCTGGTG GCACCGGCGC GAGGCCAAGG GGTTGCCCGC AGGCTGGCAG CTCCAGGCGA TCCGCAAGTC CGCCATCGAC TTCGCCTGGG GGCCTTCGGG CGGTGCGCCG TTGCTGGCGC TGGGCGGCAT CGAGTTCGCG CTGACCGCGG CGGCCGGCGG CAAGGGTACG CTGTGGCTGG ACGATCTCCG CTTCGAGCGG CGCGATGCGG TGGTGGACTA CCGGGGGCAA CCCAAGGTGT CGGCATCCTC CGCCAGCGGC GAGGACGGCG CGGAGAATGC GCTGGATGGG CAGACGCAGA CCGTCTGGCG TAGCGCATCC AGACCTGCGC GCCAGTGGCT CCGGCTCGAT TTCGGCCAGC GCCGCGAATA CGGCGGGCTT GTCCTCGCCT GGGAGGGGGA GTGCCACGCG CGGGACTATG CCGTGGAGGT TTCGGACGAC GGCCGGCGTT GGCGCGGCGT CTACCGGGTG GCGGATGGCA ACGGCCGGCG CGACTACCTG CCGCTGCCGG AGATCGAATC GCGCTACCTC CGGTTGGAAC TGCGCAAGAG CGCCTGCGGC AAGGGCTACG CGCTCCGCGA GCTGACGGTG AAGCCGCCCG ATTTCGCCGC TTCCCCCAAC CGCTTGTTCG AGCACATCGC CCGCAATGAG GCGCGCGGCG CTTATCCGCG CTATTTCCGA GGCGAGCAGA CGTACTGGAC GCTGGTTGGC GCCAACGGCG GCCACCGCAA GGGGCTCTTG GGGATGGATG GGGCGCTGGA GACGGAACGC GGCGGCTTCA CGGTGGAGCC GTTCCTGTAT GCGGACGGCC GCTTGCTGGG CTGGAACGAG GGGCGGCTCT CGCAATCGCT GGAACAGGGG GATCTGCCTC TGCCGACCGT GGAACGCGAC TACGGGGACC TGAGCCTGGA GGTGACGGGG TTCGCCGGTA AGCTCGGCGA CGCTCCGCTG TCGCTCGCCC GCTACCGGGT GGCAAACCGC GCCGTGACGG CGCGCCGGCT CAGGTTGTTC CTGGCCGTAC GTCCGTTCCA GGTCAATCCG CCCTGGCAGT CGCTCAACAT GAAGGGCGGG GTGAGTCCGA TCCATCGGAT CGAGGCTTCC GGCCGCGTAC TCGAAGTGGA CGGCGCCGAC GCGCTGGTAG CGATGAATCC GCCCGACGGC TTCGGCGCCG CCGGTTTCGA CCAGGGCGAC ATCACCGATT TTCTGGGTGA ATCCCGGCTG CCGCCGCGCA CCGTGGCTTC CGATTCCGCC GGGTATGCAT CGGGAGCCTG GAGGTTCGAT CTGGAGCTGG GGCCGGCAGC GGCACGGGAG ATCTTCATCG CCGTTCCCGA GCGCAAGAAG GGTTCGTCCT TGGCCGTCGA AACGGCGCTG AAGGCCGAAG GCGAGACCCT GGGAGCCCGA TTGTGGCAGG AGGCGGTCGG CTACTGGCGG CAGGCGCTGG CCGGGCCCGA TTTTCTGCTG CCGGAATCCG AACGGGACCT GGTGCGGAGC CTCCGGGCCA ATCTGGCCTA CATCCTGGTG AACCGCGACG GTCCGGCCTT GCAGCCGGGT TCCCGTACCT ATGCCCGCAG CTGGATCCGC GATGGCGCAT TGATGTCCTC GGCCCTGCTC ATGCTGGGAC GCGGCGAGGA AGTGAAGCGC TTCCTCGAAT GGTACGCGCG GTTCCAGAGC GCGGACGGCG CCATTCCCTG CTGTATCGAC AGCCGCGGCC CGGACAGTGT GCCGGAGAAC GACAGCCACG GCGAGTTCGT CTACACCGTG GCCGAGTACT ACCGTCATAC CCGCGACCTC GAGTTCGTGC AGGCGCTGTG GCCCCACATC GTCGCAGCCA TGGGCCACGT CGACGCGCTG CGGCACCAGC GGATGACCGA GGTCTACCGG AGCGGCGAGG GCCGGGCCTT CTACGGTCTG ATGCCGGCCT CGATCAGCCA CGAAGGCTAT GCCTCGCAGC CGGTACATGC GTTCTGGGAC GACTTTTGGA CTCTGCGCGG CATCCGCGAT GCGGTGATGC TCGCGAAGGT CCTGGGCGAT CACGCCCACG CGGGAAGCTG GAGCCACATG GCGGCGGAGT TCGGCGGCCA CCTTCATGCC GCGCTGCAAG CCACCATGGC GCGTAAGCAC ATCGACTACA TCCCGGCCTC GGCCGACCTG GGTGACATCG ACCCCAACGC CGTCGCCATC ATGGTGTCGA TCGCGGGCGA GGCCGGAAGG TTGCCGCAGG CCGCGCTGGC CAAGACCTTC GACGATTATC TCGCGCACTT CCGCAAGCGC CGCGACGGCA ACGGCGACGA CGGCCATACG CCGTACGAGG TGCGCCTCGT CGAGGCTCTG GTGCGGCTGG GGCGGCGTGA CGACGCCTGG GAGGTCCTGC GTTCCCTGCT GCGCGACCAG CGGCCCCAGG CCTGGCGCCA GTGGGCCGAA GTGGTGTGGC GCAACCCCGA GGCGCCGCGG TTCATCGGCG ACATGCCGCA TTCCTGGATC GGCGCGGAAT TCATCCGCTC GCTGCGGAGC TGTTTCGCCT ATGAGGACGA TGCCGAAGAT TCGCTGGTGC TCGCCGCGGG AATTCCCGCG GAATGGCTAT ACAATGCGGC CAGCGCCGAA GTCGGGGTCC GCCGCCTGCC TACCGTCCAT GGCCTGCTCG ATTACGGTCT GCGGACTGAA GGCGCCGAGA CTTTGAAGCT GCACATCGAC GGACGCCTGG CCGTGCCGCC CGGCGGCATC CGGATACGGC CGCCGGTGGC CAGGCCGATC CAGGCGGCGA AGGTGAACGG CACCGCGGTT TCCGGTTTCA CGGGCGCCGA GCTGAGGATC GAAGGCGTGC CGGCGGAGGT CGAGATCCGG TATTGA
|
Protein sequence | MIEAFRSPAW SWGLVLWWGL AAGGGTEVLD DFETLSGWTA EGSPGTRFEL KQDGGFEGRG LRLDFEFAGG AGYVILRKHF ELPLPENYAF SFRAKGAGPA NTLEFKLLDP GAQNVWWHRR EAKGLPAGWQ LQAIRKSAID FAWGPSGGAP LLALGGIEFA LTAAAGGKGT LWLDDLRFER RDAVVDYRGQ PKVSASSASG EDGAENALDG QTQTVWRSAS RPARQWLRLD FGQRREYGGL VLAWEGECHA RDYAVEVSDD GRRWRGVYRV ADGNGRRDYL PLPEIESRYL RLELRKSACG KGYALRELTV KPPDFAASPN RLFEHIARNE ARGAYPRYFR GEQTYWTLVG ANGGHRKGLL GMDGALETER GGFTVEPFLY ADGRLLGWNE GRLSQSLEQG DLPLPTVERD YGDLSLEVTG FAGKLGDAPL SLARYRVANR AVTARRLRLF LAVRPFQVNP PWQSLNMKGG VSPIHRIEAS GRVLEVDGAD ALVAMNPPDG FGAAGFDQGD ITDFLGESRL PPRTVASDSA GYASGAWRFD LELGPAAARE IFIAVPERKK GSSLAVETAL KAEGETLGAR LWQEAVGYWR QALAGPDFLL PESERDLVRS LRANLAYILV NRDGPALQPG SRTYARSWIR DGALMSSALL MLGRGEEVKR FLEWYARFQS ADGAIPCCID SRGPDSVPEN DSHGEFVYTV AEYYRHTRDL EFVQALWPHI VAAMGHVDAL RHQRMTEVYR SGEGRAFYGL MPASISHEGY ASQPVHAFWD DFWTLRGIRD AVMLAKVLGD HAHAGSWSHM AAEFGGHLHA ALQATMARKH IDYIPASADL GDIDPNAVAI MVSIAGEAGR LPQAALAKTF DDYLAHFRKR RDGNGDDGHT PYEVRLVEAL VRLGRRDDAW EVLRSLLRDQ RPQAWRQWAE VVWRNPEAPR FIGDMPHSWI GAEFIRSLRS CFAYEDDAED SLVLAAGIPA EWLYNAASAE VGVRRLPTVH GLLDYGLRTE GAETLKLHID GRLAVPPGGI RIRPPVARPI QAAKVNGTAV SGFTGAELRI EGVPAEVEIR Y
|
| |