Gene Information |
Locus tag | MCA0049 |
Symbol | |
ID | 3103006 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylococcus capsulatus str. Bath |
Kingdom | Bacteria |
Replicon accession | NC_002977 |
Strand | + |
Start bp | 48224 |
End bp | 50446 |
Gene Length | 2223 bp |
Protein Length | 740 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637169275 |
Product | cellulose-binding domain-containing protein |
Protein accession | YP_112589 |
Protein GI | 53802764 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG5297] Cellobiohydrolase A (1,4-beta-cellobiosidase A) |
TIGRFAM ID | |
|
|
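The coordinate, length and protein-length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the record. It assumes 1-based inclusive coordinates and a single terminal stop codon, and the function names are hypothetical.

```python
# Values copied from the Gene Information table above.
START_BP = 48224
END_BP = 50446
GENE_LENGTH_BP = 2223
PROTEIN_LENGTH_AA = 740


def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons should hold if the record is internally consistent.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 50446 - 48224 + 1 gives 2223 bp, and 2223 / 3 - 1 gives 740 aa, which agrees with the table.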
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATACGA AACACAGCGG TTACTTCAAG AATTCCCTTT GGCTGTTGGT TCTCGGCTCT TGTGCCTGGG CCGCTTCGTC GGTTTCGGCA TGGGCAGCCG ATGGATGCAG CTTTGAGTAC ATCATCACCA GTCAGGCAGC GAACACCTTT TCCGCCAGTG TGAGGGTCAC GAACACGGGC AGCGCGCTGA GCGGATGGAC CGTGGCGTGG AACATGCCCG GCGGACAGAA GATCACCCGA TTGTGGGACG GCAAGTGGTC ACAGAGGCTG TCGGCGGTCA CCGTGCGCAA CCTGGAGTCC AACCGCAAGG TGGCGAGCGG AGGAGTCATC CAGTTCGGAT TCGATGCCAC CTACTCCGGG GTCAACGGGA TTCCTGCGGC CATGACGCTC AATGGGGCAC AATGTTCATC GACTCCGACT CCGACTCCGA CTCCGACTCC GACGCCGACG CCGACGCCGA CGCCGACGCC GACGCCGACG CCGACGCCGA CTCCGGGTCC GGCGGCCAAC CCGATCGGCA TCAACATCAC CGGGCTTTCC TACTACGGGA CCGAGGTGCC GTTCCTGAAT CTGTTCAAGC TGTCCGAGCC GTGGCTGACC CAGTGCGACG CCTACCAGGA TCCGAACTGC AGCACCTTCG TCGAAGCGGG CGGCAGTTCC TGGAACACGA GGGAGCAGGC CAAGCTGATT CTGGATTCGA ACGGCTACCC GCGTTCGCTG CCCGATCCCG CCCAGGGGGC GGCTAGCGGC ACCAACTACA CATCGGTCGC GACCCTGGTC CCGACCGGCT TGAATTCCGC CACCCCGGCC GGACGGTTCA TTGTCCTTTA TGACGGCGAA GGCACCCTGG CATACGGCCG CGGCGCGAGC AGGAATGCCT CGCTGTCCTC GCCGGGCCGG GATGTCATCG ACGTCTCGAC CGACGGCATC CAGACCTGGA TCCAGGTCTC CATCAAGGCC ACCGATCCCA ACAAGACCGG GAATCACATC AGGAACCTGC GCCTGCTGCA GGCCGGCGGG GTCTGCAGCA ACGATCCGGC GGCGTACTGC GACCCCTCGG CGGCACAGAG CGGCTGTGCA AGCGGCGGCT CGTGCCGGTC GTTCGAGCAG GTTTACCCGA CTCAACCGTT CGACCCTCGC TTCCTGCGCA ATCTGGCGGG TTTCAAAGCC GTCCGCTTCA TGGCGTTCCA GAACACCAAC GACTCGCAGG TCGAACTGTG GGCCGACCGC ACCCTGCCCG ACGACGTCAC CTGGGTGTCC GAGCGCGGCG ACGGCGGTCC GGTGGAGATG GCGGTGGCGC TCGGCAATCA GCTCGGTGCC GACATCTGGG TGAACATGCC GACCCATGCG GACGACGGCT ATGTGCGCAG TTTCGCCACT CTGGTGAAGA ACACGCTCGC GGCAAGCCGG AAGGTGTACG TGGAGTACAG CAACGAGGCC TGGAACGGTG CGTTTTCCGC CGGGAGCTGG ATGGAAAATC AGGCACTCAC GCGCTGGGCC GGCGCCAGCG ACACACCCTT CGGCAAGCGC CTGCAATGGT ATGGCATGCG CACCGCCCAG ATCTGCGACA TCTGGAAGGC GGTCTGGGGT GCATCGTTCA GCCGGGTGGT GTGCGTCCTG GGCGCCCAGG CGGCCAATCC CTGGACCGCC AGGCAGGCGC TGGACTGTCC TTTATGGGCG GCGGAAAACG GCGGAGCCTC GTGCGTGCAG CACGGCATCC GCGCGCTGGC GATTGCTCCT TATTTCGGCT ATTACCTCGG TCTGCCGGAG AATCGGACGG TCGTGGACGC GTGGACCGGT 
CAGGCGGACG GCGGGCTGGC CAGCCTGTTC GCCGAGCTCC TGCAGGGCGG TTCCTTCGTC AACGGGCCGG CCGGCGGTGC GCTGGAGGAC GCAAGGCGGC AGATGCTGCA GTACAAGGCC GTCGCCGCGG AATATGGGCT GGAGCTGGTC GCTTATGAGG GGGGGCAACA TCTGGCGGGT GTCGGCGCGG TGGTCGACGA CAATGCGGTC ACCGATCTGT TCGTCGCCGC CAACCGCGAC GGCCGCATGG GCCCGGTCTA CAGCCGGCAC CTGAACGACT GGAGCGCCGC GGGCGGCGGA CTCTACAATC TGTGGAACAG CGTGGAGCCC TATTCGAAGT GGGGGGCGTG GGGATTGCTC GAATACCGCG ATCAGGGCGG CGCGCCGAAA TACGACGCGG TGAAGAGCCT CCTCTCCCCT TAG
|
Protein sequence | MDTKHSGYFK NSLWLLVLGS CAWAASSVSA WAADGCSFEY IITSQAANTF SASVRVTNTG SALSGWTVAW NMPGGQKITR LWDGKWSQRL SAVTVRNLES NRKVASGGVI QFGFDATYSG VNGIPAAMTL NGAQCSSTPT PTPTPTPTPT PTPTPTPTPT PTPTPGPAAN PIGINITGLS YYGTEVPFLN LFKLSEPWLT QCDAYQDPNC STFVEAGGSS WNTREQAKLI LDSNGYPRSL PDPAQGAASG TNYTSVATLV PTGLNSATPA GRFIVLYDGE GTLAYGRGAS RNASLSSPGR DVIDVSTDGI QTWIQVSIKA TDPNKTGNHI RNLRLLQAGG VCSNDPAAYC DPSAAQSGCA SGGSCRSFEQ VYPTQPFDPR FLRNLAGFKA VRFMAFQNTN DSQVELWADR TLPDDVTWVS ERGDGGPVEM AVALGNQLGA DIWVNMPTHA DDGYVRSFAT LVKNTLAASR KVYVEYSNEA WNGAFSAGSW MENQALTRWA GASDTPFGKR LQWYGMRTAQ ICDIWKAVWG ASFSRVVCVL GAQAANPWTA RQALDCPLWA AENGGASCVQ HGIRALAIAP YFGYYLGLPE NRTVVDAWTG QADGGLASLF AELLQGGSFV NGPAGGALED ARRQMLQYKA VAAEYGLELV AYEGGQHLAG VGAVVDDNAV TDLFVAANRD GRMGPVYSRH LNDWSAAGGG LYNLWNSVEP YSKWGAWGLL EYRDQGGAPK YDAVKSLLSP
|
| |
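The relationship between the gene sequence and the protein sequence (translation table 11) can be illustrated with a short sketch. It is not the database's own tooling. The names `CODON_TABLE` and `translate` are hypothetical, the codon table is built from the standard compact amino-acid string in TCAG base order, and alternative start codons are not handled because this gene begins with ATG.

```python
# Codon assignments for translation table 11 (same as the standard code),
# listed in TCAG order for first, second and third base.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a + strand CDS, ignoring whitespace, up to the first stop."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First three 10-base blocks of the gene sequence above.
    print(translate("ATGGATACGA AACACAGCGG TTACTTCAAG"))
```

Applied to the first 30 bases of the gene sequence, this yields MDTKHSGYFK, the first block of the protein sequence. The full sequence can be passed in the same way, since the blank-separated blocks are joined before translation.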