Gene MCA0049 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMCA0049 
Symbol 
ID3103006 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylococcus capsulatus str. Bath 
KingdomBacteria 
Replicon accessionNC_002977 
Strand
Start bp48224 
End bp50446 
Gene Length2223 bp 
Protein Length740 aa 
Translation table11 
GC content65% 
IMG OID637169275 
Productcellulose-binding domain-containing protein 
Protein accessionYP_112589 
Protein GI53802764 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG5297] Cellobiohydrolase A (1,4-beta-cellobiosidase A) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATACGA AACACAGCGG TTACTTCAAG AATTCCCTTT GGCTGTTGGT TCTCGGCTCT 
TGTGCCTGGG CCGCTTCGTC GGTTTCGGCA TGGGCAGCCG ATGGATGCAG CTTTGAGTAC
ATCATCACCA GTCAGGCAGC GAACACCTTT TCCGCCAGTG TGAGGGTCAC GAACACGGGC
AGCGCGCTGA GCGGATGGAC CGTGGCGTGG AACATGCCCG GCGGACAGAA GATCACCCGA
TTGTGGGACG GCAAGTGGTC ACAGAGGCTG TCGGCGGTCA CCGTGCGCAA CCTGGAGTCC
AACCGCAAGG TGGCGAGCGG AGGAGTCATC CAGTTCGGAT TCGATGCCAC CTACTCCGGG
GTCAACGGGA TTCCTGCGGC CATGACGCTC AATGGGGCAC AATGTTCATC GACTCCGACT
CCGACTCCGA CTCCGACTCC GACGCCGACG CCGACGCCGA CGCCGACGCC GACGCCGACG
CCGACGCCGA CTCCGGGTCC GGCGGCCAAC CCGATCGGCA TCAACATCAC CGGGCTTTCC
TACTACGGGA CCGAGGTGCC GTTCCTGAAT CTGTTCAAGC TGTCCGAGCC GTGGCTGACC
CAGTGCGACG CCTACCAGGA TCCGAACTGC AGCACCTTCG TCGAAGCGGG CGGCAGTTCC
TGGAACACGA GGGAGCAGGC CAAGCTGATT CTGGATTCGA ACGGCTACCC GCGTTCGCTG
CCCGATCCCG CCCAGGGGGC GGCTAGCGGC ACCAACTACA CATCGGTCGC GACCCTGGTC
CCGACCGGCT TGAATTCCGC CACCCCGGCC GGACGGTTCA TTGTCCTTTA TGACGGCGAA
GGCACCCTGG CATACGGCCG CGGCGCGAGC AGGAATGCCT CGCTGTCCTC GCCGGGCCGG
GATGTCATCG ACGTCTCGAC CGACGGCATC CAGACCTGGA TCCAGGTCTC CATCAAGGCC
ACCGATCCCA ACAAGACCGG GAATCACATC AGGAACCTGC GCCTGCTGCA GGCCGGCGGG
GTCTGCAGCA ACGATCCGGC GGCGTACTGC GACCCCTCGG CGGCACAGAG CGGCTGTGCA
AGCGGCGGCT CGTGCCGGTC GTTCGAGCAG GTTTACCCGA CTCAACCGTT CGACCCTCGC
TTCCTGCGCA ATCTGGCGGG TTTCAAAGCC GTCCGCTTCA TGGCGTTCCA GAACACCAAC
GACTCGCAGG TCGAACTGTG GGCCGACCGC ACCCTGCCCG ACGACGTCAC CTGGGTGTCC
GAGCGCGGCG ACGGCGGTCC GGTGGAGATG GCGGTGGCGC TCGGCAATCA GCTCGGTGCC
GACATCTGGG TGAACATGCC GACCCATGCG GACGACGGCT ATGTGCGCAG TTTCGCCACT
CTGGTGAAGA ACACGCTCGC GGCAAGCCGG AAGGTGTACG TGGAGTACAG CAACGAGGCC
TGGAACGGTG CGTTTTCCGC CGGGAGCTGG ATGGAAAATC AGGCACTCAC GCGCTGGGCC
GGCGCCAGCG ACACACCCTT CGGCAAGCGC CTGCAATGGT ATGGCATGCG CACCGCCCAG
ATCTGCGACA TCTGGAAGGC GGTCTGGGGT GCATCGTTCA GCCGGGTGGT GTGCGTCCTG
GGCGCCCAGG CGGCCAATCC CTGGACCGCC AGGCAGGCGC TGGACTGTCC TTTATGGGCG
GCGGAAAACG GCGGAGCCTC GTGCGTGCAG CACGGCATCC GCGCGCTGGC GATTGCTCCT
TATTTCGGCT ATTACCTCGG TCTGCCGGAG AATCGGACGG TCGTGGACGC GTGGACCGGT
CAGGCGGACG GCGGGCTGGC CAGCCTGTTC GCCGAGCTCC TGCAGGGCGG TTCCTTCGTC
AACGGGCCGG CCGGCGGTGC GCTGGAGGAC GCAAGGCGGC AGATGCTGCA GTACAAGGCC
GTCGCCGCGG AATATGGGCT GGAGCTGGTC GCTTATGAGG GGGGGCAACA TCTGGCGGGT
GTCGGCGCGG TGGTCGACGA CAATGCGGTC ACCGATCTGT TCGTCGCCGC CAACCGCGAC
GGCCGCATGG GCCCGGTCTA CAGCCGGCAC CTGAACGACT GGAGCGCCGC GGGCGGCGGA
CTCTACAATC TGTGGAACAG CGTGGAGCCC TATTCGAAGT GGGGGGCGTG GGGATTGCTC
GAATACCGCG ATCAGGGCGG CGCGCCGAAA TACGACGCGG TGAAGAGCCT CCTCTCCCCT
TAG
 
Protein sequence
MDTKHSGYFK NSLWLLVLGS CAWAASSVSA WAADGCSFEY IITSQAANTF SASVRVTNTG 
SALSGWTVAW NMPGGQKITR LWDGKWSQRL SAVTVRNLES NRKVASGGVI QFGFDATYSG
VNGIPAAMTL NGAQCSSTPT PTPTPTPTPT PTPTPTPTPT PTPTPGPAAN PIGINITGLS
YYGTEVPFLN LFKLSEPWLT QCDAYQDPNC STFVEAGGSS WNTREQAKLI LDSNGYPRSL
PDPAQGAASG TNYTSVATLV PTGLNSATPA GRFIVLYDGE GTLAYGRGAS RNASLSSPGR
DVIDVSTDGI QTWIQVSIKA TDPNKTGNHI RNLRLLQAGG VCSNDPAAYC DPSAAQSGCA
SGGSCRSFEQ VYPTQPFDPR FLRNLAGFKA VRFMAFQNTN DSQVELWADR TLPDDVTWVS
ERGDGGPVEM AVALGNQLGA DIWVNMPTHA DDGYVRSFAT LVKNTLAASR KVYVEYSNEA
WNGAFSAGSW MENQALTRWA GASDTPFGKR LQWYGMRTAQ ICDIWKAVWG ASFSRVVCVL
GAQAANPWTA RQALDCPLWA AENGGASCVQ HGIRALAIAP YFGYYLGLPE NRTVVDAWTG
QADGGLASLF AELLQGGSFV NGPAGGALED ARRQMLQYKA VAAEYGLELV AYEGGQHLAG
VGAVVDDNAV TDLFVAANRD GRMGPVYSRH LNDWSAAGGG LYNLWNSVEP YSKWGAWGLL
EYRDQGGAPK YDAVKSLLSP