Gene Information |
Locus tag | MCA1493 |
Symbol | |
ID | 3102989 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylococcus capsulatus str. Bath |
Kingdom | Bacteria |
Replicon accession | NC_002977 |
Strand | - |
Start bp | 1588125 |
End bp | 1591292 |
Gene Length | 3168 bp |
Protein Length | 1055 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637170668 |
Product | cellulose-binding domain-containing protein |
Protein accession | YP_113950 |
Protein GI | 53804422 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG5297] Cellobiohydrolase A (1,4-beta-cellobiosidase A) |
TIGRFAM ID | |
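The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the listed gene length includes the terminal stop codon; the record does not state either convention, though both are consistent with its numbers. The function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 1588125
END_BP = 1591292
GENE_LENGTH_BP = 3168
PROTEIN_LENGTH_AA = 1055


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one stop codon (assumed included)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, both checks agree with the listed Gene Length and Protein Length.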
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.498527 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAAGATCC GGAACGACGG CCCTGCGCTG GCGGGCTGGA ACGTCGCCTG GGAGATGCCC GACGGCCAGG AGATCACGAA TTCCTGGAAC GGCCGCTTCG TCCAGACGGG CGCACACGTC GACGTCGGCA ACCTGGATTG GAACAAGGAC ATTCCCGCGG GATCGAACAT CGAGTTCGGT TTCAACGGCA GCCACACCGG CAGGAACCGT ACGCCCGCGA GCCTGAGCGT CAACGGCGTG AAATGCACGA TGGCCGTGGC CGCGCCCCCG CCCTCGCCCA GACCGGCGGC CTGCGAAGCC ACCTATCAGA TTTCCGATCA GTGGAACACC GGCTTCACGG CCAACGTCCG GATCAGAAAC GATGGGGACC CGCTGGCGGG GTGGAAGGTC GTCTGGGATA TGCCCGATGG CCAGCGGGTC ACGAATTCTT GGAACGGTCG GTTCACCCAG AGCGGGTCCC ATGTGGAAGT CGGCCATCTC GACTGGAACA GGGACATTGC CACGGGATCG AACATCGAGT TCGGCTTCAA CGGCAGCCAC GCCGGTGCCA ACCGCGCACC TTCGAGCCTG AGCCTCAATG GTGTCGCCTG TGCGCTGGTC ACGGTTGCGC CGTCCCCGAC ACCCGCGCCG ACCGCTACGC CGAGCCCCAT CCCGACGCCG ACGCCCAGCC CGGCACCGAC CGCGAGCCCG AGTCCGGCGC CCACGCCGAC ACCGGCGCCG ACTTCCAGCC CCAGCCCTAC GCCTGCGCCG ACCGGACCGT GTCAGGCCGA CTACCAGGTG ACGGCGCAAT GGAATGATGG ATTCACTGCC AACGTGAAAG TCAGAAACAA CGGCACCGCC CTGGACGGCT GGACGGTCGC CTGGACCATG CCCAGCGGCC AAACCGTGAC GGGAATGTGG AACGGCCAGT ACGTCCAGAC CGGCCCCCAG GTCGCGGTGA CGCCCGCCGA CTGGAATCGC CAGATCGCAA GCGGCGCTTC CATCGAGTTC GGTTTCAACG GCAGCCATTC GGGCAGCAAC GCCGTACCCG GCGTGCTGAC GCTGAATGGT GCCGCTTGCG CCACCACCAC GGGCGGCGGC ACCCCCACGC CGACCCCGGT GCCGGTGGCG CCGGGGGCAC CGGCCGGTCT GTCCGCCACC GTGGCCGACA ACGCGGTCGT CAACCTGGTT TGGACGGCGG CCGACGCATT CGCCCAGGGT TTCCGGATCG AGCGCCGCAC CGCCGCCGGT GTCGACTGGG CTCTCGTGGC CGAAACGGCC GCCGGTGTTC CGTCCTTCGG CGACGGCACG GTGGCCATGG GGAACGACTA CGAGTACCGG GTCTATGCCT TCAACGCGGT GGGAAACAGC CCGCCCGCGA CGGCATCCGC TTCCTTGCCG ACCTTGCTCA AGTACGGCGA GTCGCAGTAC CAGAAACAGG GCTGTGCCAG CTGCCACGGG AAGGACGGCA AGGGAGGCTT CACCAACAAG CCGCTGGTCC ATTTCACGGC CGACCAGCTC GCGACGCTCA CCGAGATCAT CCGCGTCCGC ATGCCCCCGT CCAAGCCTGC GAACTGCGTC AGCAACTGCG CGGCGGGTAC CGCCAAGTAC ATCATCGAGG TGCTTGCCGC CGCCGCTTCG GGAGGCGGCG GAGGCGGCGG CAACGCCTGC GCCGGCAGCC CGCCGCCCGG TGGACGGGCA TTGCGCCTGC TCACCCGCCA GGAATACCAG AACACGGTCA ACGACCTGCT CGGCCTGTCC GAAAACCTGG TGCATCTCCT GCCGGAGGAA AACCGCGTGG ATGGCTTCGA CAACAATGTC 
GCAACCAACC TGGTGACGAG CATTCGCCTC GAGGCTTTCC TGACTCAGGC CGAAGCCCTC GCCGCAAAGG CGGTGCAGCA AAACTGGAAC GCGCTGCTGC CGTGCACACA GCAGGATGCG GCCTGTGCCC GTCGGTTCGT CGAAGCTTTC GGCAAGCGCG CCTACCGGCG GCCATTGACC CCGGAAGAGG CCGACGCCTA CGCTGCGCTG TTCGGGCAGG GCTCATTCCG GGAAGGCGTG GAGGCGACCA TCACCCATAT GCTGGTTTCG CCGAATTTCC TCTACCGTTC CGAGCTGGGA GAGGTGCAGG CGGACGGGAC GTACAAGCTG ACGCCGTACG AAACGGCGAG CGCCCTTTCC TATCTGTTCC TGGGGTCGCT GCCCGACGGC GAACTGGTCA GTGCCGCGGA CCAGAACCTG CTGGACACGC CCGACCAGCG CATCGCCCAC GCCTCGCGCC TGTTGAGCCT GCCGCGCAGC CGCAACCGGG TGGGCCATTT CGTCGGCCAG TGGCTGCTGG GTACCAGCCC GTACACGCTG CCGGAGAAGG ACCAGGCGGT GTATCCGCGG TATGACGCCG CGGTCAGGTC CGCAATGTCC GAAGAACTGA TCGGCTTCTT CGATCACGTC GCCTTCGAGT CCACCCAGAG CTTTCCGGAG CTGTTCACCG CGAACTACGT CGTCGTCGAC GACACCCTGG CGGACTACTA CGGGCTCGGC CGTCCGGGGG GCAGCGGATT CGCGCCGGTG ACGGTGAGCG ACGGAACCCG CACCGGCATC CTCACGCTCG GTGCGGTGCT GTCGCGCTAT GCCAACAGCA ACGAGTCGCA TCCGTTCAAA CGCGGTGGTT TCCTGTACAA GCGCCTGCTT TGCCGCGATT TGCCGCTGCC GGCCAACGCG GGCTTCATCC AGGCGCCGCA GCCGGATCCG AATGCGACGA CGCGGCAGCG CTTCGAGTTC CACAGCAAGT CCAATACCAG CTGCTACGGC TGCCACCAGT ACCTGGACGG ACCGGGCTTC GGCTTCGAGA ACTACGACGG CGCCGGCATC TTCAGGGCAT CCGAAAACGG ACAGGCCATC GATGCCAGCG GCGTCCTGCG CGGCCTGGAG ACGTTCACGC CCACCGAGGA GCTGAGCTTC ACCGATCTCC CCGACCTCAG CCGGAAAATC GCGGCCAGCC CGACCGCGGC GCAGTGCGCG GCGCGCCAGT ACTACCGATT TGCCACCGGC AGACGGGAGG CATCGTCCGA CAGCTGCGCC CTCGACAGTT TCCTGCAGAC CTATTCGGCC AACGGCCATA ACCTGCAGAC CATGCTGCTC GGCATCGTCA ACGCACCCGG CTTCACCGTG CGCCGTGCCG ATCAATAA
|
Protein sequence | MKIRNDGPAL AGWNVAWEMP DGQEITNSWN GRFVQTGAHV DVGNLDWNKD IPAGSNIEFG FNGSHTGRNR TPASLSVNGV KCTMAVAAPP PSPRPAACEA TYQISDQWNT GFTANVRIRN DGDPLAGWKV VWDMPDGQRV TNSWNGRFTQ SGSHVEVGHL DWNRDIATGS NIEFGFNGSH AGANRAPSSL SLNGVACALV TVAPSPTPAP TATPSPIPTP TPSPAPTASP SPAPTPTPAP TSSPSPTPAP TGPCQADYQV TAQWNDGFTA NVKVRNNGTA LDGWTVAWTM PSGQTVTGMW NGQYVQTGPQ VAVTPADWNR QIASGASIEF GFNGSHSGSN AVPGVLTLNG AACATTTGGG TPTPTPVPVA PGAPAGLSAT VADNAVVNLV WTAADAFAQG FRIERRTAAG VDWALVAETA AGVPSFGDGT VAMGNDYEYR VYAFNAVGNS PPATASASLP TLLKYGESQY QKQGCASCHG KDGKGGFTNK PLVHFTADQL ATLTEIIRVR MPPSKPANCV SNCAAGTAKY IIEVLAAAAS GGGGGGGNAC AGSPPPGGRA LRLLTRQEYQ NTVNDLLGLS ENLVHLLPEE NRVDGFDNNV ATNLVTSIRL EAFLTQAEAL AAKAVQQNWN ALLPCTQQDA ACARRFVEAF GKRAYRRPLT PEEADAYAAL FGQGSFREGV EATITHMLVS PNFLYRSELG EVQADGTYKL TPYETASALS YLFLGSLPDG ELVSAADQNL LDTPDQRIAH ASRLLSLPRS RNRVGHFVGQ WLLGTSPYTL PEKDQAVYPR YDAAVRSAMS EELIGFFDHV AFESTQSFPE LFTANYVVVD DTLADYYGLG RPGGSGFAPV TVSDGTRTGI LTLGAVLSRY ANSNESHPFK RGGFLYKRLL CRDLPLPANA GFIQAPQPDP NATTRQRFEF HSKSNTSCYG CHQYLDGPGF GFENYDGAGI FRASENGQAI DASGVLRGLE TFTPTEELSF TDLPDLSRKI AASPTAAQCA ARQYYRFATG RREASSDSCA LDSFLQTYSA NGHNLQTMLL GIVNAPGFTV RRADQ
|
| |
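The gene sequence begins with GTG while the protein sequence begins with M. This is consistent with translation table 11, in which GTG is one of the alternative start codons and an initiating codon is read as methionine. The following is a minimal sketch of that translation, not the database's own pipeline: the names `translate_cds` and `START_CODONS` are hypothetical, the codon assignments follow the standard genetic code (which table 11 shares), and the start-codon set is the one listed by NCBI for table 11.

```python
from itertools import product

# Standard codon assignments, ordered with bases T, C, A, G
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons for translation table 11 (as listed by NCBI).
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str, is_cds_start: bool = True) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    If is_cds_start is True and the first codon is a table 11 start codon,
    it is translated as M regardless of its usual amino acid.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and is_cds_start and codon in START_CODONS:
            aa = "M"
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases and last 18 bases of the gene sequence above.
    print(translate_cds("GTGAAGATCC GGAACGACGG CCCTGCGCTG"))
    print(translate_cds("CGCCGTGCCG ATCAATAA", is_cds_start=False))
```

The first call reproduces the opening ten residues of the protein sequence (MKIRNDGPAL), and the second reproduces its final five residues (RRADQ) before the TAA stop codon.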