Gene MCA1493 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMCA1493 
Symbol 
ID3102989 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylococcus capsulatus str. Bath 
KingdomBacteria 
Replicon accessionNC_002977 
Strand
Start bp1588125 
End bp1591292 
Gene Length3168 bp 
Protein Length1055 aa 
Translation table11 
GC content67% 
IMG OID637170668 
Productcellulose-binding domain-containing protein 
Protein accessionYP_113950 
Protein GI53804422 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG5297] Cellobiohydrolase A (1,4-beta-cellobiosidase A) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.498527 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAGATCC GGAACGACGG CCCTGCGCTG GCGGGCTGGA ACGTCGCCTG GGAGATGCCC 
GACGGCCAGG AGATCACGAA TTCCTGGAAC GGCCGCTTCG TCCAGACGGG CGCACACGTC
GACGTCGGCA ACCTGGATTG GAACAAGGAC ATTCCCGCGG GATCGAACAT CGAGTTCGGT
TTCAACGGCA GCCACACCGG CAGGAACCGT ACGCCCGCGA GCCTGAGCGT CAACGGCGTG
AAATGCACGA TGGCCGTGGC CGCGCCCCCG CCCTCGCCCA GACCGGCGGC CTGCGAAGCC
ACCTATCAGA TTTCCGATCA GTGGAACACC GGCTTCACGG CCAACGTCCG GATCAGAAAC
GATGGGGACC CGCTGGCGGG GTGGAAGGTC GTCTGGGATA TGCCCGATGG CCAGCGGGTC
ACGAATTCTT GGAACGGTCG GTTCACCCAG AGCGGGTCCC ATGTGGAAGT CGGCCATCTC
GACTGGAACA GGGACATTGC CACGGGATCG AACATCGAGT TCGGCTTCAA CGGCAGCCAC
GCCGGTGCCA ACCGCGCACC TTCGAGCCTG AGCCTCAATG GTGTCGCCTG TGCGCTGGTC
ACGGTTGCGC CGTCCCCGAC ACCCGCGCCG ACCGCTACGC CGAGCCCCAT CCCGACGCCG
ACGCCCAGCC CGGCACCGAC CGCGAGCCCG AGTCCGGCGC CCACGCCGAC ACCGGCGCCG
ACTTCCAGCC CCAGCCCTAC GCCTGCGCCG ACCGGACCGT GTCAGGCCGA CTACCAGGTG
ACGGCGCAAT GGAATGATGG ATTCACTGCC AACGTGAAAG TCAGAAACAA CGGCACCGCC
CTGGACGGCT GGACGGTCGC CTGGACCATG CCCAGCGGCC AAACCGTGAC GGGAATGTGG
AACGGCCAGT ACGTCCAGAC CGGCCCCCAG GTCGCGGTGA CGCCCGCCGA CTGGAATCGC
CAGATCGCAA GCGGCGCTTC CATCGAGTTC GGTTTCAACG GCAGCCATTC GGGCAGCAAC
GCCGTACCCG GCGTGCTGAC GCTGAATGGT GCCGCTTGCG CCACCACCAC GGGCGGCGGC
ACCCCCACGC CGACCCCGGT GCCGGTGGCG CCGGGGGCAC CGGCCGGTCT GTCCGCCACC
GTGGCCGACA ACGCGGTCGT CAACCTGGTT TGGACGGCGG CCGACGCATT CGCCCAGGGT
TTCCGGATCG AGCGCCGCAC CGCCGCCGGT GTCGACTGGG CTCTCGTGGC CGAAACGGCC
GCCGGTGTTC CGTCCTTCGG CGACGGCACG GTGGCCATGG GGAACGACTA CGAGTACCGG
GTCTATGCCT TCAACGCGGT GGGAAACAGC CCGCCCGCGA CGGCATCCGC TTCCTTGCCG
ACCTTGCTCA AGTACGGCGA GTCGCAGTAC CAGAAACAGG GCTGTGCCAG CTGCCACGGG
AAGGACGGCA AGGGAGGCTT CACCAACAAG CCGCTGGTCC ATTTCACGGC CGACCAGCTC
GCGACGCTCA CCGAGATCAT CCGCGTCCGC ATGCCCCCGT CCAAGCCTGC GAACTGCGTC
AGCAACTGCG CGGCGGGTAC CGCCAAGTAC ATCATCGAGG TGCTTGCCGC CGCCGCTTCG
GGAGGCGGCG GAGGCGGCGG CAACGCCTGC GCCGGCAGCC CGCCGCCCGG TGGACGGGCA
TTGCGCCTGC TCACCCGCCA GGAATACCAG AACACGGTCA ACGACCTGCT CGGCCTGTCC
GAAAACCTGG TGCATCTCCT GCCGGAGGAA AACCGCGTGG ATGGCTTCGA CAACAATGTC
GCAACCAACC TGGTGACGAG CATTCGCCTC GAGGCTTTCC TGACTCAGGC CGAAGCCCTC
GCCGCAAAGG CGGTGCAGCA AAACTGGAAC GCGCTGCTGC CGTGCACACA GCAGGATGCG
GCCTGTGCCC GTCGGTTCGT CGAAGCTTTC GGCAAGCGCG CCTACCGGCG GCCATTGACC
CCGGAAGAGG CCGACGCCTA CGCTGCGCTG TTCGGGCAGG GCTCATTCCG GGAAGGCGTG
GAGGCGACCA TCACCCATAT GCTGGTTTCG CCGAATTTCC TCTACCGTTC CGAGCTGGGA
GAGGTGCAGG CGGACGGGAC GTACAAGCTG ACGCCGTACG AAACGGCGAG CGCCCTTTCC
TATCTGTTCC TGGGGTCGCT GCCCGACGGC GAACTGGTCA GTGCCGCGGA CCAGAACCTG
CTGGACACGC CCGACCAGCG CATCGCCCAC GCCTCGCGCC TGTTGAGCCT GCCGCGCAGC
CGCAACCGGG TGGGCCATTT CGTCGGCCAG TGGCTGCTGG GTACCAGCCC GTACACGCTG
CCGGAGAAGG ACCAGGCGGT GTATCCGCGG TATGACGCCG CGGTCAGGTC CGCAATGTCC
GAAGAACTGA TCGGCTTCTT CGATCACGTC GCCTTCGAGT CCACCCAGAG CTTTCCGGAG
CTGTTCACCG CGAACTACGT CGTCGTCGAC GACACCCTGG CGGACTACTA CGGGCTCGGC
CGTCCGGGGG GCAGCGGATT CGCGCCGGTG ACGGTGAGCG ACGGAACCCG CACCGGCATC
CTCACGCTCG GTGCGGTGCT GTCGCGCTAT GCCAACAGCA ACGAGTCGCA TCCGTTCAAA
CGCGGTGGTT TCCTGTACAA GCGCCTGCTT TGCCGCGATT TGCCGCTGCC GGCCAACGCG
GGCTTCATCC AGGCGCCGCA GCCGGATCCG AATGCGACGA CGCGGCAGCG CTTCGAGTTC
CACAGCAAGT CCAATACCAG CTGCTACGGC TGCCACCAGT ACCTGGACGG ACCGGGCTTC
GGCTTCGAGA ACTACGACGG CGCCGGCATC TTCAGGGCAT CCGAAAACGG ACAGGCCATC
GATGCCAGCG GCGTCCTGCG CGGCCTGGAG ACGTTCACGC CCACCGAGGA GCTGAGCTTC
ACCGATCTCC CCGACCTCAG CCGGAAAATC GCGGCCAGCC CGACCGCGGC GCAGTGCGCG
GCGCGCCAGT ACTACCGATT TGCCACCGGC AGACGGGAGG CATCGTCCGA CAGCTGCGCC
CTCGACAGTT TCCTGCAGAC CTATTCGGCC AACGGCCATA ACCTGCAGAC CATGCTGCTC
GGCATCGTCA ACGCACCCGG CTTCACCGTG CGCCGTGCCG ATCAATAA
 
Protein sequence
MKIRNDGPAL AGWNVAWEMP DGQEITNSWN GRFVQTGAHV DVGNLDWNKD IPAGSNIEFG 
FNGSHTGRNR TPASLSVNGV KCTMAVAAPP PSPRPAACEA TYQISDQWNT GFTANVRIRN
DGDPLAGWKV VWDMPDGQRV TNSWNGRFTQ SGSHVEVGHL DWNRDIATGS NIEFGFNGSH
AGANRAPSSL SLNGVACALV TVAPSPTPAP TATPSPIPTP TPSPAPTASP SPAPTPTPAP
TSSPSPTPAP TGPCQADYQV TAQWNDGFTA NVKVRNNGTA LDGWTVAWTM PSGQTVTGMW
NGQYVQTGPQ VAVTPADWNR QIASGASIEF GFNGSHSGSN AVPGVLTLNG AACATTTGGG
TPTPTPVPVA PGAPAGLSAT VADNAVVNLV WTAADAFAQG FRIERRTAAG VDWALVAETA
AGVPSFGDGT VAMGNDYEYR VYAFNAVGNS PPATASASLP TLLKYGESQY QKQGCASCHG
KDGKGGFTNK PLVHFTADQL ATLTEIIRVR MPPSKPANCV SNCAAGTAKY IIEVLAAAAS
GGGGGGGNAC AGSPPPGGRA LRLLTRQEYQ NTVNDLLGLS ENLVHLLPEE NRVDGFDNNV
ATNLVTSIRL EAFLTQAEAL AAKAVQQNWN ALLPCTQQDA ACARRFVEAF GKRAYRRPLT
PEEADAYAAL FGQGSFREGV EATITHMLVS PNFLYRSELG EVQADGTYKL TPYETASALS
YLFLGSLPDG ELVSAADQNL LDTPDQRIAH ASRLLSLPRS RNRVGHFVGQ WLLGTSPYTL
PEKDQAVYPR YDAAVRSAMS EELIGFFDHV AFESTQSFPE LFTANYVVVD DTLADYYGLG
RPGGSGFAPV TVSDGTRTGI LTLGAVLSRY ANSNESHPFK RGGFLYKRLL CRDLPLPANA
GFIQAPQPDP NATTRQRFEF HSKSNTSCYG CHQYLDGPGF GFENYDGAGI FRASENGQAI
DASGVLRGLE TFTPTEELSF TDLPDLSRKI AASPTAAQCA ARQYYRFATG RREASSDSCA
LDSFLQTYSA NGHNLQTMLL GIVNAPGFTV RRADQ