Gene M446_3420 details

Gene Information

Locus tag: M446_3420
Symbol:
ID: 6130578
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium sp. 4-46
Kingdom: Bacteria
Replicon accession: NC_010511
Strand:
Start bp: 3794059
End bp: 3795966
Gene Length: 1908 bp
Protein Length: 635 aa
Translation table: 11
GC content: 71%
IMG OID: 641643591
Product: cellulase
Protein accession: YP_001770243
Protein GI: 170741588
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2730] Endoglucanase
TIGRFAM ID:
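
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a complete open reading frame ending in a single stop codon; neither is stated in the record itself. The function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 3794059
end_bp = 3795966


def gene_length(start, end):
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(n_bp):
    """Residue count for a CDS, assuming one terminal stop codon."""
    if n_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return n_bp // 3 - 1


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1908 635
```

Under those assumptions the results agree with the listed Gene Length (1908 bp) and Protein Length (635 aa).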


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.342079
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCCTCA AGAGCAGCAA CCCAGCCACG CAGATCGCGG CCCCCCCGGC GCCATGGCTG 
GACGCATCGA CGGACAGCGG TGCGCTCGGC GACAACCTGA CCAGCTTCCG GAGCCTGAAG
CTGGACGGAA CGGGCGCGCC CGGTACGGCG ATCCTGGTGA GCTACACCGG AACGACGGCG
GCCGGTAAGC GCGTCTCCGG CACCATGCCG GCCGTGACCG TGGACGCCTC GGGGGCCTGG
AGCGTCGCCA CGACCAGCCT GTCCGACGGC GTGTACGCCT TCACGGCCGT CGCGCAGAGC
GGGCCCAGGA GCACGAGCGC GCCCTCGCCC GCCCTCACCG TCACCATCGA CACGACGCCC
CCGCCGGCGC CGATCCTCAG GGATTTCCCG GCCGCCTCCA CGAACAACGC GACGCCGACC
CTGGCCGGGA CGGCCGAGAA GGGCGCCAAG GTCGCCATCT ACATGGATGG TGCCGCGACC
GCCCAGGCCA TCGTGACGGC CGACGCGACG GGTGCGTGGT CCTACACGGA GGCGAGCAGG
CTGATCGACG GCACCCACAG CTTCGTGGCC ACGGCCACGG ACGCGGCCGG GAACACCTCG
GTGCGCTCCT CCGCGAAGTC CGTCATCATC GACACCGTCG CGCCCACCGA GACCATCACC
CAGCTGCTCG TCGCGGGCGA CAACGTGATC GACGCCGGCG AGCAGGCGGC CGGCACCGTC
ACGGTCTCGG GCACGCTCTC GGCCGCGCTC GCCCCGGCGG AGAGCCTGCT GCTGACCGTG
GCCGGGGCGA CCTACACGGT GCCGCAGGCC AGCCTCTCGG GCACCAGCTT CTCGCTGCAG
GTGGCCAAGC CGGCCGCCGG CTGGGCGAGC GGATCCGCCT CGGCGCGGGT GCAGGACGCC
GCCGGCAATG CGGGCCAGAC CACCACGCAG AGCTTCACCC TCGGCGGCGC GCCGGTTCCC
CCGGCGGGCC AGCTCTCCCT GCTCGGCATC AACCTCGCGG GCGGCGAGTT CGGCAGCGCG
GTGCCCGGCC GGTACGGGAC CGACTACATC TACCCCAACC ACGCGGAGAT CGACTACTAC
GCCGGCAAGG GCCTGAACGT CATCCGCCTG CCCTTCCTGT GGGAGCGCCT GCAGCCCGTC
CAGGGCGGCG CCCTGAGCAG CAGCGATCTC GCCTACATCG ACGACGTCGT GAGCTACGCG
AACGCCAAGG GCATGAAGGT CGTTCTCGAC ATGCACAATT ACGGCTCCGG TTACGGCTCC
CCCGTCGGCA GCGCGGCCAC CCCGGTCGGC GCCTTCGCGG ATTTCTGGGG CAGGATGGCG
GGGCACTTCG CGTCCAACCC GAACGTGCTG TTCGGCCTGA TGAACGAGCC GCAGCAATCC
ACCGCCACCG AGTGGCTCGG CGACGTCAAC GCGGCGATCC AGGCCATCCG CGGGGCCGGC
GCCACCGCGC AGGAGATCCT GGTCCCGGGG ACCTACTGGG ACGGGGCCTG GACCTGGACG
ACCTCGGACA ACGCGGCCGT GCTGGGCGGC GGCGTGGTCG ACCCGTCGAA CAACTACGCC
TTCGAGGTGC ATCAGTACCT CGACGGCGAC GGGTCCGGCA CGCACGCGGG CGTCGCCTCG
ACGAGCATCG GCGTCGAGCG CCTCCAGGCG GCCACGCAAT GGGCCGAGAG CACCCACAAC
CGGCTGTTCC TCGGCGAGTT CGGGGTGGCC CAGGATCAGA CGAGCCTGGC GGCGATGGAC
GCCATGCTGG GCTACATGAG CCAGCACACC GGCGCCTGGC AGGGCGCCAC CTACTGGGCG
GGCGGCCCGT GGTGGGGCGA CTACATGTTC TCGGCCGAGC CGACCGGGCT CGGGACCTCC
AGCGTCACCG ACAAGCCCCA GATGACGGTG CTCGACAAGT ACATCTGA
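
The GC content field reported for this gene (71%) can in principle be recomputed from the sequence above. The sketch below is one minimal way to do that, assuming GC content is simply the fraction of G and C bases over all bases; how the database actually derives its figure is not stated. The helper name `gc_content` is hypothetical, and the example uses only the first two 10-base blocks of the gene sequence.

```python
def gc_content(seq):
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First two blocks of the gene sequence, spacing kept as in the record.
sample = "ATGGCCCTCA AGAGCAGCAA"
print(round(gc_content(sample), 1))  # 55.0
```

Passing the full gene sequence, with its spaces and line breaks, to the same function gives the whole-gene value to compare against the reported figure.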
 
Protein sequence
MALKSSNPAT QIAAPPAPWL DASTDSGALG DNLTSFRSLK LDGTGAPGTA ILVSYTGTTA 
AGKRVSGTMP AVTVDASGAW SVATTSLSDG VYAFTAVAQS GPRSTSAPSP ALTVTIDTTP
PPAPILRDFP AASTNNATPT LAGTAEKGAK VAIYMDGAAT AQAIVTADAT GAWSYTEASR
LIDGTHSFVA TATDAAGNTS VRSSAKSVII DTVAPTETIT QLLVAGDNVI DAGEQAAGTV
TVSGTLSAAL APAESLLLTV AGATYTVPQA SLSGTSFSLQ VAKPAAGWAS GSASARVQDA
AGNAGQTTTQ SFTLGGAPVP PAGQLSLLGI NLAGGEFGSA VPGRYGTDYI YPNHAEIDYY
AGKGLNVIRL PFLWERLQPV QGGALSSSDL AYIDDVVSYA NAKGMKVVLD MHNYGSGYGS
PVGSAATPVG AFADFWGRMA GHFASNPNVL FGLMNEPQQS TATEWLGDVN AAIQAIRGAG
ATAQEILVPG TYWDGAWTWT TSDNAAVLGG GVVDPSNNYA FEVHQYLDGD GSGTHAGVAS
TSIGVERLQA ATQWAESTHN RLFLGEFGVA QDQTSLAAMD AMLGYMSQHT GAWQGATYWA
GGPWWGDYMF SAEPTGLGTS SVTDKPQMTV LDKYI