Gene Mnod_3023 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMnod_3023 
Symbol 
ID7304229 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium nodulans ORS 2060 
KingdomBacteria 
Replicon accessionNC_011894 
Strand
Start bp3109818 
End bp3113180 
Gene Length3363 bp 
Protein Length1120 aa 
Translation table11 
GC content60% 
IMG OID643600724 
Productpolysaccharide deacetylase 
Protein accessionYP_002498269 
Protein GI220922967 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0726] Predicted xylanase/chitin deacetylase
[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGGCCGA ACGCACGTCC AGTCTTCTAC GGTGCAAGCG GCCGACGCGC TCGCTTCACG 
AATGCGGCGC TGCTGCTTAT CTCGTTCCTC GCGGTGTTCG GACTCGCCAG CCTGATCTAC
GGAATGCTGG TTGCTCCCAA TCTGCCCGCT TCTGAGAAAT CAAGGCCCAA GGCTATAAAT
TCAGACACCG GCATCCGAGC CGACCGGGTT ACGGCGACAG AGCCTGTCAA TCCGACCTCG
AACCGGCAGG TTCCGCCTAC CGCGGCGCAA GCCTTGCGCT TCGCGTACCT CGCGAGCAAC
GACTATGCGC TCGCCTCCCT CAAACAGCAT GCCGCCGAGC TCGACGGGAT CATTCCGGAC
TGGCTTGAGC TGAGGCAGCA GGAGGGACGG ATCCACATCC AGGTCGATGA TAACAGCGTC
GGCATTTCCC GGTGGCTGAA AGCGAACGCA CCTCGGCTGC AAATTTATCC TCTGATCAAC
AGTGGCCTGA CCACTCATCA AACGAACCTT GTATTTGCGA TGAATTCTGC AAGAGAAAAG
AGCATCGCTG AGATAATTTC TTATTTGGAG CGGAACGAAT TGCCCGGGAT TACCTTACAG
GTCCCAAATG CCACCCCCTC TAACGAGCGA ATCATCATCA ATTTCGTCCG TGAGCTGAGG
CGCAGGCTCG GCGAGCAGCA ACGCAAGCTG ATCGTGATGA CGAGCCTCAC CGACGGTGCC
CCCCGCATCA GTGAACTCAG CAAAGCGTCG GACTATGTCC TCGTGACGAC CCATGACAAT
ACCCAGTACG GTCGGCCAAC CCCGATCGTC CCCCAAGCAT GGATCGAGTC GCAGCTAGCG
GCTCTCTTCG CCCGGGCCGA TGCAAGCAAG ATCATCGTCA GTATCGGTTC GTTTGGAATC
GATTGGGATC GATTTGGTCG GATGAAGCAG ATCTCCGTAC CCGCGGCCTG GACGCTTATG
CAGAATGCTG GCACGCCGCT CAAGTTCGAT CAACGCTCGC TCAACGCAAC GTTGCGGTAT
CGCGATGCAA ATGGGCAGCC GCACGAGGTC TGGCTGCTCG ACGGCGTCAC CTGCTTCAAT
CAGCTCCGAG CGGTTTTGGC CTACAAGCCG GCCGGAATAG CCCTCTTCGG CCTCGGCTAC
GAGGATGCGG GCATCTGGTC TATGTGGGCG CCGACCAAGC TTCCCGACGC TCACGCGCTG
AAGTCGCTCG CAACCTTGCA GCCGGGGGGC GACTTTTTCG CGAGCCTCAA GGCGGCACTC
GTTTCGGCGA CCCCGGGAGG ACCGGGCAAG CGAATCCTCG CCTACAATGA CAGGCTTGGC
CTCATCACCG GACAGTCCGT CACGGTGGCA CCGGTTCAGG CCCAAGCAAC CACGTGGTTT
CCGGTTGCAA AAAACCTTGT CGCGCTGACG TTCGACGATG GACCGGATCC CAATTACACC
GCGAAGATCT TGGATATCTT ACGCGAAAAG GGAGCGAAGG CGACGTTCTA TATCATCGGC
CGCAATGCCG TGCAGTCGCC CGGGCTGCTC AAGAGGATCT ACGATGAGGG GCACGACATC
GGGAATCACA CCTTCTCGCA CGCGCGCCTC ATGGAGAGCA GCGGGCGGGA ACGGATCGCC
GTCGAGCTGA ACATGGCTCA GCGGATCATC GAGGCGCAAA CGGGGGTCCG AACCACCCTG
TTTAGGCCAC CGGAAGCTTT CAGGAGCCTC TCCTTCCTCG ACTTCTCGCC GCAGCTTGTC
GAGGTCGCCA CCGAGCTCGG GTACCAGATT GGAGCGCTCG ACACGGATTC CTTCGACTGG
GCTGCAGCCG CCTTCGGCGT CAAGAAGGCC AACGTCGTCG ACCGGGTTGT GAGCAAGGTT
GCGAACGGGC AGGGTCAGAT CGTGCTGATG CACGATGCTG GTGGAAACCG GCAACTGACC
ATCGACGCCC TACCGGACAT CATTGATCAG CTTCAGGCCA GGGGCTTTCA GTTCGTCACG
ACACACGAAC TCGTCGGCAG GGCGCGCGAC GCGGTCATGC CGCCGACGCG GGCGCCGAGC
CTCATCGAGG CCTTCAACAC GGAATTCTGG CGCATCGGAT CCCAGACGGT GTCCTGGCTC
TGCGATGCCA TTCCGGCCGT CGCCATCACG ACCACCGTGC TCGCGATCTT TCGCCTGACG
CTGATCATCA TCGGCGCGAC CGCGCATGGC CTGAGAGGCA GCCGTCGAGA CCCACCGGAG
GGATGGCGAC CGCGGGGAAT TGCTGTCCTG GTGCCGGCAT ACAACGAGGA GATTGTGATC
CTCAAGACGA TCCAGACTCT GCTTGCGTCG ACCATAGCAG AGCAGATCGA GATCATCGTT
ATCGACGATG GTTCGACGGA CAACACGGCT GCGGTTGTTC GAACAGCTTT CCCGAATACA
GCTGCGGTGC AGATCTACAC GAAGGCGAAT GGCGGCAAGG CTGCTGCTCT CAATTACGGT
CTTCAGAAAA CATCGACTGA GATTATCGTC GCCATCGACG GCGATACAGT GCTCCTGCCC
GATGCAATTG AGCATCTGGC GCGCCATTTC GCTGATCCGA AAATCGGCGC CGTCGCCGGA
ACCGTCTCTG TCGGCAATCG GAAGACGCTC ATCGCCCGTT TCCAGGCCCT CGAATACACG
ATGAGCCAGA ACCTGGACCG CAGGGCCTTC CAGCTCATCA ACGCCATCGG CGTCGTTCCC
GGCGCGATCG GCGCGTGGCG GCGCGAGGCT CTGATGGCTG TCGGAGGCTA CTCGTCAGAC
ACGCTTGCGG AAGATGCCGA TTTGACGATC TCGCTCGAAC TCGCGGGCTG GAAAGTCGTG
TGCGAGCCGC GCGCCCGCGC ACTCACGGAG GCGCCCGAGC GCCTCCGCGC CTTCCTCAAG
CAGCGCTTTC GCTGGATGTT CGGCACGCTG CAGGTCGCTT ACAAGCACGC CCCGGCGAGC
TTAAGGAGAC CGCGGGGTAT TTCTCTAATC CTGATCCCTA ACGTTCTGCT GTTTCAGTTC
CTATTTACGT TGCTGGCGCC CCTGATGGAT TTGATTCTTA TCTTCTCGGT CGTCTCTAGC
GTTGTCGATA TTACACTCAT CGGCTCGAGA AGTGAGGGCT ACGGGACGCT TGAACTGCTG
ATGGCCTATT GGCTCGTCTT CCAAGTCTTC GATTTTCTCG CAGGCTGTGC GGCGTTGCTT
CTGCATGGCA AGTCGCCGGA GTGGCGCCTG CTGCCGCTGC TTGTTCTTCA GCGGTTCTGC
TACCGCCAGC TGCTCTACAT CACGGCCATT CGAACATTGC TGACCGCACT CAGAGGAACT
TTCGTGGGCT GGGGCAAACT CGTCCGTACC GGAAGTGTCG ATCTACCGGT CGCGTCGGCT
TGA
 
Protein sequence
MRPNARPVFY GASGRRARFT NAALLLISFL AVFGLASLIY GMLVAPNLPA SEKSRPKAIN 
SDTGIRADRV TATEPVNPTS NRQVPPTAAQ ALRFAYLASN DYALASLKQH AAELDGIIPD
WLELRQQEGR IHIQVDDNSV GISRWLKANA PRLQIYPLIN SGLTTHQTNL VFAMNSAREK
SIAEIISYLE RNELPGITLQ VPNATPSNER IIINFVRELR RRLGEQQRKL IVMTSLTDGA
PRISELSKAS DYVLVTTHDN TQYGRPTPIV PQAWIESQLA ALFARADASK IIVSIGSFGI
DWDRFGRMKQ ISVPAAWTLM QNAGTPLKFD QRSLNATLRY RDANGQPHEV WLLDGVTCFN
QLRAVLAYKP AGIALFGLGY EDAGIWSMWA PTKLPDAHAL KSLATLQPGG DFFASLKAAL
VSATPGGPGK RILAYNDRLG LITGQSVTVA PVQAQATTWF PVAKNLVALT FDDGPDPNYT
AKILDILREK GAKATFYIIG RNAVQSPGLL KRIYDEGHDI GNHTFSHARL MESSGRERIA
VELNMAQRII EAQTGVRTTL FRPPEAFRSL SFLDFSPQLV EVATELGYQI GALDTDSFDW
AAAAFGVKKA NVVDRVVSKV ANGQGQIVLM HDAGGNRQLT IDALPDIIDQ LQARGFQFVT
THELVGRARD AVMPPTRAPS LIEAFNTEFW RIGSQTVSWL CDAIPAVAIT TTVLAIFRLT
LIIIGATAHG LRGSRRDPPE GWRPRGIAVL VPAYNEEIVI LKTIQTLLAS TIAEQIEIIV
IDDGSTDNTA AVVRTAFPNT AAVQIYTKAN GGKAAALNYG LQKTSTEIIV AIDGDTVLLP
DAIEHLARHF ADPKIGAVAG TVSVGNRKTL IARFQALEYT MSQNLDRRAF QLINAIGVVP
GAIGAWRREA LMAVGGYSSD TLAEDADLTI SLELAGWKVV CEPRARALTE APERLRAFLK
QRFRWMFGTL QVAYKHAPAS LRRPRGISLI LIPNVLLFQF LFTLLAPLMD LILIFSVVSS
VVDITLIGSR SEGYGTLELL MAYWLVFQVF DFLAGCAALL LHGKSPEWRL LPLLVLQRFC
YRQLLYITAI RTLLTALRGT FVGWGKLVRT GSVDLPVASA