Gene Mnod_1265 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMnod_1265 
Symbol 
ID7308704 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium nodulans ORS 2060 
KingdomBacteria 
Replicon accessionNC_011894 
Strand
Start bp1342851 
End bp1344191 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content67% 
IMG OID643599007 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002496569 
Protein GI220921268 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACTGGA CCACCATCGT TCGCAAGGGA CTCGCGGCCT GCGGGATCGC CTCCCTGGCG 
GGGATCGCCG CGGCAAGCCC TGCCTCCGCG GTCACCGAGC TGCAATGGTG GCACGCCATG
GTGGGCGCCA ACAACGACAC CATCATACGG CTCGCCGAGG AGTTCAACGC CGCGCAGAGC
GAGTACCGGG TGGTGCCGGC CTACAAGGGC ACCTATCCGG AGACGCTGAA TGCGGGCATC
GCGGCCTTCC GGGCCGGCAC GGCGCCCCAC ATCATCCAGG TCTTCGAGGT CGGCACCGCC
ACCATGATGG CGGCCAAGGG CGCGGTGAAG CCGGTCTACC AGCTGATGAA GGAGGCGGGC
GAGCCCTTCG ACCCGAACGC CTACCTGCCG GCGGTCACCG GCTACTACTC GACCGCCAAG
GGGGAGATGC TCTCCTTCCC GTTCAACTCC TCGTCCATGG TGATGTGGGT CAACCGCGAC
GCCCTGCGCA AGGCCGGGCT CGACCCCAAC GCGCCGCCGA AGACCTGGCC GGCCGTGTTC
GAGGCCGCCA AGGCCCTCAA GGCCGCCGGC TACTCGACCT GCGGCGTCTC GAACACCTGG
GTGACCTGGG CGCATCTGGA GCAGTTCTCG GCCTGGCACA ACGTGCCGCT CGCCACCAAG
GCGAACGGTC TCGACGGTTT CGACACCAGC CTCGAGATCA ACAATCCGCT GCAGGTCCGG
CATCTGGCGA CGCTGGCGGA GATGCAGAAG GAGAAGCTCT ACGATTATTC CGGCCGCTAC
GACAACGGCT TCGGGCGCTT CACTTCGGGC GAGTGCCCGC TCTTCCTCGG CTCGTCGGGC
TCCTACGGCA ACGTGCGCGG CAACGCCAAG TTCGACTGGG CGGCAGCAGC GATGCCCTAC
TATCCGGACG TGCCGGGCGC GCCCCAGAAC AGCATCATCG GGGGCGCCTC GCTCTGGGTG
ATGGGCGGCA AGTCGGCCGA GGAGTACAAG GGCGTCGCCA AGTTCTTCGC CTTCCTGTCG
GACACCGACC GCCAGGCGCG GATCCACCAG ACCACCGGCT ATCTGCCGAT CACCAAGGCG
GCCTACGAGA AGTCGAAGGC GGACGGCTTC TACGACAAGA ACCCGGCGCT CGAAGTGCCG
ATCAGGGAAC TCACCAACAA GGCGCCCACC GAGAATTCCC GGGGCTTGAG GCTCGGCAAC
ATGCCGCAGA TGCGCGACGT CTGGGCCGAG GAGATCGAGG CGGCGCTCGC CGGCAAGAAG
CCCGCCAAGC AGGCCCTCGA CGAGGCCGCC GCCCGCGGCA ACGCGATGCT GCGCCAGTTC
GAGAAGCAGG CGAACCGCTA G
 
Protein sequence
MDWTTIVRKG LAACGIASLA GIAAASPASA VTELQWWHAM VGANNDTIIR LAEEFNAAQS 
EYRVVPAYKG TYPETLNAGI AAFRAGTAPH IIQVFEVGTA TMMAAKGAVK PVYQLMKEAG
EPFDPNAYLP AVTGYYSTAK GEMLSFPFNS SSMVMWVNRD ALRKAGLDPN APPKTWPAVF
EAAKALKAAG YSTCGVSNTW VTWAHLEQFS AWHNVPLATK ANGLDGFDTS LEINNPLQVR
HLATLAEMQK EKLYDYSGRY DNGFGRFTSG ECPLFLGSSG SYGNVRGNAK FDWAAAAMPY
YPDVPGAPQN SIIGGASLWV MGGKSAEEYK GVAKFFAFLS DTDRQARIHQ TTGYLPITKA
AYEKSKADGF YDKNPALEVP IRELTNKAPT ENSRGLRLGN MPQMRDVWAE EIEAALAGKK
PAKQALDEAA ARGNAMLRQF EKQANR