Gene Mext_2218 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_2218 
Symbol 
ID5832755 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp2461258 
End bp2464365 
Gene Length3108 bp 
Protein Length1035 aa 
Translation table11 
GC content71% 
IMG OID641368017 
Productglycosyl transferase family protein 
Protein accessionYP_001639684 
Protein GI163851641 
COG category[R] General function prediction only 
COG ID[COG1216] Predicted glycosyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value0.966726 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGGGGCC CGGTATTGGT GTTGTCGAGG AATGTGGCCG TGATCAAGCT GCCGAAGCTG 
TTTCGGCCCA GGCGCCGCGA TGCCGCGATG GCCGCCCCCT TCTTCGATCC CGCCTTCTAC
CGCTCGGCCT ATCCCGACGT GGCTGCGGGC GGCGGCGACC CGCTTCGGCA CTACCTCGAC
CATGGCTGGC TGGAGGGCCG CGACCCGTCC GCGGTGTTCT CGACGCTGTT CTACGCCGAC
CGCCACTTCG CCGACCGAGC GCCGATCAAC CCGCTTCTCC ACTATGCGGG ACTCTCGCCG
ACCGAGCGGC TGCGGGTCGC GACGCAGCCG GGTTCGGATT TTCCGGCGCT CCAGGCCGGC
GTGGTCGAGC CCTATTTCGA TGCGCCGTAT TATGCCCGCC TCGCCGGTCT CGAGGGAGGC
GAGGACCCGC TCGCCCACTA CCTGACCAAG GGCTGGCGGC GCGGGCTCGA CCCGAACGAT
GCCTTCTCGG GCGACGCCTA CCTCCAGCGC CACGCGCATA TGCGGCGCCT CGGCGTCAGC
CCGCTCTATC ATTTTGCCGC GACGCGACGG CTCCAGGCCT CGGAGGCCGC CGCCCCGCCC
GCGCGGCGTG CCAAGCCGCA GGCAGCACTG CTTCCCGCCG ACCCGGCCGA AGCCTCCGAC
GCGCAGATCT ACGAGACGGT GGCGGCGGCC TTCGACCGGG AGCACTACCT CGACACCAAT
CCCGACATCC GCCGCTCCGG CATCGATCCG GTCCGCCATT TCCTCGATTT CGGCGCCCGC
GAGGACCGCG ACCCGAGCCC CGACTTCTCG GTGCGCTTCT ACCGCAAGCA TTACCGGCGG
GAGATCGGGG CCGGGGTGAA CCCGTTCTTC CACTACCTCG CGGTGGGCCG GGCGCGCGGC
TTCCGGCCGA CACCGTTCGG TCTCGGAACC TGGCCCCCGC AGGTCGCGCC GACCCCGGCC
GAGTGGGAAC AGGCGCAGCC TGCCGCCGAC ATCGCAGCGG CGCGGGTCGT GCTGATCATG
CCCGTCTACA AGGGCTACGA CGACACCCTG CGGGCGATCC ACTCGGTGCT GAGCGAGGAG
CAGGCCACGC CCTTCGCCCT CCTCGTCATC GACGATTGCA GCCCCGATCC GCGCCTGAGC
GCGGCACTGG CCGAGCTCGG CGGCCGCGGC CTCTTCGCCC ACCTCGTCAA CGAGGCGAAT
CTCGGCTTCG TACGGACCTG CAACCGCGGC CTGGGGCGGG CCGCGGGCAA GGACGTCGTG
CTCCTCAATG CCGACGTCGT CGTTTACGGC GACTGGCTCG ACCGCCTGCT CTGGCACCTC
GACGCCGACC CCCGCGTCGC GACGGTCACG CCGTTTTCAA ACAACGCCAC GATCTGCAGT
TATCCGCGGC CCAACGTCGA CAATCAGGCC CGGCTCGAGA TCACGCCCGC CGAGATCGAC
GCCTTCACCC GCGCCTGCAA CGCCCGCACG AGTTCGCCGG TGCCGACCGG CGTCGGCTTC
TGCATGGCGA TGCGCCGGGA GGCGATCGCG GCGGTGGGGC TCCTCGACGC CGAGACCTTC
GGGCGCGGCT ACGGCGAAGA GAACGATTTC TGCATGCGCG CCCTCAAGGC GGGCTTCACC
AACGTGCTGG CCCACGACAT CTTCGTGTTC CATTCCGGCA GCGTCTCCTT CGGCGCGCTG
CTCGCGACCA AGGGCGCGGA CATCTTCCGC GCGATCCTGA CCAAGCACCC CGACTACCAG
CGCCGGGTCC ACAACCACAT CGAGGTCGAT CCCGCGCGCT TCGCCCGGCG CCGCCTCGAC
CTCTACCGCT TCGCCCGGCG CGCCACGGCC AGCGCCGAGC GCGGCATCGC CCTGATCGTG
ACCCACGATT TCGGCGGCGG GGTCGAGACC CATATCGAGG CGCTGAGCCT CCGGCTCGCC
GAGGCCGGTC TCGCCGTCGT CTATCTGCGC ACCGACGAGC CCGGCGGGTT CAGGCTCGGC
CTGCCCGGAT CGGGCGGGAT CGACTTCCCC GTTTCGATCC TCGATCCGCT CTCCCTCGAC
CGGGATGCGG ATCTCCTCGC CGAGCTGATC GGTTGGCTCG CCCCGGCAAT GGTGCATGTG
CACTCGCTGG CGGGGCTCGA TGCCCCCTCG ACCCGCGCGG CGATGCAGCT GATTGAGGAG
GCCGGCACCG GCTACGACGT GACGCTGCAC GACTACGCCT CGGTCTGCCA CCGCAACAAC
CTCGTGCGGT CCGACGGCGT CTATTGCGGG TTGGCGGAAC CCGCCGTCTG CCGCGACTGC
ATCCGCCTCG ATCGCGACGC CGACGGGATC GTGCTGCCCG ATCCCGCTGA GCGGCGGCGG
GACTGGGCCG GGTTCCTCGA CCGCGCCCGG ACGGTGTTCG CCCCCTCCGC CGACCTCGCG
GATCGAATCG GCAGCACGCT GCACCTCGAC CGCATCGCGG TGCGCCCGCA CGAGGAGACG
CTGGCCGGCG TGGAGCTCAA GGCGCGCCAG CGCCGGGAGG GGCCGCTGCG GGTCGCGGTG
ATCGGATCGA TCGGCGCGCA CAAGGGCTAC GACGTCGTCC ACAACCTCGC CCTCGATGCC
CGGCTGCGGC AATTGCCGAT CGTGTTCACG ATCATCGGCC ACTCCGCCGA GCCGCGCGCC
ATGGAGGCGG CGGGCGTGCG CGAGACCGGC CTCTATGGCA GCGACGCGGC GGCGCTCGCC
GAGATCGCCC GGCTCGATCC GGACCTCGTT CTGCTCCCCT CGATCTGGCC GGAGACCTAC
TGCTACACGC TCTCGCTGGC GCTGGCGGCG GGGGTGCCGC CGGTGGTGTT CGATCTCGGC
GCCCAGGCCG AGCGGCTGCG CGAAAGCGGG GCGGGGCATG GCCTCGACCC GGCCCTGGCC
GATGATCCGC AGAGGCTGAA CGCGGCCCTG CTCGCGCTTC CCATCGACGC GCTCTGGGCC
GCCCGCGAGC CGTTCCGGCC GACCGTCTAC CCGCGGATTC TGGAAGATTA TTACGGGCTC
GACGCGGCCG CCCTGTCGAA CGTCTCGCCC GAGGCGGGGA GGGAAGGGGT CGCGCCGGTG
GATCACCGAG ACGGGCTCCG CTCCGCGCCA TCCCCCCAGA TGTCATAG
 
Protein sequence
MRGPVLVLSR NVAVIKLPKL FRPRRRDAAM AAPFFDPAFY RSAYPDVAAG GGDPLRHYLD 
HGWLEGRDPS AVFSTLFYAD RHFADRAPIN PLLHYAGLSP TERLRVATQP GSDFPALQAG
VVEPYFDAPY YARLAGLEGG EDPLAHYLTK GWRRGLDPND AFSGDAYLQR HAHMRRLGVS
PLYHFAATRR LQASEAAAPP ARRAKPQAAL LPADPAEASD AQIYETVAAA FDREHYLDTN
PDIRRSGIDP VRHFLDFGAR EDRDPSPDFS VRFYRKHYRR EIGAGVNPFF HYLAVGRARG
FRPTPFGLGT WPPQVAPTPA EWEQAQPAAD IAAARVVLIM PVYKGYDDTL RAIHSVLSEE
QATPFALLVI DDCSPDPRLS AALAELGGRG LFAHLVNEAN LGFVRTCNRG LGRAAGKDVV
LLNADVVVYG DWLDRLLWHL DADPRVATVT PFSNNATICS YPRPNVDNQA RLEITPAEID
AFTRACNART SSPVPTGVGF CMAMRREAIA AVGLLDAETF GRGYGEENDF CMRALKAGFT
NVLAHDIFVF HSGSVSFGAL LATKGADIFR AILTKHPDYQ RRVHNHIEVD PARFARRRLD
LYRFARRATA SAERGIALIV THDFGGGVET HIEALSLRLA EAGLAVVYLR TDEPGGFRLG
LPGSGGIDFP VSILDPLSLD RDADLLAELI GWLAPAMVHV HSLAGLDAPS TRAAMQLIEE
AGTGYDVTLH DYASVCHRNN LVRSDGVYCG LAEPAVCRDC IRLDRDADGI VLPDPAERRR
DWAGFLDRAR TVFAPSADLA DRIGSTLHLD RIAVRPHEET LAGVELKARQ RREGPLRVAV
IGSIGAHKGY DVVHNLALDA RLRQLPIVFT IIGHSAEPRA MEAAGVRETG LYGSDAAALA
EIARLDPDLV LLPSIWPETY CYTLSLALAA GVPPVVFDLG AQAERLRESG AGHGLDPALA
DDPQRLNAAL LALPIDALWA AREPFRPTVY PRILEDYYGL DAAALSNVSP EAGREGVAPV
DHRDGLRSAP SPQMS