Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_2218 |
Symbol | |
ID | 5832755 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 2461258 |
End bp | 2464365 |
Gene Length | 3108 bp |
Protein Length | 1035 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641368017 |
Product | glycosyl transferase family protein |
Protein accession | YP_001639684 |
Protein GI | 163851641 |
COG category | [R] General function prediction only |
COG ID | [COG1216] Predicted glycosyltransferases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 0.966726 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCGGGGCC CGGTATTGGT GTTGTCGAGG AATGTGGCCG TGATCAAGCT GCCGAAGCTG TTTCGGCCCA GGCGCCGCGA TGCCGCGATG GCCGCCCCCT TCTTCGATCC CGCCTTCTAC CGCTCGGCCT ATCCCGACGT GGCTGCGGGC GGCGGCGACC CGCTTCGGCA CTACCTCGAC CATGGCTGGC TGGAGGGCCG CGACCCGTCC GCGGTGTTCT CGACGCTGTT CTACGCCGAC CGCCACTTCG CCGACCGAGC GCCGATCAAC CCGCTTCTCC ACTATGCGGG ACTCTCGCCG ACCGAGCGGC TGCGGGTCGC GACGCAGCCG GGTTCGGATT TTCCGGCGCT CCAGGCCGGC GTGGTCGAGC CCTATTTCGA TGCGCCGTAT TATGCCCGCC TCGCCGGTCT CGAGGGAGGC GAGGACCCGC TCGCCCACTA CCTGACCAAG GGCTGGCGGC GCGGGCTCGA CCCGAACGAT GCCTTCTCGG GCGACGCCTA CCTCCAGCGC CACGCGCATA TGCGGCGCCT CGGCGTCAGC CCGCTCTATC ATTTTGCCGC GACGCGACGG CTCCAGGCCT CGGAGGCCGC CGCCCCGCCC GCGCGGCGTG CCAAGCCGCA GGCAGCACTG CTTCCCGCCG ACCCGGCCGA AGCCTCCGAC GCGCAGATCT ACGAGACGGT GGCGGCGGCC TTCGACCGGG AGCACTACCT CGACACCAAT CCCGACATCC GCCGCTCCGG CATCGATCCG GTCCGCCATT TCCTCGATTT CGGCGCCCGC GAGGACCGCG ACCCGAGCCC CGACTTCTCG GTGCGCTTCT ACCGCAAGCA TTACCGGCGG GAGATCGGGG CCGGGGTGAA CCCGTTCTTC CACTACCTCG CGGTGGGCCG GGCGCGCGGC TTCCGGCCGA CACCGTTCGG TCTCGGAACC TGGCCCCCGC AGGTCGCGCC GACCCCGGCC GAGTGGGAAC AGGCGCAGCC TGCCGCCGAC ATCGCAGCGG CGCGGGTCGT GCTGATCATG CCCGTCTACA AGGGCTACGA CGACACCCTG CGGGCGATCC ACTCGGTGCT GAGCGAGGAG CAGGCCACGC CCTTCGCCCT CCTCGTCATC GACGATTGCA GCCCCGATCC GCGCCTGAGC GCGGCACTGG CCGAGCTCGG CGGCCGCGGC CTCTTCGCCC ACCTCGTCAA CGAGGCGAAT CTCGGCTTCG TACGGACCTG CAACCGCGGC CTGGGGCGGG CCGCGGGCAA GGACGTCGTG CTCCTCAATG CCGACGTCGT CGTTTACGGC GACTGGCTCG ACCGCCTGCT CTGGCACCTC GACGCCGACC CCCGCGTCGC GACGGTCACG CCGTTTTCAA ACAACGCCAC GATCTGCAGT TATCCGCGGC CCAACGTCGA CAATCAGGCC CGGCTCGAGA TCACGCCCGC CGAGATCGAC GCCTTCACCC GCGCCTGCAA CGCCCGCACG AGTTCGCCGG TGCCGACCGG CGTCGGCTTC TGCATGGCGA TGCGCCGGGA GGCGATCGCG GCGGTGGGGC TCCTCGACGC CGAGACCTTC GGGCGCGGCT ACGGCGAAGA GAACGATTTC TGCATGCGCG CCCTCAAGGC GGGCTTCACC AACGTGCTGG CCCACGACAT CTTCGTGTTC CATTCCGGCA GCGTCTCCTT CGGCGCGCTG CTCGCGACCA AGGGCGCGGA CATCTTCCGC GCGATCCTGA CCAAGCACCC CGACTACCAG CGCCGGGTCC ACAACCACAT CGAGGTCGAT CCCGCGCGCT TCGCCCGGCG CCGCCTCGAC CTCTACCGCT TCGCCCGGCG CGCCACGGCC AGCGCCGAGC GCGGCATCGC CCTGATCGTG ACCCACGATT TCGGCGGCGG GGTCGAGACC CATATCGAGG CGCTGAGCCT CCGGCTCGCC GAGGCCGGTC TCGCCGTCGT CTATCTGCGC ACCGACGAGC CCGGCGGGTT CAGGCTCGGC CTGCCCGGAT CGGGCGGGAT CGACTTCCCC GTTTCGATCC TCGATCCGCT CTCCCTCGAC CGGGATGCGG ATCTCCTCGC CGAGCTGATC GGTTGGCTCG CCCCGGCAAT GGTGCATGTG CACTCGCTGG CGGGGCTCGA TGCCCCCTCG ACCCGCGCGG CGATGCAGCT GATTGAGGAG GCCGGCACCG GCTACGACGT GACGCTGCAC GACTACGCCT CGGTCTGCCA CCGCAACAAC CTCGTGCGGT CCGACGGCGT CTATTGCGGG TTGGCGGAAC CCGCCGTCTG CCGCGACTGC ATCCGCCTCG ATCGCGACGC CGACGGGATC GTGCTGCCCG ATCCCGCTGA GCGGCGGCGG GACTGGGCCG GGTTCCTCGA CCGCGCCCGG ACGGTGTTCG CCCCCTCCGC CGACCTCGCG GATCGAATCG GCAGCACGCT GCACCTCGAC CGCATCGCGG TGCGCCCGCA CGAGGAGACG CTGGCCGGCG TGGAGCTCAA GGCGCGCCAG CGCCGGGAGG GGCCGCTGCG GGTCGCGGTG ATCGGATCGA TCGGCGCGCA CAAGGGCTAC GACGTCGTCC ACAACCTCGC CCTCGATGCC CGGCTGCGGC AATTGCCGAT CGTGTTCACG ATCATCGGCC ACTCCGCCGA GCCGCGCGCC ATGGAGGCGG CGGGCGTGCG CGAGACCGGC CTCTATGGCA GCGACGCGGC GGCGCTCGCC GAGATCGCCC GGCTCGATCC GGACCTCGTT CTGCTCCCCT CGATCTGGCC GGAGACCTAC TGCTACACGC TCTCGCTGGC GCTGGCGGCG GGGGTGCCGC CGGTGGTGTT CGATCTCGGC GCCCAGGCCG AGCGGCTGCG CGAAAGCGGG GCGGGGCATG GCCTCGACCC GGCCCTGGCC GATGATCCGC AGAGGCTGAA CGCGGCCCTG CTCGCGCTTC CCATCGACGC GCTCTGGGCC GCCCGCGAGC CGTTCCGGCC GACCGTCTAC CCGCGGATTC TGGAAGATTA TTACGGGCTC GACGCGGCCG CCCTGTCGAA CGTCTCGCCC GAGGCGGGGA GGGAAGGGGT CGCGCCGGTG GATCACCGAG ACGGGCTCCG CTCCGCGCCA TCCCCCCAGA TGTCATAG
|
Protein sequence | MRGPVLVLSR NVAVIKLPKL FRPRRRDAAM AAPFFDPAFY RSAYPDVAAG GGDPLRHYLD HGWLEGRDPS AVFSTLFYAD RHFADRAPIN PLLHYAGLSP TERLRVATQP GSDFPALQAG VVEPYFDAPY YARLAGLEGG EDPLAHYLTK GWRRGLDPND AFSGDAYLQR HAHMRRLGVS PLYHFAATRR LQASEAAAPP ARRAKPQAAL LPADPAEASD AQIYETVAAA FDREHYLDTN PDIRRSGIDP VRHFLDFGAR EDRDPSPDFS VRFYRKHYRR EIGAGVNPFF HYLAVGRARG FRPTPFGLGT WPPQVAPTPA EWEQAQPAAD IAAARVVLIM PVYKGYDDTL RAIHSVLSEE QATPFALLVI DDCSPDPRLS AALAELGGRG LFAHLVNEAN LGFVRTCNRG LGRAAGKDVV LLNADVVVYG DWLDRLLWHL DADPRVATVT PFSNNATICS YPRPNVDNQA RLEITPAEID AFTRACNART SSPVPTGVGF CMAMRREAIA AVGLLDAETF GRGYGEENDF CMRALKAGFT NVLAHDIFVF HSGSVSFGAL LATKGADIFR AILTKHPDYQ RRVHNHIEVD PARFARRRLD LYRFARRATA SAERGIALIV THDFGGGVET HIEALSLRLA EAGLAVVYLR TDEPGGFRLG LPGSGGIDFP VSILDPLSLD RDADLLAELI GWLAPAMVHV HSLAGLDAPS TRAAMQLIEE AGTGYDVTLH DYASVCHRNN LVRSDGVYCG LAEPAVCRDC IRLDRDADGI VLPDPAERRR DWAGFLDRAR TVFAPSADLA DRIGSTLHLD RIAVRPHEET LAGVELKARQ RREGPLRVAV IGSIGAHKGY DVVHNLALDA RLRQLPIVFT IIGHSAEPRA MEAAGVRETG LYGSDAAALA EIARLDPDLV LLPSIWPETY CYTLSLALAA GVPPVVFDLG AQAERLRESG AGHGLDPALA DDPQRLNAAL LALPIDALWA AREPFRPTVY PRILEDYYGL DAAALSNVSP EAGREGVAPV DHRDGLRSAP SPQMS
|
| |