Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | M446_3112 |
Symbol | |
ID | 6130324 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium sp. 4-46 |
Kingdom | Bacteria |
Replicon accession | NC_010511 |
Strand | - |
Start bp | 3443680 |
End bp | 3446559 |
Gene Length | 2880 bp |
Protein Length | 959 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641643303 |
Product | glycosyl transferase family protein |
Protein accession | YP_001769956 |
Protein GI | 170741301 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.264403 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGCCG ATCCCGTGCC CCGCGCCCCG GCGGATGCGC CGTCCTCCCC GGCGCGCCCG CGCCTGGCGG TGATCGTCCC GGTCTTCAAG CACAGCGGGC TGGTGCGCGA GGCGGTCGCC TCGCTCACGC GCCAGTCGCG GTTCGCGGAC GTGGACGTGG TCCTGGTCGA CGACGGGTGC CCCGACCCGC AGACCTTCAC GACGCTGACC GCCTTCTGCG CGCGCTGGCC CAACATCCAC TGCGTGAGGC AGGCCAATGG CGGCCTCAGC GCGGCCCGCA ACCGCGGCAT CGCCTTCGCG CTCCGCCGGC TCCCGGCGGC GGAGGGCGTC TACTTCCTCG ATGCCGACAA CCTCCTCGCC CCCTACGGCA TCGCGGCGAT GCAGGAGGCG CTGCTCCTTC ATCCCGAGGC GGATTGGTTC TACCCGGACA TCCGGATGTT CGGTGTCCGG GCCTTCCACG ATTACAGCGG CGACTTCAAC GCCTACGCGG CCTCGGTCGT CAACCTGTGC GAGGCCGGCA GCCTGATCCG GCGCCGCATG ATCGAGGCGG GTCTGCGCTT CGACGAGGGG ATGCGGCTCG GCTACGAGGA TTGGGATTTC TGGCTCTCGG CCGTGGGGCG CGGCTTCCGC GGCCGCCACC TGCCGAATCT CGGCCTGTCC TACCGGAAGC GGGCGGAGAG CATGCTCGCC GACTCGACCC GCTCGGACAC GCAGATCCGG GAATACCTGC GCCGCAAGCA CGAGGCGCTG TTCCAGGTGA ACGCCTTCGT CGCGCGGGAG CACGACGAGC TGCCGCGCTT CGCCGTGCGC CTGTCCGACG AGGCGAGCGT GGAGATGGCG AGCGACCTGC AGCGCGGCGG CCCGCGCATG GACCAGCAGG CCTTCGAGAC GCGGATCTGG CAGGCGATCC GCGAGCCGGC CTTCGTCTGG GCGGGCCAGT ACCTCCTGTC GACGACCGGC CGGACGCTCG CCCTGCTGCG CGGGGCGGGC CTGACGCGCT GGCTCTGCCT GGAGATCGAG CGGGCCCTCG CCGGCCACAA CTTCGTGTCG CTGACCCTCG CGGCCTCGGA CGACGACACC ATCCGGGTCC GGCGCGGGCA CGGCTTCTCG CCGCACTGCC ACCTCCTCGC CGTCTCGCAG AAGCTCCTGC AGGCGATCGC CCTCGACGAG AAGGACGCCT GGATCACGGA ACTGCCGATC CACGGCGCCA ATTACCGCGT CGCCACCCTG GAGATCCGCC TCCCGCCTCG GGCGATGGCG GCGGAGACCC TGAAGGACGT GGCGGTGACG GACTTCGTCC AGTTCTGCCT GCACCTGCGC TGCCACGGCC TGCGCGGCCG TCCGAGCAAC CTGGTCGAGG AGGTCTTCCT CGGCTCCCGC CCGCTCGACG CGATGGCGCC GCGCGCCCGC GCCCAGTTCG ACGGCGCCCT GCTCCCGCCG GTGGCCGAGG CGCGCGAGGG CCGCGTCGCC TGCGTGCTGC CGCATTGCGA TTTCGGCGGC GTCGAGAAGG TCACGTTCTG CCTCGCCCGC GAGTTGCGCC GCCAGGGGCT GCGCACCAGC CTGATCCTGC TCGGGAGCGA CGTCGCCTAC CGGGCGCACC GCGCGCTCGA GGCCTTCGAC GACATCTACC TCGTCGACGC CGGGGGCCGC ATCGCCGCCT GGGCGGGCGA TTCCTTCCTG GGCACGCAGC TGCCCAGGAT CCTGGACGAG GCCTGGGCCC GCGACTTCGC GAATGTCCTG ACCACGTTCG AACTCGTGGT GAGCTGCCAC TCGGCCGAGA TCATGGGATT GTTCAGCGGG CTGCGGCGCC GGGGCGTCAC CACGGCGACC TACCTGCACC TGTTCGACAA GTCGCGCATC GGCGCCGCCT GCGGCCATCC GATGCTCGCC CTCGCCTACG AGCACGCGAT CGACCTCGTC CTGACCTGCT CGGAGGGGAT GGCCTGCGAG ATGGCGAGCC TCGGCATCCC GCGCGACAAG ATCCTGGCCC TGCCGAACGC GCCCTCGCTG GAGCCCGATC CGCACCGCGC CGTCGCGCCG CGCGCCCCGG CCGGTCGCCC CCTGCGCCTG CTCTACCTGG GCCGCCTCGA TACCCAGAAG GGGCTCGACC GGCTGGCCGA GATCATCGAC GCGCTCGACC CGGACCCGCT CTTCGAGATC CGGGTGGTCG GGAAGGCGGT GTTGACCGAC GCCCACCTGA CCCTCAGCCG GCACGCGCAC CTGGTCGAAC CGCCGGTCTA CGACGATGCC GGCCTCTCCG AGATCTACCG CTGGGCCGAT ATCCTGCTGC CGTCCCGCTA CGAGGGGCTC CCGCTCACCG TGCTGGAGGC CATGGTCCAC GGCGTGGTGC CGATCGTGGC CGCCTGCGGG GCGGTGGCCG AGGCGGTCGA GTCGGGCGTC AGCGGCGTCG TCGTCCCGCA GGAGCGCTGC GTTCCGGGCT TCCTCGACCA CCTGCGGGCC CTGGCGGCCG CGCCGGAGCG GTTGGAGGCG ATGAGCCGCG CGGCGATGGC GAGGGCCGCC GGCCGGCCCT GGAGCGTGCT CGCGGAGCGC CTGCGCGCGC GGCTCGGCGC CGTCCGGGCG GCCCGCGCCC GAGAGAGGGC CGGAGCGGGC CGCGCAGGAG CGGGAGCCGG GCTGATCGGC GCCCGCGGCG GCCGAGCCGT GCCCCCACGG CGGGGCCGCC CCGCTCCCGG CGACCTTGCA GGATCGCGCC GAATCGCTAG CTGGGAAGCG ACCCTCCGGA GAGTTCGGCA TGAATGCCCG TCCCGACGCG CGCGACTTTG CGGCCCCCCA GCAGTTCGGC ATCGGTCAGC CGGTGCCGCG GGCCGAGGAT CCCGTGCTGG TGCAGGGGCA GGGGCGCTAC ACGGACGACC TCGCCCTTGA
|
Protein sequence | MSADPVPRAP ADAPSSPARP RLAVIVPVFK HSGLVREAVA SLTRQSRFAD VDVVLVDDGC PDPQTFTTLT AFCARWPNIH CVRQANGGLS AARNRGIAFA LRRLPAAEGV YFLDADNLLA PYGIAAMQEA LLLHPEADWF YPDIRMFGVR AFHDYSGDFN AYAASVVNLC EAGSLIRRRM IEAGLRFDEG MRLGYEDWDF WLSAVGRGFR GRHLPNLGLS YRKRAESMLA DSTRSDTQIR EYLRRKHEAL FQVNAFVARE HDELPRFAVR LSDEASVEMA SDLQRGGPRM DQQAFETRIW QAIREPAFVW AGQYLLSTTG RTLALLRGAG LTRWLCLEIE RALAGHNFVS LTLAASDDDT IRVRRGHGFS PHCHLLAVSQ KLLQAIALDE KDAWITELPI HGANYRVATL EIRLPPRAMA AETLKDVAVT DFVQFCLHLR CHGLRGRPSN LVEEVFLGSR PLDAMAPRAR AQFDGALLPP VAEAREGRVA CVLPHCDFGG VEKVTFCLAR ELRRQGLRTS LILLGSDVAY RAHRALEAFD DIYLVDAGGR IAAWAGDSFL GTQLPRILDE AWARDFANVL TTFELVVSCH SAEIMGLFSG LRRRGVTTAT YLHLFDKSRI GAACGHPMLA LAYEHAIDLV LTCSEGMACE MASLGIPRDK ILALPNAPSL EPDPHRAVAP RAPAGRPLRL LYLGRLDTQK GLDRLAEIID ALDPDPLFEI RVVGKAVLTD AHLTLSRHAH LVEPPVYDDA GLSEIYRWAD ILLPSRYEGL PLTVLEAMVH GVVPIVAACG AVAEAVESGV SGVVVPQERC VPGFLDHLRA LAAAPERLEA MSRAAMARAA GRPWSVLAER LRARLGAVRA ARARERAGAG RAGAGAGLIG ARGGRAVPPR RGRPAPGDLA GSRRIASWEA TLRRVRHECP SRRARLCGPP AVRHRSAGAA GRGSRAGAGA GALHGRPRP
|
| |