Gene Mext_4260 details

Gene Information

Locus tag: Mext_4260
Symbol:
ID: 5833934
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium extorquens PA1
Kingdom: Bacteria
Replicon accession: NC_010172
Strand:
Start bp: 4740364
End bp: 4742592
Gene Length: 2229 bp
Protein Length: 742 aa
Translation table: 11
GC content: 71%
IMG OID: 641370051
Product: glycosyl transferase family protein
Protein accession: YP_001641700
Protein GI: 163853657
COG category: [R] General function prediction only
COG ID: [COG1216] Predicted glycosyltransferases
TIGRFAM ID:
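
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon. Both assumptions agree with the values listed here but are not stated by the record itself.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4740364, 4742592)
print(length)                  # 2229
print(protein_length(length))  # 742
```

Both results match the Gene Length and Protein Length fields of the record.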


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.109167
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value: 0.249298
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCGCGT GCCGCATCGA ACCCAAGGGA CAGATCGAGC CGGAGGGGGA GGGCGTCTGG 
CGCTCCACCG GCTCCGATCC GCAATTCCGG CTGACCCATC CCCGCGGCGG ACGGCTGTTT
CCGTTCCGGC TCGCGGGGGG CTGGGTGCGC ATCCGCGCCG ACCTCGAAGG GCTGGGCCAC
ACGAAGCCGG TGCCGCTCCT CTACGTCGAT GACGGCTCGG GCTTTCGCGA GGACGAGGCG
GTCCGGCTCG AACTACGGGA GGACGGGGGC ATCGACGACC TCGTGCGCCT GCCGCGCCGG
GTGCGGGGCC TGCGGCTCGA TCCGCTCGCC GCGAGCGGGC GGTTCCGGCT CTCGCGGGCG
ACCCTCGAAC CGCTGCACGG CCCCGCCGCG GCGCTCGCCG TCGCCCGCCG GTTGCTGGCG
CGGCTGCCGG AAGAGGAGAG GGGGGGCGGG GCGCTGCTCG GGCACCTCGC GCGCCTCGCC
GTCAGCCGCC CGCCCGCCGC CCTGTGGCGG GCGTTCTGGG ACAGCGCCCG GCCGCGCCCG
GCGGCGGCCT CCTACGCCGC CTGGATCGAG ACGATCGAGC GCCCAGCCCT GCCGAGCCCG
GAAGCGATGC GGGCGGCGAT CGCGGGCTTC CGGGTGCGGC CGCGCATCTC CATCGTCATG
CCGGTCTACA ACACGCCCAA ACCCTATCTG GAGGCGGCGC TCGCGAGCAT CCGGGCGCAG
GCCTATCCCG ATTGGGAGCT GTGCCTCTGC GACGACGCAT CGAGCGCGCC CCACATCGCG
CCGATGCTGG ACGCGCTGGC GGCGGAGGAG CCACGGGTGC GGCTGCATCG GCGATCGAAG
AACGGCGGCA TCGTCGCGGC GAGCAACGAT GCGCTCGCCT TGGCCACCGG CGACTGGCTC
ACCCTGATCG ACCACGACGA CGCGATCCCG CCGCACGCGT TCTACGCGCT CATTGCCGCC
CTGAACGCGG ATCCGCAGAT CGACTTCCTC TACTCGGACG AGGACAAGAT CACGGTGGGC
GGCGAGCGCT ACGAGCCGTT CTTCAAGCCG GACTGGTCGC CCGAGACGCT GGAAGCCTGC
ATGTACACGG CCCATCTGGC GCTCTACCGG ATGGATATCG TCGCCCGCAT CGGCGCCTTC
CGGGCGGAAT GCGAGGGCGC GCAGGATTAC GATTTCGTCC TGCGCTACAC CGAGCACGTC
GCCCGCGTGC ACCACGTGCC CGAGGTGCTC TACCACTGGC GCGCGATCCC CGGCTCGACC
GCACAGGCCA TGGACAACAA GGGCTACGTG GTCGCCGCCG CGGTCCGGGC GCTCGAAGAC
CGGGCGCGGC GCACCGGCGG GCTCGATTTC GTGCGCCCCA CCGCCTTTGC CGGCAGCTTC
CACCTGCGCC GGCCGCTCGC CGCACGCCCG CTGGTCTCGA TCGTGATCCC CACCGCCGGG
CGCGACAGCG AGGTTCGCGG CCGCACCGTC GATCTCCTGG CCGCCTGCCT CGCCAGCATC
CGCGAGCGGA CCACCTACGA GGCGATCGAA ATCGTGGCGG TCGATAATGG CGACCTGCGG
CCGACGACGA AAGAAGCGCT GGAGCGCTAC GATGCCCGCT CCGTCACCTG GGACCAGCCG
GTCTTCAACG TCGCCGCCAA GATGAATCTC GGCGCGCGTG CGGCCACCGG CGAGGTGCTG
ATCTTCCTCA ACGACGACAT CGAGATCATC AGCCCCGACT GGATCGAGGC GATGCTGGCG
CTTCTGCAGA TCCCCGGCGT CGGCGCGGTC GGCCCCAAGC TCCTGTTCGA GACCGGCGAG
TTGCAGCATG TCGGCGTCAC GGTGATCGAC GCGACGCCGG ACCATCCCCG CCGCTCCTAC
CCGCGGGAAG ACCCCGGCCA CTTCTTCTCG ACCGCGGGCA ACCGCAACTA CCTCGCGGTG
ACGGGGGCCT GCGTGATGGT GCGGCGGGCC GAGTTCAAGG CGATCCAGGG GTTCGACGAG
GGATTCGCCG TCAACTACAA CGACGTCGAT CTCTGCCTGC GGCTGTGGGA GCGGGGCCTG
CGCAGCGTCT ACTGCGCGGA GGTCGAGCTC TACCATTACG AGAGCCGCAA CCGAGCCCGC
ACCGTCGCGG CGGACGAGCA GGCCCGCTTC CGCGCGCGCT GGGCGGAGGC AATCCCGCGC
GACCCCTATT ACAGCGCGTG GTTCGAGGCG CTGCCGCCGA CCTTCGAGCT CGATCCGGCG
CGGTTTTAG
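
The gene sequence is printed in space-separated blocks of ten bases, so it needs to be joined before any computation. The following is a minimal sketch of how the GC content field could be recomputed from such text; `clean_sequence` and `gc_percent` are hypothetical helper names, and only the first line of the sequence is used as sample input, so the printed percentage applies to that line and not to the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, used only as sample input.
first_line = "ATGATCGCGT GCCGCATCGA ACCCAAGGGA CAGATCGAGC CGGAGGGGGA GGGCGTCTGG"
seq = clean_sequence(first_line)
print(len(seq))                   # 60
print(round(gc_percent(seq), 1))  # GC percentage of this one line
```

Passing the full sequence text to `clean_sequence` and then to `gc_percent` gives the figure to compare with the GC content field.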
 
Protein sequence
MIACRIEPKG QIEPEGEGVW RSTGSDPQFR LTHPRGGRLF PFRLAGGWVR IRADLEGLGH 
TKPVPLLYVD DGSGFREDEA VRLELREDGG IDDLVRLPRR VRGLRLDPLA ASGRFRLSRA
TLEPLHGPAA ALAVARRLLA RLPEEERGGG ALLGHLARLA VSRPPAALWR AFWDSARPRP
AAASYAAWIE TIERPALPSP EAMRAAIAGF RVRPRISIVM PVYNTPKPYL EAALASIRAQ
AYPDWELCLC DDASSAPHIA PMLDALAAEE PRVRLHRRSK NGGIVAASND ALALATGDWL
TLIDHDDAIP PHAFYALIAA LNADPQIDFL YSDEDKITVG GERYEPFFKP DWSPETLEAC
MYTAHLALYR MDIVARIGAF RAECEGAQDY DFVLRYTEHV ARVHHVPEVL YHWRAIPGST
AQAMDNKGYV VAAAVRALED RARRTGGLDF VRPTAFAGSF HLRRPLAARP LVSIVIPTAG
RDSEVRGRTV DLLAACLASI RERTTYEAIE IVAVDNGDLR PTTKEALERY DARSVTWDQP
VFNVAAKMNL GARAATGEVL IFLNDDIEII SPDWIEAMLA LLQIPGVGAV GPKLLFETGE
LQHVGVTVID ATPDHPRRSY PREDPGHFFS TAGNRNYLAV TGACVMVRRA EFKAIQGFDE
GFAVNYNDVD LCLRLWERGL RSVYCAEVEL YHYESRNRAR TVAADEQARF RARWAEAIPR
DPYYSAWFEA LPPTFELDPA RF
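
The protein sequence can be related back to the gene sequence by codon-by-codon translation. The sketch below is an assumption-laden illustration, not the database's own method: it builds the codon table from the standard amino-acid string, whose assignments translation table 11 shares, and it does not handle alternative start codons (the gene here starts with ATG, so none is needed). The name `translate` is hypothetical.

```python
BASES = "TCAG"
# Amino acids for codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
# Translation table 11 uses the same assignments as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(text: str) -> str:
    """Translate a nucleotide sequence, dropping any trailing stop codon."""
    seq = "".join(text.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))
    return protein.rstrip("*")


# First 30 bases and last 9 bases of the gene sequence in this record.
print(translate("ATGATCGCGT GCCGCATCGA ACCCAAGGGA"))  # MIACRIEPKG
print(translate("CGGTTTTAG"))                         # RF
```

The two outputs agree with the first ten and last two residues of the protein sequence listed above.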