Gene Mext_1949 details

Gene Information

Locus tag: Mext_1949
Symbol:
ID: 5832022
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium extorquens PA1
Kingdom: Bacteria
Replicon accession: NC_010172
Strand:
Start bp: 2178006
End bp: 2180159
Gene Length: 2154 bp
Protein Length: 717 aa
Translation table: 11
GC content: 71%
IMG OID: 641367749
Product: glycosyl transferase family protein
Protein accession: YP_001639419
Protein GI: 163851376
COG category: [R] General function prediction only
COG ID: [COG1216] Predicted glycosyltransferases
TIGRFAM ID:
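
The coordinate and length fields in this record agree with each other, and the arithmetic can be checked with a minimal Python sketch. It assumes that Start bp and End bp are 1-based, inclusive coordinates and that the final codon is a stop codon that is not counted in Protein Length. Both are assumptions about how the record was generated, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length, assuming one trailing stop codon that is not counted."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(2178006, 2180159)
print(length)                     # 2154
print(protein_length_aa(length))  # 717
```

Both values match the Gene Length and Protein Length fields of the record.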


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.603088
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 39
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGGAAC GGGCCGGGTC CATCGCGTCG CTCGCGGGCC GGGCGTTGCC GGTGACGGGT 
GCGGACGGGG CCGAGCTGCG CATCGACCTC TCCGGCGCCG ACCTCGGCGG CGCCTGGGTG
AGCCTGACGT GGCGCGATGC CGGCAGCGCC GGCATCGCCC GCGCCCTCGT CAGCGCGGTC
TCGGGCGAGG GAGAGAGCGT GGCGTTGGCC GAAGAGCCGC TCGGCGGCGA CGCCTTCGCC
TGGACCGGCC GATTGCCGCC GGGCGTCGCG GCTTTGCGCC TGAGCACCCT GTCGCGAACC
AATCCGTTCG TGCTGCAAGA CCTTTCCATC CGCCGCCGCG GTCGCCTCGG GATCGTGGCG
CGAGCGGGCG CCCGCCAGCC CGGCCTCACG GCCCAGGCGG TCTACTGGCG CATCCTCGGC
CTGAAGGTCC GCGCCCGCGG GCTGATCGCG CGGGCCCTCT CCCACCGGGC CGAGACCGGC
TACGCGGCCT GGATCGCCCG GTTCGACCGC CTTACGGCGG CGGAGCGGGC GCGAATCCAC
GCCGAGATCG CGGGGTGGGA GGCGCCGCCG CGGGTCTCCG TCCTGATGCC GGTGCACGAT
CCCGATCCGC GCGTGCTGGA GGCGGCGATC CGCTCGGTGC GGAACCAGCT CTATCCGGCC
TGGGAACTCT GCATCGCCGA CGACGCCTCA ACCGACCCGC GCATTCCGCG GCTGATCGCC
CGCCACGCGG CGGAGGAGCC ACGCATCCGC TCCGTGCGCC GATCGGAGAA CGGCCACATC
GCCCGCGCGA CCAACGACGC TCTGATGCTG GCGAGCGGCA CCTACACCGC CTTCCTCGAC
CACGACGACC TGCTCTCGGA AAACGCCCTG TTCGAGGTCG CCGGAGCCAT CCGGACCGAT
CCGGATCTGG AGCTGATCTA CAGCGACGAG GACAAGGTCG ATGGACGCGG CCGCCGCTTC
GAGCCGCATT TCAAGTCGGG CTACGACCGC GAATTGCTGT GGGCCCAGAA CTACGTGAAC
CATCTCTGCG TGGTCCGCAC CGACACGCTG CGCCGGCTCG GCGGCCTGCG GCCCGGTTTC
GAGGGCAGCC AGGATCACGA TCTGCTGCTG CGCCTGACCG AAGGACTGGC AGCAGAGCGG
GTGCGCCACA TCCCGAAGGT GCTCTATCAC TGGCGGGCGG CGGCCGGCTC CGGCACCTTC
TCGGATCGGG CTCTGGCGCG GGCCGAGGCG GCGCGCCTGC AGGCGCTCAC CGAAGTGGCC
GCGCGCAGGG GTGCCCGGGC CGAACGGGGA GAGAAGGGGT TCAACCGGCT GGTCCGCCTC
CTGCCGGAGC CGCCGCCCCT CGTCTCGGTC GTCATCCCGA CCCGCGACCG GGCGGAACTG
CTCGGCGTCG TCCTCGACGG GCTGTTCGCG CGCACCGACT ATCCCGCCCT GGAGGTCGTC
GTCGTCGATA ACGGCAGCAC CGAGCCGGCG ACGCGGGATC TGTTCGCGCG CTACGGTTCC
GAGCGGCGCC TGCGCGTGCT GCCCGCACCG GGTCCGTTCA ACTTCTCGGA TCTGTCGAAC
CGGGGAGCCG CCGCGGCGCG GGGCACGATC CTGCTGTTCC TCAACAACGA CATCGAGGTG
ATGGAGCCGG GCTGGCTCAC CGAACTCGTT TCAATCGCGA GCGACCCCGA GATCGGCGCG
GTTGGCGCGA AGCTCCTCTA CCCCGACGGC ACGATCCAGC ACGGGGGGAT CGTGCTCGGG
ATCGGCGGCA TTGCCGGCCA CAGCCATCTC GGCCTGCCGG GCAACGAACC CGGCTACTTC
GCGCGGATGC TGCTGTCGCA GGAGGTCTCG GCAGTGACCG GCGCCTGCCT CGCTATGCGC
GCAAAGGTCT TTTCTGAAGT CGGCGGCTTC GATGCCGCGC ATCTCGCCGT GGCCTTCAAC
GACGTGGATC TGTGCCTGCG GATTCGTGCG GCCGGTTACC GCATCGTCTG GACGCCGCAG
GCCCGCCTCC TCCACCACGA ATCGAAGAGC CGGGGCGCCG AGGACACGCC GGAGAAGCGC
GCCCGCTTCG AGGCCGAATC ACGGGTGATG CGCGAGCGCT GGGAGCCGGT GCTGCGGGCG
GACCCCTATT ACAATCCGAA CCTTTCGCGC GCGGCGGCGC ATTACCGGCT GTAG
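
The gene sequence above is printed in space-separated blocks of ten bases, so it has to be joined before any statistic such as the GC content field can be recomputed. The following Python sketch shows one way to do that. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the sketch assumes GC content means the fraction of G and C bases over the full sequence length; the database may compute or round it differently.

```python
def clean_sequence(text: str) -> str:
    """Join block-formatted sequence text into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (assumed definition of GC content)."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGGCGGAAC GGGCCGGGTC CATCGCGTCG CTCGCGGGCC GGGCGTTGCC GGTGACGGGT"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"{gc_fraction(seq):.1%}")
```

Passing the complete gene sequence to `gc_fraction` gives a value that can be compared with the GC content field of the record.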
 
Protein sequence
MAERAGSIAS LAGRALPVTG ADGAELRIDL SGADLGGAWV SLTWRDAGSA GIARALVSAV 
SGEGESVALA EEPLGGDAFA WTGRLPPGVA ALRLSTLSRT NPFVLQDLSI RRRGRLGIVA
RAGARQPGLT AQAVYWRILG LKVRARGLIA RALSHRAETG YAAWIARFDR LTAAERARIH
AEIAGWEAPP RVSVLMPVHD PDPRVLEAAI RSVRNQLYPA WELCIADDAS TDPRIPRLIA
RHAAEEPRIR SVRRSENGHI ARATNDALML ASGTYTAFLD HDDLLSENAL FEVAGAIRTD
PDLELIYSDE DKVDGRGRRF EPHFKSGYDR ELLWAQNYVN HLCVVRTDTL RRLGGLRPGF
EGSQDHDLLL RLTEGLAAER VRHIPKVLYH WRAAAGSGTF SDRALARAEA ARLQALTEVA
ARRGARAERG EKGFNRLVRL LPEPPPLVSV VIPTRDRAEL LGVVLDGLFA RTDYPALEVV
VVDNGSTEPA TRDLFARYGS ERRLRVLPAP GPFNFSDLSN RGAAAARGTI LLFLNNDIEV
MEPGWLTELV SIASDPEIGA VGAKLLYPDG TIQHGGIVLG IGGIAGHSHL GLPGNEPGYF
ARMLLSQEVS AVTGACLAMR AKVFSEVGGF DAAHLAVAFN DVDLCLRIRA AGYRIVWTPQ
ARLLHHESKS RGAEDTPEKR ARFEAESRVM RERWEPVLRA DPYYNPNLSR AAAHYRL
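
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The Python sketch below builds the codon table from the NCBI amino-acid string in TCAG base order. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in its permitted start codons, which does not matter here because the gene starts with ATG. The function name `translate` is illustrative, and the sketch assumes the input contains only A, C, G and T.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons TTT, TTC, TTA, TTG, TCT, ... in NCBI order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence; stop codons are returned as '*'."""
    seq = "".join(dna.split()).upper()  # drop block spacing and newlines
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First three blocks of the gene sequence, copied from the record.
print(translate("ATGGCGGAAC GGGCCGGGTC CATCGCGTCG"))  # MAERAGSIAS
```

The output matches the first block of the protein sequence. Translating the full gene sequence and removing the trailing `*` gives a string that can be compared with the protein sequence listed above.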