Gene Mext_0831 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_0831 
Symbol 
ID5832717 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp905500 
End bp907575 
Gene Length2076 bp 
Protein Length691 aa 
Translation table11 
GC content72% 
IMG OID641366613 
Productglycosyl transferase family protein 
Protein accessionYP_001638307 
Protein GI163850264 
COG category[M] Cell wall/membrane/envelope biogenesis
[R] General function prediction only 
COG ID[COG0438] Glycosyltransferase
[COG1216] Predicted glycosyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.74925 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.236177 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGAAA CCCCCGCCGT CCCGCTGCTC GACACCGTCG ATCCCGGCAC CGGTCGCCGC 
GCGGCGCTGG CCGAGGCCGC GCTGATTGTC GAGGCGATCC AGCGCAACGG CGTGCGCCTC
GCCTTCGCGC CGCAGCCCGA TCCGGATGTC TCCATCGTCA TCGTCGCCCG CGATGCCCGC
CACCTGCTGG CGCTCACCCT CTATCGCCTC TGCGCGAGCC AGGGGCTGGC AGGCGTCCGC
TTCGAGGTCG TGCTGTTCGA CAACGCCTCC GCGCCGGAGA CCCGCGCCCT CTACCCCCAT
CTCGACGGGG TGACGCTGAT CGAGAACGCC ACCAACACCG GCTTCGGGCC CGCCTGCAAC
GCGGGCGCGG CCAGGGCGCG GGGGCGCTTC ATCCTGTTCC TCAACCCGGA TGTCGATCTC
CTGCCCGGCG CGCTGGCGGC GATGGTCGCG ACCTTCCGCG ATCATGAAGG CGCTGGCATT
GTCGGCGCCC GCCTCGTCTT CCCCGGCGGC GTGCTGCAGG AATCCGGCGC AGGCTTTCGC
GACGACGCGC AACTCACCCA TCCGCACGGG CGCGGCAACG CCGACCCCTT CGCCCCCGAG
CATGCCGCGA CCCGCGATGT CGGCTACGTC TCGGGCGCGG TGCTGATGAT CGAGCGGGCC
CTGTTCGAGG CGCTCGGCGG CTTCGATCCG CTCTTCGCCC CGGCCTATTT CGAGGACACC
GACCTCTGCC TGCGCTGCCA TCAGGCCGGG CGTCGCGTCA TCGTGCAGCC GCGCGCCACG
GCGATCCATT ACGAGAACGC CACCAGCGCC CGCCGCGAGG ACGTGGAGGC GCTGCTCGAC
CGCAACCGCG CCCGCTTCCT CGACCGGCAC CGGCAGAGCC TGTTCGCGCA GGGGCCGCAG
CCGCGGGGAA CCGGCCTCCT CGACCACGAT CCCTGGCGCC TGAGGGTGCT CTACGTCGAT
GATCGGGTGC CGCATCTCGA TCTCGGCGCC GGCCTGCCGC GGGCCAACGC CATCCTCAAC
GCCATGGCGG GTCTCGGCTA CGCGGTGACG TTCTTCCCGA ACTACGAGGC GGATGCGGAG
GAGGCGCGGC GCTACCGCGA CCTCGATGAG CGCATCGAGA TTTCTTACGC CAGCGGTGAC
GAGGGGTTCG CCCGCCTGAT CGCCGAGCGG CGCGACCATT ACGACGTGCT CTGGGTCAGC
CGGCCGCACA ACATCCTGTT CGTCACGCAG GCGCTGCACG CCGCCGGGCT CGACCCGCGC
AGCTTCGTGC GCTCGAAGGT GATCTTCGAT TCCGAGGCCC TGTTCGCGCT GCGCGACTTC
GTGACGGAGG CCGCGACCGC GGGCAGCGCG GTGGCCGCCG ATCTGGCGTG GCAGGCCGAG
CGCGAGACGC GCCTGTTCGG GCTCGCCGAC GCGGTGGTCT GCGTCTCGCC GGCCGAGGCG
CGGGTGCTCG CCCGCTACAG CGCCTGCAAC GCCACTGTGC TCGGCCACGC CCTGACCCGG
CCCGAGGCGC CGACGCCGGG TTTTGCCGGC CGCGCGGGCT TCGTCTTCGT CGGCGCGCTC
GCCCGCGAAG GGCAGCCCAA CGTCGATTCC CTCGACTGGT TTTTCGGGAG CGTCTGGCCG
CTCGTGCGGG CGCGGTTGCC GGCGGCGCAG CTCACCATCG TGGGTGGGAT CGCGCCGGAA
ATCCGGGAGC GCTACGCGCG CGAGCCGGGC GTGCAGGTCA CCGGTCGGGT GCCGCAGACC
GAGCCCTATC TCGACGCCGC CCGCGTGTTC CTGGCGCCGA CGCGCTTCGC CGCCGGCATT
CCGCACAAGG TCCATGAGGC GGTGGCGCGC GGCCTGCCCT GCGTCGTCAC GCCGATCCTC
GCCGATCAGG TCGGCTGGGC GGACGGCGCC GGCTTCCTCG TGCGCGACTG GCGCAACCCA
AAACCTTTCG CGGAGGCGCT GGTGGCGCTC CACGAGGACG CGGCCTTGTG GGACGCGGTT
CGGGAGGAGG GGAGCCGGCA CATCGCCGAG GATTGCGACA CCGAGGCCTT CGCGGCCGCG
ATCCGGGCCC TGTGCGAGGC GCAGGTCGTC GCATGA
 
Protein sequence
MSETPAVPLL DTVDPGTGRR AALAEAALIV EAIQRNGVRL AFAPQPDPDV SIVIVARDAR 
HLLALTLYRL CASQGLAGVR FEVVLFDNAS APETRALYPH LDGVTLIENA TNTGFGPACN
AGAARARGRF ILFLNPDVDL LPGALAAMVA TFRDHEGAGI VGARLVFPGG VLQESGAGFR
DDAQLTHPHG RGNADPFAPE HAATRDVGYV SGAVLMIERA LFEALGGFDP LFAPAYFEDT
DLCLRCHQAG RRVIVQPRAT AIHYENATSA RREDVEALLD RNRARFLDRH RQSLFAQGPQ
PRGTGLLDHD PWRLRVLYVD DRVPHLDLGA GLPRANAILN AMAGLGYAVT FFPNYEADAE
EARRYRDLDE RIEISYASGD EGFARLIAER RDHYDVLWVS RPHNILFVTQ ALHAAGLDPR
SFVRSKVIFD SEALFALRDF VTEAATAGSA VAADLAWQAE RETRLFGLAD AVVCVSPAEA
RVLARYSACN ATVLGHALTR PEAPTPGFAG RAGFVFVGAL AREGQPNVDS LDWFFGSVWP
LVRARLPAAQ LTIVGGIAPE IRERYAREPG VQVTGRVPQT EPYLDAARVF LAPTRFAAGI
PHKVHEAVAR GLPCVVTPIL ADQVGWADGA GFLVRDWRNP KPFAEALVAL HEDAALWDAV
REEGSRHIAE DCDTEAFAAA IRALCEAQVV A