Gene Mext_2224 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_2224 
Symbol 
ID5834286 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp2468643 
End bp2469947 
Gene Length1305 bp 
Protein Length434 aa 
Translation table11 
GC content73% 
IMG OID641368023 
Productglycosyl transferase group 1 
Protein accessionYP_001639690 
Protein GI163851647 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.307523 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACGTCG TGATCCTCGC CGAATTCGCC GCCGCGAGCG GGGGTGCCGA GAAGGTCGCG 
GTGGAATCCG CCCGCGGGCT CGCCGAGGCC GGCGCAACGG TGACCTACAT CCAGGCGATC
ACCGGACCCG TCGATTCGCT GCTCGACCAT CCGCGCCTGC ACCGCATCGA CCTCGCTCTG
CCGGATGTGT GGTCGCTGGC GGCATGGCGC GGCGCGGCGT CGGGGATCTG GAACGGCGAG
GCCGCCGCGC GCCTCGCGAG TGCGCTCGAC AGCCTGCCGG TGCCGCCCGA CTGCCTTCAC
CTGCACCAGT GGACCCGCGC GCTCTCGCCC GCCGTGCTGC CGGTGCTGCT CAGCCGCGGC
GTTCCCCTGG TGCTGACGCT GCACGACTAT GCCCTCACCT GTCCGAACGG TGTCGATTAC
CGCTTCGATC GGGCCGAGCC CTGCGCGCTC GTCCCGCTGT CCGGCGCCTG CCTCGCGGCC
GCCTGCGATC CGAAGAGCCG GCGGCACAAG CTGGTGCGGG TCGGTCGCGC CGCCGCCCTG
CGGGTCGCGG CGCGAGGGGC CGATCTCGAC GTCGTCCATG TCTGCGACGG CAGCCATGCG
CGGGTGGCGG GACGGTCCGG GGCCCTGCGC CTGCGCCATC ACCGCATCGA CAACCCGGTG
CGGGTGGAGA AGCGGGCGCC GGCCCTGCCG GCTTCGGGCG ATGCGATCGT CTATGTCGGG
CGCCTAACGC CGGAGAAGGG CGCGGATCTC GTCGCCGAGG CCGCGCGGCG GGCCGGACTG
CCCGCGCTCT TCATCGGGGC CGGCCCGCTC GAAGCACGTC TGCGGGCGGA GGGCGCCGAG
GTGCTCGGCT GGCGAAGCCC GGAGGCGGTC GAGGCGATCC TGCATCGCCG CGCGCGTGCG
CTCTGCGCAC CGTCGCGCTG GGTCGAGACC GGGCCGCTCA CCGTCTACGA GGCGCTGGCC
CAGGGGATTC CCGTTGTGGC GTCGCGGCGC TCCGGCGCGG CGGAGAAGGT GGCGGACGGG
GAGACCGGCT TCGTCGTCGA GCCTGAGGTG GCGGCGCTGG CCGATGCCTT CGCGGCGCTC
AAGGCCGACG CGCTGACCGC CCGCCTCGGC CGGCAGGCCT ATGACCGGTA CTGGCAGGCC
CCGCTGACGC TCGCCGCCCA CGCGCTTTCC CTGCTGACGC TGTATCGGCG GATTGGGGAT
GAATACAAAA TGCGGCAGTG CGATATGAGC CCGGCTACCG CCGAGCCTGC GGTTGTCCAT
TCCATGGGCA GCACCCTTGG CAAAGGGCGC CTTCCGACAT TATAG
 
Protein sequence
MHVVILAEFA AASGGAEKVA VESARGLAEA GATVTYIQAI TGPVDSLLDH PRLHRIDLAL 
PDVWSLAAWR GAASGIWNGE AAARLASALD SLPVPPDCLH LHQWTRALSP AVLPVLLSRG
VPLVLTLHDY ALTCPNGVDY RFDRAEPCAL VPLSGACLAA ACDPKSRRHK LVRVGRAAAL
RVAARGADLD VVHVCDGSHA RVAGRSGALR LRHHRIDNPV RVEKRAPALP ASGDAIVYVG
RLTPEKGADL VAEAARRAGL PALFIGAGPL EARLRAEGAE VLGWRSPEAV EAILHRRARA
LCAPSRWVET GPLTVYEALA QGIPVVASRR SGAAEKVADG ETGFVVEPEV AALADAFAAL
KADALTARLG RQAYDRYWQA PLTLAAHALS LLTLYRRIGD EYKMRQCDMS PATAEPAVVH
SMGSTLGKGR LPTL