Gene Mext_4173 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_4173 
Symbol 
ID5832441 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp4643609 
End bp4645417 
Gene Length1809 bp 
Protein Length602 aa 
Translation table11 
GC content73% 
IMG OID641369963 
Productglycosyl transferase family protein 
Protein accessionYP_001641613 
Protein GI163853570 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCGCC TTGGCGTCGC GCCGGCCGCG CTGTCACGGG CGGAGGGTGG CTGGCCGCTC 
GACGGCGCGC TGCGCCTTGG CGACCGGGTG CTGTCCTTCG GGGCCGCGAG CCATCTCCGC
GCCTGCCTCC TGCTCCTGCT GATCGGTCTC GCCAGCTTCC TGCCGGGCCT TGCCTCGCTC
CAGCCGATGG ACCGGGACGA GCCGCGCTTT GCCCAAGCCT CCAAGCAGAT GCTGGAGACG
GGCGACCTCG TCGATATCCG CTTCCAGGCC GAGGCTCGCC ACAAGAAGCC GGTCGGGATC
TACTGGGCCC AGGCCGCCGT CGTCGCGGCC GGCGAGGCGC TCGGTGTGCC GCAGGCGCGC
ACGCAGATCG GGCTGTACCG GATTCCCTCG CTCCTCGGCG CGCTGGCGGC GATCCTGCTG
ACCTACTGGG CGGGCCTCGC CCTGCTCGAC CGGCGCCGGG CGCTGCTGGC CGCCGCCCTG
TTTTCCGCCT GCATCATGCT CTCGGCGGAA GCGCGCCTTG CCAAGACCGA CGCGCTGCTC
ACCGCCTGCT CGGTCGCCGC CTTCGGCGCG CTTGCCCGCG CCTGGCTCGG GCGCGCCCGG
TTGGAGCGGC GCCGGGGCCC GGCCTCGCTC GGAACGGCCT TGGTCTTCTG GCTCGGGCTC
GCGCTCGGCA TCCTCGTGAA GGGGCCGATG GTGCCGCTCT TCGCAGGGCT CGCCGTCTTC
GTGCTGTGTC TGCGCGAGGG CTCGGCCCGC TGGCTGCTCG ACCTGCGCCC GCGCTTGGGC
CTCCTCATCA CGCTCGCCGT CGTGGCGCCC TGGTTCCTGG CGATCGCCTG GAAGAGCGGT
GGCGCCTTCT TCGGCGAGGC GGTGGGGCGC GACATGCTCG GCAAGGTCGG CACCGGCGCC
GAGAAGCATT GGGGCCCGCC CGGCGCCTAC GCGCTGGCCT TCTTCGCCAC CTTCTGGCCG
GGCGCCGCCT TCGCCGCCCT CAGCCTTCCC TTCGCCTGGG CGCGGCGGGG CGAGGAGGCG
GTGGCGCTGT TGCTCGCCTG GATCGTGCCG ATGTGGCTGA TCTTCGAGGC GGTGCCGACC
AAGCTGCCGC ATTACGTCCT CCCCCTGATG CCGGCGGTGG CGATCCTGAC CGTGCTGGCG
CTGTCGCGTG GCGCGCTCGA TCCGCGACGT CCGGGCGCGC GCTGGGTGGC GGGGCTCGTG
GGGTTGATTC CGGTCGGGCT GACGCTGGGC CTCAGCCTCG CCGCGTGGCG TCTCGACCAT
GTGCTGCCCC TCGCCGCCCT GCCGCTTCTG CTCGCCGCCT GCCTCCTCGC CGGCCTCGCC
TGGGCCGCCT TCGCCCGCGG GGCGAGAGAA GGGGCGGGGC AAGGGGCGGG GCAAGGGACA
CGGCAAGAGG CAGGGGAGGG CGCTCTGGTC CTCGCCGTCG CCGCTTCGGT GGTGCTGTCG
GGCGCCGTGT TCGGCCTGAC CCAGCCGGTG CTGCAAAGCC TCAAGGTCTC GCCACGGCTC
GCCGCGATCC GCGATGCCCT GCCCTGCGAG GCCCCGCGTG TGGCAAGCCT CGGCCTTCGC
GAGCCGAGCC TCGTCTTCAC CGTCGGCACG GATCTGGCCA TGCTGAATTC CGGCGCGGAG
GCCATTGCCT TCCTACGGGA GGGCGGCTGT CGCCTCGTGC TGGTCGAGGA CCGGTTCGCC
GCCGAATTCA CGGCGGCCGA AGGCGGGCAA CCGCTTACCC CCATCGGTCG GGTCACCGGC
TTCAACATCA ACGGCGGCAA GCCGGTCGGG GTCTCCGCCT ACGCCGCGCT GCCGGGTTCC
ACGCCATGA
 
Protein sequence
MTRLGVAPAA LSRAEGGWPL DGALRLGDRV LSFGAASHLR ACLLLLLIGL ASFLPGLASL 
QPMDRDEPRF AQASKQMLET GDLVDIRFQA EARHKKPVGI YWAQAAVVAA GEALGVPQAR
TQIGLYRIPS LLGALAAILL TYWAGLALLD RRRALLAAAL FSACIMLSAE ARLAKTDALL
TACSVAAFGA LARAWLGRAR LERRRGPASL GTALVFWLGL ALGILVKGPM VPLFAGLAVF
VLCLREGSAR WLLDLRPRLG LLITLAVVAP WFLAIAWKSG GAFFGEAVGR DMLGKVGTGA
EKHWGPPGAY ALAFFATFWP GAAFAALSLP FAWARRGEEA VALLLAWIVP MWLIFEAVPT
KLPHYVLPLM PAVAILTVLA LSRGALDPRR PGARWVAGLV GLIPVGLTLG LSLAAWRLDH
VLPLAALPLL LAACLLAGLA WAAFARGARE GAGQGAGQGT RQEAGEGALV LAVAASVVLS
GAVFGLTQPV LQSLKVSPRL AAIRDALPCE APRVASLGLR EPSLVFTVGT DLAMLNSGAE
AIAFLREGGC RLVLVEDRFA AEFTAAEGGQ PLTPIGRVTG FNINGGKPVG VSAYAALPGS
TP