Gene Mext_4589 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_4589 
Symbol 
ID5834888 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp5128075 
End bp5130198 
Gene Length2124 bp 
Protein Length707 aa 
Translation table11 
GC content73% 
IMG OID641370383 
Productlipopolysaccharide biosynthesis protein 
Protein accessionYP_001642028 
Protein GI163853985 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value0.260554 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGATGA TCGAGCGGAT GCCTTCGCGG TTCTTCGTCG GCGCCGAGCC GGGCAAGCCG 
GACGTGACGC CTGAACCCTG GTTCCTCGAC CCGCGTGAGA TCGGACGGGC CCTGCGCGCG
CGCTGGGCGC TCGTGCTGGC CCCGGCTGTG CTCCTCTTGG TGGCGGCCGT GGCGTGGCTC
GCGCTGGTGC CGCCGCTCTA CGCCGCCGTG ACGCAGATCC TGATCGACCC GCGCGGCATC
CAAGTGGTCA AGGACGGCGT GACGCCCTCG GACCAGGCGA GCGATGCGAG CCTGTTCCTC
GTCGACAGCC AGATCCGCGT CCTCATCTCC GACGAGGTGC TGCGGCAGGT CGTGACTCGG
TTCAAGCTCG ACCAGGACCC GGACTTCGTT CGTCCCGCCT CGCCGCTCGA GACGCTCAAG
AGCCGCCTCT CCTCGCTGAT CGTCACCGCC GGCGGCCCTG CCGACGACAC GCTCACCGCC
CTGCGCACGC TGCGCGATCG CACGACCGCG CGCCGCCTGG AGCGCAGCTT CGTGGTCGAA
CTCGCCGTCT CCAGCGAGGA ACGCCGGAAA TCCGCCGAGC TCGCCCAGGC CATCGCCGAA
ACCTACCTCA CCACCGTCTC GCAGGCGCAG GCGCAGGTCA CCCGCAAGGC CGGCGAGGCG
GTGTCGAGCC GGCTCGGCGA GTTGCAGGAC GACCTCCGGC AGGCCGAGGA CAAGGCGCAG
AAGTTCCGCG CCGCCAACAA CCTCGTCGGC ACCCGCGGCC AGCTCGTCAG CGAGCAGGCG
CTGACCCAGC TCAACCAGCA GCTCGGCGCG GCGCGTGCCC GGGCCGGCGA GCTGCGCGGG
CGGCTCGCCC AGATCGAGGC GGTCGCCAAC GGGCGGGCCG ACCTCAATTC GGTGACCGAG
ATCGTCCAGT CCACGACGGT CGCGCAATTG CGCGCCCAGC TCGCCCAGAT CGAGGCGGCC
CGGGCCGACA CCCTGTCCAA CCTCGGCCCC CGTCACCCCA CCCTGCGCAC CGGCGAGCTG
CAGGTGCAGA CCCTGCGCAA CGACATCAAC GCCGAAATCC GCCGCATCGC CGCGGCCACC
CGCAACGACT ACCGGTCGGC CTTGTCCAAC GAGGCCTCAC TCGCCGCCAC CCTGGAGAGC
CGCAAGAAGG AGGCTCTGTC CGTCGACAAG AGCTTCGTGC GCCTGCGCGA ACTGGAGCGG
CAGGTCGAAG CGAGCCGCGC GGTCTACGAG GCCTTCCTCG TCCGCGCCCG CGAGCTTCAG
GAGCAGCAGC GCCTCGACAC CTCGACCTCG CGCGTCATCT CGCCCGCCTC GCTGCCGGAG
CGCCGGCTCG GCCCGCCGAT CCCGGCCATC TTCGCCGCGG CGCTGGCGGC CGGGCTCGGC
TTCGGCACCG CGCTCGCCCT CCTCGCCGTG CCGGCCGCGG GGCGGATCGG TTCGCGCCGC
CGGTTTCAGC AGCTCGCGGG GCTCCCCGTG GTCGCCGCCC TGCCGGCCAA GGTGCCGACC
CGGACGCGGA GCAAGGCCGG CAGCGAATCC CTGCGCGCCG ACACCGCCTA CGACGTGGCC
GTGGCCCGTC TCGGCAGCCG TCTGCAGCGC GATTTCGGCG CCACGCGGCC GACGGTGGTC
CTCGTCACCT CGGCGGACGA CCGGAGCGGC AAGTCGGAGC TGGCGCGCAG CCTCGCCGCC
TCGGCTGCGC TCGACGGCCA GCGGGTGCTG CTCGTCGATG CCGACCCGGA GGCGATGATC
TCGGGCGATC TCCGGAGCCA GGCCAAGCGC GGCGCCGCCG ACGTGCTGCG GACGCATTCG
GGGCTCGGCG ACGCGTTGGT CGAGGGGCCG ACCGGGGTCA AGATCCTGCC CTACGACGAC
GCGGCCCTGC GCCTCGGCAC CGCGGCCTAT ACCAGTGCGA TCCTGACGGC GGCTTCTGCC
TTCGACACGG TGTTCGTCGA TATCGGGCTG ATCGGCACCG ACATCGCCGC CGAGCGTCTC
GCCCAGGACC AGCGCTTCCC GGCGCTGCTT CTGACGGCCA GCGCCGCCCG CAGCGGCACC
GCCCGGCTGC GGCGGGCGCT CGACGCCCTC GGCCGCGACC CGCGGGTGCA GCTCGTCATG
ACCGACGCCG AGGCCGAGGG GTGA
 
Protein sequence
MTMIERMPSR FFVGAEPGKP DVTPEPWFLD PREIGRALRA RWALVLAPAV LLLVAAVAWL 
ALVPPLYAAV TQILIDPRGI QVVKDGVTPS DQASDASLFL VDSQIRVLIS DEVLRQVVTR
FKLDQDPDFV RPASPLETLK SRLSSLIVTA GGPADDTLTA LRTLRDRTTA RRLERSFVVE
LAVSSEERRK SAELAQAIAE TYLTTVSQAQ AQVTRKAGEA VSSRLGELQD DLRQAEDKAQ
KFRAANNLVG TRGQLVSEQA LTQLNQQLGA ARARAGELRG RLAQIEAVAN GRADLNSVTE
IVQSTTVAQL RAQLAQIEAA RADTLSNLGP RHPTLRTGEL QVQTLRNDIN AEIRRIAAAT
RNDYRSALSN EASLAATLES RKKEALSVDK SFVRLRELER QVEASRAVYE AFLVRARELQ
EQQRLDTSTS RVISPASLPE RRLGPPIPAI FAAALAAGLG FGTALALLAV PAAGRIGSRR
RFQQLAGLPV VAALPAKVPT RTRSKAGSES LRADTAYDVA VARLGSRLQR DFGATRPTVV
LVTSADDRSG KSELARSLAA SAALDGQRVL LVDADPEAMI SGDLRSQAKR GAADVLRTHS
GLGDALVEGP TGVKILPYDD AALRLGTAAY TSAILTAASA FDTVFVDIGL IGTDIAAERL
AQDQRFPALL LTASAARSGT ARLRRALDAL GRDPRVQLVM TDAEAEG