Gene Mext_3556 details

Gene Information

Locus tag: Mext_3556
Symbol:
ID: 5831102
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium extorquens PA1
Kingdom: Bacteria
Replicon accession: NC_010172
Strand:
Start bp: 3934208
End bp: 3936475
Gene Length: 2268 bp
Protein Length: 755 aa
Translation table: 11
GC content: 73%
IMG OID: 641369350
Product: lipopolysaccharide biosynthesis protein
Protein accession: YP_001641007
Protein GI: 163852964
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis
TIGRFAM ID:
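
The start, end, and length fields in this record agree with each other, which a little arithmetic confirms. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming one uncounted terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(3934208, 3936475)
print(length_bp)                  # 2268
print(protein_length(length_bp))  # 755
```

Both results match the Gene Length and Protein Length values listed for Mext_3556.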


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 54
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAATGAGG CGAGTGACAT CGCGCGCAGC GCGCGCCAGC GGGAGCAGCC GACGCTGCGG 
GAACCTGGCG CGACCAGGGA GACCATCGCG GCGCTGAGGG AGGTCGTAGG CCCAGGGGAG
GCGCGCCACG AGCCGCCGTT CCCTGCGCAC GAGCAGTGCA CAGGGTTGCT CCCTGCTCCG
GCGGCCGGCC TCACGCCCGC GGCGTCGGCG GCTTACGCCG ATAGCGGGCT CCGGATGCGC
GACCTCCTCT CGACCCTCAG GCGGCGGGCG CGCCTCGTCG GGGCCGTGGC GGTCGCGGGG
GCATTGCTGA CGGGCACGCT GTCCGCCCTC CTGCCGCCCT CCTACGTGAC GACCGCGCAT
GTCATGGTCG AGCCGGCCGC CTCCGAGGTC GCTGCAGGCC TGCTGGCGCC CGTCGTAGAC
ACTCACATCA TCCTGCTAAC CTCCGAGGCC CGCCTCAGGC GCGTGTTCGA TGACTTGGCC
GCCGAGCCTC GATACATCGA GGCCTCGGCG GCCGTCGGGG CCCATCCCTC ATTCGCCGAA
CGCGCCAAGT TCCTGCTGCG CGACGCGGCG CGCGAACTGG GCTTGGCGTC CCGCCCGGCA
GAGCCCGAGT CCGAGACGCC GTCGACCGAC CTCGGGCCTG GCCAGGCCCG GGCGGCGATG
CTGGTGCGCC AGGAGCGCCA GTCCAGCATC ATCAGCGTCA GCTTTCAAGA CCGCAGCGCG
CAGAGGGCGG CGCTTGTCGC CAACCGTATG GTGACCCGAC ACGTGCAAGA ACTGAGCGAG
CGCCGGACGC AGGAGGCCGC CGGACGCAAA GACTTCCAGG AGCAGCGGGT GGAGGAGGCT
CGGGCGGAGG TTGAAGAGGC CGAGCGGCTG GTGCGCGCCT TTCAGGTCGA GAACGGGGCG
TCCGTCACCG ACCGGGAGGG CGAGTCCGTC GCGGAGATGA CCCAACAGCT GGTCCTGTTG
CGCTCGGAGA TCGCCTCACA CGAGCGGGAT GCCGGGGCAC CGGCCGCCGC CGACGAGCTT
CGGGTCATCC GGCTCCAGGC CGATGCGCTC GAGGCGCGCC TGGCCGAGCT CAAGGCCGTT
CAGGGCGCTA CCATCGATCG GCGCATCGAG AGGCACGCCC TGGACCTGCG CCTGGACACG
GCCCGCAAGA ACCTGACCGA GCAGCTGCGC CAGCTGGAAG AGTCGGGCAA GCCCCAGCCT
GCTTTCGCCT CCTCCGCCCG GGTCGTCGCC TCGGCCGGGG TTCCCACGCG CCCGAACTCC
CTGCACCCCG CCGTGGTGGC CGTGCCCGCC CTAGGGGCCT TCGGCATCCT GGGCGCGATG
GTTGCGCTGC TGGTGGAGCG GCTCGCCACG GGCTACCGGA GCGAGCGCGA GGTGGAAGAG
GAACTGGGCG TTCCCTGCCT CGGCCTCGTC CCGCGCGCGC CGGTCGTGAG GGCGTTCGGC
ACCGCGGCCG ACGATGCGCG CTCACCCTGG AGCCGCGCGG TGGGCTCGCT CGCGATGACA
TTGCTGCACC GTTGCGGGCG GCCCGGCTCC CCCTGCGTGG TGCTGGTGAC GGCCTGCGCT
CGCGAGGAGG ACAAGGCGGG GTTGGCGACT GCCCTCGCCG TACGCGCCCG TCAGGATGGA
CTCCGCGTCC TGCTGGTGCG CTGGGACGAC GACGCCGCGC TTGGACAGGG TCTCCGCTCC
TACCCGCTTT CCGACGGTGG GGCCGCCTTG AAATCACTTT CCGCCGCCTT CGCGCGCGAT
CCGGCACTCG GCGTGGACCG CCCCGTCAGC GAGGGCGTCG GGCGCGATCT CGTGCTCGGC
CTGGGAGGTG ACCGCTCCGT CCGAGTCAGG CATATCCTGG CTTACGAGTA CGACCTGGTG
GTGGTGGACG CCCCGCCAGT GCTGGCCTCC TCGCAGGCGC GTTTGGTCGC GGATGAAGCC
GACGCCGCGC TGCTCGCGCT CGCCTGGGGG CGCACGGACC GCAAGGTCGC GGACCAGGCG
CTGCGCCTCC TGCGCCGGCC CATGCCGCTC GCGGGCGACG AGCCAGCTCC GGGCGAGCGC
GCCATCCTCG CCGTGCTCAC AGACGTGAAC CTGAAGGCCC ATGCCCGCTA TCGCCTGGGT
GACGTCGGCG AGCACCTGTT CGAGGCCCAG CGTCGGCGGG ACCGATCGCG CCGGACCGCT
CCACGGCCCT CCGCCCAGAG CCAGACGCAG GCCGATTTCG CGTCCGACAA GCGCCCCGAT
GCCGAGCCCG CGGCATCCCC GTTCCCGCGC AGGCGAGCAG GGGCGTGA
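
The gene sequence is printed in space-separated blocks of 10 bases, so the whitespace has to be removed before any analysis. As a minimal sketch (the helper names are hypothetical), the code below cleans a pasted sequence and computes its GC fraction. The example uses only the first line of the sequence, so its GC value need not equal the 73% reported for the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block spaces, newlines) and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in a sequence, after whitespace removal."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


first_line = "GTGAATGAGG CGAGTGACAT CGCGCGCAGC GCGCGCCAGC GGGAGCAGCC GACGCTGCGG"
print(len(clean_sequence(first_line)))  # 60
print(f"{100 * gc_fraction(first_line):.1f}% GC in the first 60 bases")
```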
 
Protein sequence
MNEASDIARS ARQREQPTLR EPGATRETIA ALREVVGPGE ARHEPPFPAH EQCTGLLPAP 
AAGLTPAASA AYADSGLRMR DLLSTLRRRA RLVGAVAVAG ALLTGTLSAL LPPSYVTTAH
VMVEPAASEV AAGLLAPVVD THIILLTSEA RLRRVFDDLA AEPRYIEASA AVGAHPSFAE
RAKFLLRDAA RELGLASRPA EPESETPSTD LGPGQARAAM LVRQERQSSI ISVSFQDRSA
QRAALVANRM VTRHVQELSE RRTQEAAGRK DFQEQRVEEA RAEVEEAERL VRAFQVENGA
SVTDREGESV AEMTQQLVLL RSEIASHERD AGAPAAADEL RVIRLQADAL EARLAELKAV
QGATIDRRIE RHALDLRLDT ARKNLTEQLR QLEESGKPQP AFASSARVVA SAGVPTRPNS
LHPAVVAVPA LGAFGILGAM VALLVERLAT GYRSEREVEE ELGVPCLGLV PRAPVVRAFG
TAADDARSPW SRAVGSLAMT LLHRCGRPGS PCVVLVTACA REEDKAGLAT ALAVRARQDG
LRVLLVRWDD DAALGQGLRS YPLSDGGAAL KSLSAAFARD PALGVDRPVS EGVGRDLVLG
LGGDRSVRVR HILAYEYDLV VVDAPPVLAS SQARLVADEA DAALLALAWG RTDRKVADQA
LRLLRRPMPL AGDEPAPGER AILAVLTDVN LKAHARYRLG DVGEHLFEAQ RRRDRSRRTA
PRPSAQSQTQ ADFASDKRPD AEPAASPFPR RRAGA
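
The protein sequence follows from the gene sequence under translation table 11, the code listed in the record. The sketch below is a minimal, self-written translator rather than the database's own tool: it builds the codon table from the standard NCBI amino-acid string, reads the initiator codon (here GTG) as M, and stops at the first stop codon. The function name is an assumption, and codons containing anything other than A, C, G, or T would raise a KeyError.

```python
from itertools import product

# NCBI translation table 11 (same amino-acid assignments as the standard code),
# listed in TCAG order with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiator codons permitted by table 11; any of them is read as Met at position 1.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(text: str) -> str:
    """Translate a CDS (whitespace allowed) up to, not including, the first stop."""
    seq = "".join(text.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residues.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence give the first 10 residues of the protein.
print(translate_cds("GTGAATGAGG CGAGTGACAT CGCGCGCAGC"))  # MNEASDIARS
```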