Gene Mext_4786 details


Gene Information

Locus tag: Mext_4786
Symbol:
ID: 5835204
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium extorquens PA1
Kingdom: Bacteria
Replicon accession: NC_010172
Strand:
Start bp: 5345830
End bp: 5347290
Gene Length: 1461 bp
Protein Length: 486 aa
Translation table: 11
GC content: 71%
IMG OID: 641370583
Product: O-antigen and teichoic acid-like export protein
Protein accession: YP_001642225
Protein GI: 163854182
COG category: [R] General function prediction only
COG ID: [COG2244] Membrane protein involved in the export of O-antigen and teichoic acid
TIGRFAM ID:
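
The Start bp, End bp, Gene Length, and Protein Length fields can be cross-checked against each other. The sketch below assumes the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon; the function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


length = gene_length(5345830, 5347290)
print(length)                  # 1461
print(protein_length(length))  # 486
```

Both values agree with the Gene Length and Protein Length fields listed for Mext_4786.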


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.315626
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value: 0.636766
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCGTCA TCGCCGCCTT CGTCCTCAAT GCCGGGCTGA ACTTCATCCT CGGCATCGCC 
ATCGCCCGAA TGCTGGGGCC AGCCGATTTC GGCCGTTTCG CCCTGGCGAC GGCCGGCGCG
GTCGTGCTCA ACACGATCCT GTTCGAGTGG CTGCGGCTCT CGGCGACCCG GTTCTACTCG
GCGCGGGTGC GCGAGGCCGA GCCGTGGATC CGCCAGGGGC TGGACCGGGC TTACTGGGTC
ATCGCGCTGG CGCTGTTTGC CACGGCCGCC CTCTGCGCCG GGCTCGGGAT CGCCGTCAAT
CCGACCCCCG AGGGACGTCT GGTCATGACC GCCGGCACCA TGGTCGCGGC GATCGGCATC
GGGCTGTTCG ACTATCATGC GGCGCTCGCC CGCGCCCGCT TCATCGGCAG CGCCTATCTC
CGGCTCGTGG TGTGGAAGAA CGTCCTGGCC TTCGTGCTGA TGGCCGGCAC GGCATGGCTG
TTTCCGCAGC CGGTCTGGGT GCTGATCGCA GGGGGCTTGA GCCAGTTCCT GGCGGTGCTG
CCGATGCGCA AGATCCTGGG CGACGGGCTT TTGGGGCACG TGCCCGCCCT GCCCCATGGC
CGAGCTCGTG AAACTCTGCG CCTGTTCGCG GCCTACGGCC TGCCCTTGAT CGCGGCCAAC
GCCGTCTATC AGATTATGCC CTTCCTCAAC CGCGCCGCCA TCGCCGGCAC GGCCGGCTTT
GCCGAGGCCG GCTATTTCGC GCTCGCCGCC GATCTCGGCT CGCGGGCCTT CTCGACGCTC
GGGGCCGCGC TCGACCTGCT GCTGTTCCAG ATCGCCGTGC AGGCCGAGGA GCATCATGGC
CGCGAGGCCG CCGAGACCCA GGTCGCGCGC AACATCGCCA TCGTGGTGGC GCTGCTCCTG
CCCTGCGCCG CCGGCTACTG GGCCGTGACG CCGGCCCTCC AGGCGCTGAT CGTGCCGGCG
GAGTTTCGCG GGCCGTTCGC GGACTACACC GACCTGCTGA TCCCGGGCCT GTTCTGCCTC
TCGATCATGA ACTTCGCCCT CAATCCCATC TTCCAGATCC GTCGCCGGAC GAGCCCGGTG
GTCGCCGCCG CGCTCATCGG GCTGGCCGTC AACGCCGTCG GCCTCGTCTT GCTGCCGCGA
ATGATGGGAC CGCAGGGCGT TGCTGTTGCG CAGACCCTCG GCCTCGTCGC GGCGGTCGCC
GTGCTGGGCC TGCGGGCGCT GACGGGGATC GAGCGCCTGC GCCTGCCGGG CCGCGACCTC
GCCCTCACCG CCGCCGCCTG CCTTGCCATG GTTCTGGCCG TGCTGCCGTT CCGCGGCTTG
GAGCCGGCGC TCGCCCTGCC CGCCTGCATC GCGGCCGGAA TGCTCGTCTA CGGCGCCCTC
GTCTGGTTCC TCGACATCGC CGGCCTGCGC AGCGCCGTGC GCCAGCGTTT CCCGAAGCGG
CTGCCGGCCG CCGCGCGGTA G
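
The GC content field can be recomputed from the gene sequence. This is a minimal sketch, assuming the sequence is supplied as text in the space-separated 10-base blocks used above; `gc_content` is an illustrative name, not a function of the database.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G and C bases in a sequence, ignoring whitespace."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First 10-base block of the gene sequence: 3 G + 3 C out of 10 bases.
print(gc_content("ATGGCCGTCA"))  # 0.6
```

Passing the full gene sequence as a single string gives the fraction for the whole gene, which can then be compared with the reported percentage.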
 
Protein sequence
MAVIAAFVLN AGLNFILGIA IARMLGPADF GRFALATAGA VVLNTILFEW LRLSATRFYS 
ARVREAEPWI RQGLDRAYWV IALALFATAA LCAGLGIAVN PTPEGRLVMT AGTMVAAIGI
GLFDYHAALA RARFIGSAYL RLVVWKNVLA FVLMAGTAWL FPQPVWVLIA GGLSQFLAVL
PMRKILGDGL LGHVPALPHG RARETLRLFA AYGLPLIAAN AVYQIMPFLN RAAIAGTAGF
AEAGYFALAA DLGSRAFSTL GAALDLLLFQ IAVQAEEHHG REAAETQVAR NIAIVVALLL
PCAAGYWAVT PALQALIVPA EFRGPFADYT DLLIPGLFCL SIMNFALNPI FQIRRRTSPV
VAAALIGLAV NAVGLVLLPR MMGPQGVAVA QTLGLVAAVA VLGLRALTGI ERLRLPGRDL
ALTAAACLAM VLAVLPFRGL EPALALPACI AAGMLVYGAL VWFLDIAGLR SAVRQRFPKR
LPAAAR
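
The protein sequence corresponds to the gene sequence read codon by codon. The sketch below assumes the codon-to-amino-acid assignments of the standard genetic code, which translation table 11 shares. It does not handle alternative start codons, and `translate` and `CODON_TABLE` are illustrative names.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
# Assumption: these are the standard-code assignments, also used by table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    residues = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence.
print(translate("ATGGCCGTCA TCGCCGCCTT CGTCCTCAAT"))  # MAVIAAFVLN
# Final 21 bases of the gene sequence, ending in the TAG stop codon.
print(translate("CTGCCGGCCG CCGCGCGGTA G"))           # LPAAAR
```

The two fragments reproduce the first ten and last six residues of the protein sequence listed above.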