Gene Mext_4085 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_4085 
Symbol 
ID5832966 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp4544568 
End bp4545914 
Gene Length1347 bp 
Protein Length448 aa 
Translation table11 
GC content67% 
IMG OID641369876 
Productcapsule polysaccharide export protein-like protein 
Protein accessionYP_001641526 
Protein GI163853483 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3524] Capsule polysaccharide export protein 
TIGRFAM ID[TIGR01010] polysaccharide export inner-membrane protein, BexC/CtrB/KpsE family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.160773 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value0.368018 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTTCCG ACGACGAGGT CAAGCAGGCG ACGCGGGTCC GGCCGAGCCC GAGCGTCCAA 
TCGCTCATGG ATCTTGCCCG GCGCTCCGTG CCGGATCTGC GGCGCAACGC CGAGACGATC
GAGCCGGTCT CCCCGGGCAA GGGCCGCTTC CCCGGTCGGG CGCTGGTCGA GCGGGCACGC
CGCGGCCTCG AATGGACCCG GCTGCCCGGT CTGCGGCCGC GGCAGGCGCG TCCGCCCGAG
CGGATGGCCA AGCGACTGTT CCGCAGCTAC TGCCTGTTCG CGCTGGCGCC GACCGCGGTG
GTCGGCCTCT ACGTCTTCGC GATCGCCAGT CCGCAATACA TTGTCGTGTC GCAGTTCGCC
GTGCGCGGCA ACGTCGAGCC GATGGCCAGC GCCGAACTCG GCCTGCACAG CGACCTGATC
CAGAAGCACA ACAGCCAGGA CAGCTTCATC CTGCGCGACT ACATCGGCAG CCGGCCGATG
GTGGAGGCCG TCGATGCCAA GCTCGGCCTC GAACGGATGT TCTCGCAGGA CGGGATCGAT
TTCTGGGCCG GCTACGGCGC GGGTCAGCCG ATCGAGAAGC TGGTGCGCTA CTGGCGCCGG
CAGGTGATCC CGCACATCGA TGCGATTTCC GGCGTGATCC ACCTGAAGGT GCGCGCCTTC
AAGCCCGAGG ACGCGGTCGC GATCTCGGAA GAGGTGATCG CCCGCTCCGA GACCTTGATC
AACAGCATCT CGCGCCGCGC CCAGAGCGAC ATGATCGCCA ACGCGAAGAA GGAGGTCGAG
CAGTCGTTCG AGCGGCTCAA GCAGGCGCGG GTCGCGATGC AGGAGTTCCG CAACCGCTGG
GGCATCATCG ATCCCGTGAA GTCGGCCGAG GCGGCCGTGA CCACGATCGA GCTCCTGCGC
AAGGACAAGA TCAAGGCGGA GAACGACCTG CGCGTGCTGC GCGACTCCAA GCTCGACGAG
AAGAGCCGTG GCATCCAGGT GCTCGTGGCC ACCCTCGGCG CGCTGGACGG GCAGATCAAG
GATCTGCAGG GCCGCCTCAC CACCGACGGC ATCGTCTCGA ATTCCGAGCA CAACCTCACC
CAGGCCCTGC TCGAATACGA GGGCCTGATG GTCGAGCAGA CGGTGGCGGA GAAGCTCAAC
GCCTCGATGC AGATGATCCT CGACCGCGCC CGGGTCGCGG CGGCCAAGCA ACAGATCTAT
CTGGCGACCT TCGTGCCGCC CCTGCTGCCG ACTTATTCGG AATACCCGGC CCCCTTCTAC
GCCCTGTTCG CGGCCCTGTT CTGCTTCACC GTCCTGTGGA GTTCGGTCTC GCTCGTGACC
GCAGCGGTCA ACGACAACCG GCTCTAG
 
Protein sequence
MSSDDEVKQA TRVRPSPSVQ SLMDLARRSV PDLRRNAETI EPVSPGKGRF PGRALVERAR 
RGLEWTRLPG LRPRQARPPE RMAKRLFRSY CLFALAPTAV VGLYVFAIAS PQYIVVSQFA
VRGNVEPMAS AELGLHSDLI QKHNSQDSFI LRDYIGSRPM VEAVDAKLGL ERMFSQDGID
FWAGYGAGQP IEKLVRYWRR QVIPHIDAIS GVIHLKVRAF KPEDAVAISE EVIARSETLI
NSISRRAQSD MIANAKKEVE QSFERLKQAR VAMQEFRNRW GIIDPVKSAE AAVTTIELLR
KDKIKAENDL RVLRDSKLDE KSRGIQVLVA TLGALDGQIK DLQGRLTTDG IVSNSEHNLT
QALLEYEGLM VEQTVAEKLN ASMQMILDRA RVAAAKQQIY LATFVPPLLP TYSEYPAPFY
ALFAALFCFT VLWSSVSLVT AAVNDNRL