Gene Mext_0049 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_0049 
Symbol 
ID5834000 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp55414 
End bp57084 
Gene Length1671 bp 
Protein Length556 aa 
Translation table11 
GC content69% 
IMG OID641365833 
Productsulphate transporter 
Protein accessionYP_001637548 
Protein GI163849505 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0659] Sulfate permease and related transporters (MFS superfamily) 
TIGRFAM ID[TIGR00815] high affinity sulphate transporter 1 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGGCC TCGGATCGAC GACGCGGACC TCCACGACCG ACGCGCCGAC GAAACTCCGC 
TCCGACATTC TGGCAGGGCT CACCGCCGCG GCCGTCGTCC TCCCCAAGGC GATGGCTTTC
GCCACGGTGG CCGGGCTGCC GGTCGCGGTC GGCCTCTACA CCGCCTTCGT TCCCACGCTC
ATCTATGGCC TGTTGGGCTC ATCCCACGTC CTGAGCGTCA GTTCGACGAC GACGCTGGCG
ATCCTGACAG CCGCCGAAAT CGGCAGCGTG GTGCCGGATG GCGATCCGGC GCGGCTGGTT
GCCACCACAG CGACGCTGAC GGCCCTTGTC GGCGCGCTGC TGCTCGGAGC CCGGCTGGTG
AAGCTCGGCT TCATCGCCAG CTTCATCTCC GTGCCGGTCC TGACCGGGTT CAAGGCCGGA
ATCGCCTGCG TGATCCTGCT GGACCAAGCT CCCAAGCTGC TCGGGCTCCA TTCCGCGAAG
CAGTCGTTCT TCATCGATCT CGCGAGTCTC GTTCGCCACC TCCCCGAGAC GTCGCTGCCG
ACCCTGGCTG TCGCAGGGGT GACGCTGGCC GTCCTCGTCG GCGCTGAGCG CCTCAGGCCC
CATTCGCCGG TTCCGCTGGT CACGGTCGCT GCCGCCGTCG CGGCCTCTTG GCTGCTCGGC
CTCAACGCAT GGGGAGTCGC GACGGTCGGG GAGATCCCGC CGGGGCTCCC CTCCGTGAGC
ATGCCCGACC TGACGCTTGT CCAGGCGCTC CTGCCGGGCG CCATGGGCAT CGCCCTGATG
AGCTTCACGG AGAGCATCGC CGCGGGCCGG GCCTTCGTGG CTTCGGGAGA TCCGCCCATC
GATGCCAATC GTGAACTGGT CGCCACGGGT GCAGCGAATT TGGGGGGAGC CTTGCTCGGG
GCGATGCCGG CCGGCGGCGG GGCATCGCAG ACCGGGGTCG TGCGGGCCGC CGGAGGCCGG
ACGCAGGCGG CCTCATTCGT GACGGCCGCG CTTGCCCTCG CGACGATGCT GCTCCTGTCG
CCGGTCCTGG GCCTCCTGCC GCAGGCAACC CTCGCGGCGG TCGTGATCGT CTACTCGGCC
AGCCTCATCC AGCCGGCGGA GTTCCGGGAC ATCTTCAAAG TGCGGCGGAT GGAGTTCCAC
TGGGCAATCG TGGCGGGGAT CGGGGTGCTC GTCTTCGGGA CGCTTCAGGG TATCACCGTC
GCCATCGTCC TCTCGCTGGT CGGCCTGGGC CTCCAGACGG CCCATCCCCG CATCTCTGTC
ATCGCCCGCA AGCACGGCGC GGACGTGCTG CGCCCCCTGT CGCCGGAGCA CCCCGATGAC
GAGACGTTCG TCGGCCTCCT GATCCTGCGC CCGGAGGGGC GCCTGTACTT TGCCAACGCG
CAGAACGTGG CAGACCGGAT TCGGGCGCTC ATTGCCGAGC ACAAGCCGCG TATCGTCGCC
CTCGACCTCA GCCGTGTGCC CGACATCGAG TATTCGGCGC TGCAGATGCT GCGGGATGGT
GCCCGGCGGA CCAGCATGAC GTTTTGGCTC GTAGGCCTCA ACCCTGACGT CCTGAACATG
GTGCGGCGCG CCGGTTTGGA TCAGGAACTC GGACCGGACC GCCTGTTGTT CAACGCGCGA
ACCGCCATCG AGCGCTACAA GGCGCTTCTG GCGCCCTCGG CGCCTGACTA G
 
Protein sequence
MTGLGSTTRT STTDAPTKLR SDILAGLTAA AVVLPKAMAF ATVAGLPVAV GLYTAFVPTL 
IYGLLGSSHV LSVSSTTTLA ILTAAEIGSV VPDGDPARLV ATTATLTALV GALLLGARLV
KLGFIASFIS VPVLTGFKAG IACVILLDQA PKLLGLHSAK QSFFIDLASL VRHLPETSLP
TLAVAGVTLA VLVGAERLRP HSPVPLVTVA AAVAASWLLG LNAWGVATVG EIPPGLPSVS
MPDLTLVQAL LPGAMGIALM SFTESIAAGR AFVASGDPPI DANRELVATG AANLGGALLG
AMPAGGGASQ TGVVRAAGGR TQAASFVTAA LALATMLLLS PVLGLLPQAT LAAVVIVYSA
SLIQPAEFRD IFKVRRMEFH WAIVAGIGVL VFGTLQGITV AIVLSLVGLG LQTAHPRISV
IARKHGADVL RPLSPEHPDD ETFVGLLILR PEGRLYFANA QNVADRIRAL IAEHKPRIVA
LDLSRVPDIE YSALQMLRDG ARRTSMTFWL VGLNPDVLNM VRRAGLDQEL GPDRLLFNAR
TAIERYKALL APSAPD