Gene Mext_3643 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_3643 
Symbol 
ID5834070 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp4022090 
End bp4023595 
Gene Length1506 bp 
Protein Length501 aa 
Translation table11 
GC content69% 
IMG OID641369436 
Productsulphate transporter 
Protein accessionYP_001641092 
Protein GI163853049 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0659] Sulfate permease and related transporters (MFS superfamily) 
TIGRFAM ID[TIGR01593] toxin secretion/phage lysis holin 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCATCGA CGGTTTCAAC CGCCCGCGGG CGGTTCGGCT CGACTTGGTT CGCCAACTGG 
TTCGCCAATC CTTGGGCCGA CATCCTCTCC GGCATCGTCG TCGCGCTGGC CCTGATCCCG
GAGGCGATCG GCTTCTCGGT GATCGCCGGG GTCGATCCGA AGGTCGGGCT CTACGCCTCG
GTCGTGATCG CCTGCGTCAT CGCCTTTGCC GGCGGGCGCC CGGCCATGAT CTCGGCCGCC
ACCGCCGCCA CGGCGGTGGT GATGGTCGAT CTCGTGCGCG ACCACGGCGT CCAGTATCTC
TTCGCCGCCA CCATCCTGAT GGGCGTGTTC CAGATCCTGG CGGGGCTTGC CCGGCTCGGG
CGGCTGATGC GCTTCGTCTC GCGCTCGGTG ATGACCGGCT TCGTCAACGC GCTGGCGATT
CTGATCTTCC TGGCGCAGCT GCCCGAACTC ACCGGCGTGA CCCCGGCCAC CTACGGGCTG
ATCGCGCTGG GGCTCGCCAT CATCTACGGC TTCCCCAAGC TCACCCGCGC GGTGCCCTCG
CCGCTCGTCG CCATCGCCGT GCTGACGGCG CTGACCGCGT ATCTCGGGCT CGACGTGCGC
ACCGTCGCCC ATCTCGGCGC ACTGCCGACC ACCCTGCCGA GCTTTGCCCT GCCGGACGTG
CCGCTGACCT TCGAGACCCT GCGCATCATC CTGCCCTACT CCGCGACGCT CGCCGCGGTC
GGCCTGTTGG AGAGCCTGCT CACGGCGCAG ATCGTCGACG ACATGACCGA TACCGGCAGT
TCCAAGAACC GCGAATGCAT GGGCCAGGGG CTCGCCAACA TGAGTTCGGC CGTGTTCGGC
GGCATGGGCG GCTGCGCGAT GATCGGGCAA TCGGTCATCA ACGTGTCGTC CGGCGCCCGC
GGCCGGCTCT CGACCCTGGT GGCCGGCAGC TTCCTCCTCG CGCTGCTGGT GCTGTTGCAG
GATCTCCTAG CCATCGTCCC CGTCGCGGCG CTGACGGCAG TGATGATCAT GGTCTCGCTC
AACACCTTCT CCTGGCGCTC GCTCCTGGCG CTGCGCACCA ACCCGCTGCC CTCCTCCCTG
GTGATGCTCG CCACCGTCGC GGTGGTGGTC GCCACCCGCG ATCTGGCCAT CGGCGTGCTC
GTCGGCGTCC TGCTCTCGGG CGTGTTCTTT GCCGGGAAGG TCGCGCGGAT GAGCCGGATC
ACCGCCGAGC TGTCGCTGGA CGGACGCACC CGCACCTACC GGGTGGCGGG CCAGGTGTTC
TTCGCCTCCG CCGGCAGCTT CGCCGAGGCG ATCGATGTCC GCGAGCCGGT GGAGCGGCTC
GTCATCGACG TGCACGCGGC GCATTTCTGG GACATCTCGG CGGTGGGCGC CCTCGACCGC
GTGGTGATGA AGGCCCGCGC CGCCGGTCGA ACCGTCGAGG TCGTGGGCCT CAACGAGGCC
AGCGCCACCC TGGTCGAGCG CTTCGGCCAG CACGACAAGG CGGACGCCTC GCTACCGGCG
CATTGA
 
Protein sequence
MSSTVSTARG RFGSTWFANW FANPWADILS GIVVALALIP EAIGFSVIAG VDPKVGLYAS 
VVIACVIAFA GGRPAMISAA TAATAVVMVD LVRDHGVQYL FAATILMGVF QILAGLARLG
RLMRFVSRSV MTGFVNALAI LIFLAQLPEL TGVTPATYGL IALGLAIIYG FPKLTRAVPS
PLVAIAVLTA LTAYLGLDVR TVAHLGALPT TLPSFALPDV PLTFETLRII LPYSATLAAV
GLLESLLTAQ IVDDMTDTGS SKNRECMGQG LANMSSAVFG GMGGCAMIGQ SVINVSSGAR
GRLSTLVAGS FLLALLVLLQ DLLAIVPVAA LTAVMIMVSL NTFSWRSLLA LRTNPLPSSL
VMLATVAVVV ATRDLAIGVL VGVLLSGVFF AGKVARMSRI TAELSLDGRT RTYRVAGQVF
FASAGSFAEA IDVREPVERL VIDVHAAHFW DISAVGALDR VVMKARAAGR TVEVVGLNEA
SATLVERFGQ HDKADASLPA H