Gene Mfla_1992 details


Gene Information

Locus tag: Mfla_1992
Symbol:
ID: 4000801
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacillus flagellatus KT
Kingdom: Bacteria
Replicon accession: NC_007947
Strand:
Start bp: 2128481
End bp: 2130160
Gene Length: 1680 bp
Protein Length: 559 aa
Translation table: 11
GC content: 58%
IMG OID: 637938909
Product: sulphate transporter
Protein accession: YP_546100
Protein GI: 91776344
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0659] Sulfate permease and related transporters (MFS superfamily)
TIGRFAM ID: [TIGR00815] high affinity sulphate transporter 1
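
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes one terminal stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which is not translated.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(2128481, 2130160)
    print(length)                  # 1680, matching "Gene Length"
    print(protein_length(length))  # 559, matching "Protein Length"
```

Under these assumptions the record is internally consistent: 2130160 - 2128481 + 1 = 1680 bp, and 1680 / 3 = 560 codons, of which 559 encode residues.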


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.998362
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGTCCGCAG ATTTCTTTAC ACGCACTGAT ATCATGATCG CCATTCTCGA AGCCAAGCAC 
GCCGGCCTGC TCTCTCGTCA ATATTGGTCC CGCAACATCA TGTCCGGATT GATCGTCGGC
GTTGTGGCCC TTCCGCTGGC CATGGCCTTT GCGATCGCCT CCGGCGCCAA GCCTGAACAA
GGGTTATATA CGGCGATAGT GGCAGGCACA CTCGCCTCCC TGTTCGGCGG CAGCCGGCTG
CAGATTACCG GGCCGACGGG GGCATTCATC GTCATTCTCT CCGGCATTAC AGCGAAATAT
GGCATTGCCG GCCTGCAGAT CGCCACCCTG ATGGCAGGCG CAATGCTGTT GCTCATGGGG
CTGGCACGCA TGGGAAGCGT GATTCGGTAT ATCCCGGAAC CTGTGATCAT CGGCTTTACC
AGCGGAATCG CCGTCATTAT TTTCGTAGGG CAGTGGCAGG ATTTTTTTGG CCTTGCTCCA
CAAGCATCGC ATCATTTTCA TGAAAAGCTG TTGCACCTGG TCGAAGCGCT GCCGCAGCTG
CAATGGCAGA CTTCATTACT TGGCGCGTTC ACGCTGGCAG TGCTGGTCCT GTCCAACCGG
TACTTGAAAA AGATTCCTGG TCCCCTGGTC GCCATCGTGG CAGCCACTGT CGTGCAGTCC
GTCTTCCAAT TTCAGCAAGT TGCTACCATT GGCAGCGCAT TCGGCGGCAT TCCGCAGGCA
TTACCGCATT TTTCCCTCCC CGAAGGCATC GGCCCCGCTG TCATGATCGA GCTGATCGGC
CCAGCGTTCA CTATTGCATT GTTGGGCGCC ATCGAATCAC TGCTGTCAGC CGTCGTGGCG
GACAATATGG CAGGAACACG CCACCAGCCG AACCAGGAGC TGATTGGACA AGGCATTGCC
AATATCGCCT CGCCATTGTT TGGCGGCTTT GCCGCCACCG GCGCGCTGGC GCGTACCGCC
ACCAACATCC GCAACGGAGC CAACAGCCCG TTGTCCGGCA TCGTACATGC CGTCACTCTG
GTGTTGATCG TGGTCTTGCT GGCCCCATTG GCCGTGAATG CTCCCCTGGC GGCGCTGGCG
GCCATCTTGT TCGTGGTGGC CTATAACATG AGCGAAGTCC ATCGCTTCCT GCATATTGCC
AGGACTGCGC CGCGGGCGGA TGTCGCCGTG CTGCTGATCA CTTTCCTGCT TACCGTATTC
AGCGACCTGG TCATTGCCGT CAATATCGGC GTCGTACTTG CCGCACTGCT GTTCATGAAA
CGCATGGCGG ACACGGTGAA TGTGACGCAG CTGAGTGATG ACGACTTGCA GCAGGAATAT
GGCCCGCATG CATGGCACCT GCCGCCGGGC ACACTGGTAT ACCGCCTGGA AGGACCTTTT
TTCTTTGGAG CGGCCGAGCA CCTGCAGCGC CGCCTGCAAA CCATAGGCGA GAACACCGAC
ACCATCGTGC TGCGTATGGC GCGCGTGCCG ATCATGGATG CTACCGGTCT GCAGGCGTTG
TGGAACCTGC TGGATACATG CAAGCAACAC AATATTCGAC TAGTGATTGT GGAAGCCAGG
CCCAATATTC TGGAGAAGCT GCGCCGCTCG GGGATTATCG GCCAGATAGG CCCGCATCAT
GTGCTGCCCC ACCTGCACTT ACTATGGCAG GAGGCACCTT CCTCGCCCTT GCCTGGTTGA
 
Protein sequence
MSADFFTRTD IMIAILEAKH AGLLSRQYWS RNIMSGLIVG VVALPLAMAF AIASGAKPEQ 
GLYTAIVAGT LASLFGGSRL QITGPTGAFI VILSGITAKY GIAGLQIATL MAGAMLLLMG
LARMGSVIRY IPEPVIIGFT SGIAVIIFVG QWQDFFGLAP QASHHFHEKL LHLVEALPQL
QWQTSLLGAF TLAVLVLSNR YLKKIPGPLV AIVAATVVQS VFQFQQVATI GSAFGGIPQA
LPHFSLPEGI GPAVMIELIG PAFTIALLGA IESLLSAVVA DNMAGTRHQP NQELIGQGIA
NIASPLFGGF AATGALARTA TNIRNGANSP LSGIVHAVTL VLIVVLLAPL AVNAPLAALA
AILFVVAYNM SEVHRFLHIA RTAPRADVAV LLITFLLTVF SDLVIAVNIG VVLAALLFMK
RMADTVNVTQ LSDDDLQQEY GPHAWHLPPG TLVYRLEGPF FFGAAEHLQR RLQTIGENTD
TIVLRMARVP IMDATGLQAL WNLLDTCKQH NIRLVIVEAR PNILEKLRRS GIIGQIGPHH
VLPHLHLLWQ EAPSSPLPG
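
The relationship between the gene sequence, the protein sequence, and the "Translation table" and "GC content" fields can be illustrated with a short script. This is a sketch rather than the method used to generate this record: the helper names are hypothetical, the codon assignments are the standard ones (which translation table 11 shares), and the set of alternative start codons is taken from NCBI table 11 as an assumption. It strips the spaces and line breaks used in the 10-base display blocks before processing.

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11 uses the same
# amino acid assignments and differs only in the permitted start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumed start codons for table 11; an initiating codon is read as Met.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the whitespace used in the blocked sequence display."""
    return "".join(seq.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(dna: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"  # alternative start codon (here TTG) is read as Met
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First two blocks of the gene sequence above.
    print(translate("TTGTCCGCAG ATTTCTTTAC"))  # MSADFF
```

The first codon of the gene is TTG rather than ATG, which is why the start-codon handling matters: read as an initiator it gives M, so the first two blocks translate to MSADFF, the start of the protein sequence listed above. Passing the full gene sequence to `gc_content` and `translate` would allow the "GC content" and "Protein sequence" fields to be checked in the same way.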