Gene Mchl_4103 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMchl_4103 
Symbol 
ID7114410 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium chloromethanicum CM4 
KingdomBacteria 
Replicon accessionNC_011757 
Strand
Start bp4324920 
End bp4326575 
Gene Length1656 bp 
Protein Length551 aa 
Translation table11 
GC content67% 
IMG OID643526820 
Productsulphate transporter 
Protein accessionYP_002422828 
Protein GI218532012 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0659] Sulfate permease and related transporters (MFS superfamily) 
TIGRFAM ID[TIGR00815] high affinity sulphate transporter 1 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGGGCG TTAAATCGAC GACACGGGCC CCCTGGAAAG TCGAGCCGGT CGCGTTTCGC 
TTTGACCTCG TGGCAGGGCT CACCGTCGCG GCCGTCGTCC TGCCCAAGGC CATGGCTTAC
GCCACGGTGG CCGGGCTGCC GGTGGCGGTT GGCCTCTACA CCGCCTTCGT TCCCTCGATC
ATCTACGGCC TGCTGGGCTC GTCGCGTGTG CTGAGCGTCA GTTCGACGAC GACGCTAGCC
ATCCTGACCG GGGCCGAGCT CGGTAGCACG GTGCCGGACG GCGACGCGGC GAAGCTTCTC
ATCGCGACGG CAACGCTGAC GGCGCTTGTC GGCGCGCTGC TATTGACGGC ACGCTTGCTG
AAGCTCGGCT TCGTCGCCAG CTTCATCTCC GTGCCGGTCC TGACGGGGTT TAAGGCTGGG
ATCGGCCTGG TGATCCTGCT GGATCAGGCT CCCAAGCTTC TCGGACTCCA CATCGCGAAG
CAATCGTTCT TCGCCGACTT GGCGAACCTC GTCCGACACC TCCCCGAAAC CTCCCTGCCG
ACGCTGACGG TTGCAGGGGC CACGCTGGCC GTGCTCGTCG GCATGGAGCG TCTCAGGCCC
CATTCGCCGG CCCCGCTGGT TACGGTCGCG GGTGCCATCG CGGCCTCGTG GCTGTTGAGC
CTCGGCGCGT GGGGCGTCTC CACCGTCGGG ACGATCCCCC AGGGTCTTCC ATCCCTGACC
CTGCCCGATC CGACGCTCGT CCGGGCGCTG CTGCCGGGTG CGATAGGCAT CGCCCTGATG
AGCTTCACGG AGAGCATCGC GGCAGGGCGG GCCTTCGCGG CTCCAGGAGA TCCGCCCATC
GACGCCAATC GCGAACTGGT CGCCACGGGC GCGGCAAACC TTGGGGGAGC CATACTCGGA
TCCATGCCGG CCGGCGGCGG GACATCGCAG ACCGGCGTCG TGCGATCCGC CGGCGGCCGG
ACGCAGGTAG CCTCTCTCGT GACGGCCGCG GCTGCCCTTG CCACGATGCT GCTCCTGGCT
CCTGTGCTGG GCCTCCTGCC GCAGGCGACC CTCGCGGCGT TGGTCATTGT CTACTCCGTG
GGCCTCATCC AGCCGGCGGA GTTCCGGGCC ATCTACAAGG TGCGGCGGAT GGAATTCCGG
TGGGCGGTCG TGGCGGCTGT CGGTGTTCTC GTGTTCGGGA CACTCCAGGG CATCGTCGTC
GCCATCGTCC TCTCGCTCCT CGGCCTTGCC CTCCAGACCG CGCACCCACG GGTCTCCGTT
ATCGCCCGAA AACGCGGAGC CGATGTCCTG CGTCCCCTCA CGTCCGAGCA CCCGGACGAC
GAGACGTTCG AAGGCCTCCT GATCCTCCGG CCCGAGGGGC GGCTGTACTT CGCCAACGCG
CAAAACGTAG CAGACCGGAT CCGGGCCCTC ATCGCCGAAC ACAAGCCGCG CGTCGTCGCA
CTCGATTTCA GCCGCGTACC CGATATCGAG TATTCGGCGC TGCAGATGCT CCAGGAAGCG
GCACGGCGGA CCAATGTCAC GTTCTGGCTG GTAGGACTCA ACCCGACCGT GCTCGACATG
GTGCGGCGTG CCGGCCTGGA TCGCGAACTT GGAGTGGAAC GACTATTGTT CAACACTCGA
ATGGCAATCG AACGCTACAA GGCTTTTCCG ACCTAA
 
Protein sequence
MKGVKSTTRA PWKVEPVAFR FDLVAGLTVA AVVLPKAMAY ATVAGLPVAV GLYTAFVPSI 
IYGLLGSSRV LSVSSTTTLA ILTGAELGST VPDGDAAKLL IATATLTALV GALLLTARLL
KLGFVASFIS VPVLTGFKAG IGLVILLDQA PKLLGLHIAK QSFFADLANL VRHLPETSLP
TLTVAGATLA VLVGMERLRP HSPAPLVTVA GAIAASWLLS LGAWGVSTVG TIPQGLPSLT
LPDPTLVRAL LPGAIGIALM SFTESIAAGR AFAAPGDPPI DANRELVATG AANLGGAILG
SMPAGGGTSQ TGVVRSAGGR TQVASLVTAA AALATMLLLA PVLGLLPQAT LAALVIVYSV
GLIQPAEFRA IYKVRRMEFR WAVVAAVGVL VFGTLQGIVV AIVLSLLGLA LQTAHPRVSV
IARKRGADVL RPLTSEHPDD ETFEGLLILR PEGRLYFANA QNVADRIRAL IAEHKPRVVA
LDFSRVPDIE YSALQMLQEA ARRTNVTFWL VGLNPTVLDM VRRAGLDREL GVERLLFNTR
MAIERYKAFP T