Gene Msil_2846 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_2846 
Symbol 
ID7093009 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp3127085 
End bp3128899 
Gene Length1815 bp 
Protein Length604 aa 
Translation table11 
GC content64% 
IMG OID643466157 
Productsulphate transporter 
Protein accessionYP_002363126 
Protein GI217978979 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0659] Sulfate permease and related transporters (MFS superfamily) 
TIGRFAM ID[TIGR00815] high affinity sulphate transporter 1 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value0.145004 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGAACG TCACACTCGC CAATCCTGCC GGACGGCGCA ACTGGTTCAG CGTGGCGACG 
CTGAACATCG TTCGCGCCGA TTTCCTCGCC GGCCTGACAG TCGCCGCTCT GTCGCTGCCG
CAGAGCATGG CCTACGCGCT TCTGGCGGGG GTCGACCCTC GCTTTGGGCT CTATACGGCG
ATCGTCTTTA CAGCGGTGGC GGCGATTTTC GGCTCGTCGC GCCATCTCAT CAACGGCCCG
ACGGGCGCCG TCTCGCTGGT CGTATTCAGC GCCCTCGCCA TTTTCGATCC CGAAGCGCGG
CTCGACGCCT ATGAGGCAAT GTTCCTGCTG ACGCTGATGA TGGGCGCCCT GCAGATCCTG
ATCGCGGTGA CGAGGCTCGG CGACCTCACG CGCTATATTT CGGAATCCGT CGTGACCGGC
TTCATCATTG GAGCGGCGAG CCTTACCATC ATCGGCCAGA TCGCCAATGC GCTTGGGGTG
AAGGCGCAGG GCACTGGCCA TCAGCATGTG CTCGAGCGCC TCTATCTGAC CTTGACGCAG
GACGCGCCCA TCAATCTCAA GGCCGTGACG ATCAGCGGCG GCGCGATTGC GCTGGCTCTC
GTCTCCCGCA GAATCGTCAA GCGATTCAAG CTGCCGCAGC TTGATATGCT TTTCGTCTTC
ATAGCGGTCT CGGTCGCCGC CTATCTCGCC GGCTGGTCGA CCGCCGCGCC CGGCGCAAAG
CCGGCGATCG CGCTGATCGA AGCGATCCCG TCCAGCCTGC CCGGCTTCCA TATTCCCGAA
GTCAAAGCCG CCTGGGCGCT AGACCTCAGC GCCAGCGCCG CCGCCATCGC AGTGCTTGGC
CTGCTCGAAG CTCTGGCGAT CGCCAAGGCG ATTGCCCAGA AGTCAGGCCA GACCCTCGAC
TACAACCGTC AGATCCTGGC CGAAGGGCTT GGCAATCTTG TCAGCGGGTT CTTCCGGGGC
ATGCCGGGCG CCGGGTCGCT GTCGCGAACC GCCATCAATT ACCAGGCGGG CGCGATCACC
CGCTTCTCGG GTCTGTTTAC CGCAGGCTTC GTGGCCGTCA CCGTGCTGAC CCTTGCCCCG
CTGGCGTCCT ACGTCCCGAA GGCGCTGCTT GCCGGACTGC TCATCGTCGC GGCGGCCCGC
CTGTTCGACA TCGAGCGGCT GCGCTACGTT CTGCGCGGAT CGCGCTACGA CGCTGTGCTG
TTGATCGCGA CGGCCTTCGC CGCCATCGCC ATCAACATCG AATTCGCCAT TCTCATCGGC
GCCGCCGTCT CGATCGCCTG GTATGTGACA AGAGCCTCAA GGCTCAAGGC CGCCGAGCTG
GTGGTGACGC CAGAGCGCGT CGTGCGCGGA CGCGTCTCCT CCGATCCGCC GAGCCAGGGC
GTGTTGATCT ATGACTTCGA GGGCGAGCTG TTCTTTGGCG CCGCCCCCGA TTTCGAACGC
TATCTCGAAA CCGCCGCCAA AGAGGCCGAC GCGCAGGGCA TCAACTATAT TGTCCTGCGT
CTGAAACGGG TGCGCAATCC CGATGTCGTG GCGCTCGAAG TGCTCGATCA TTTCCTTTCC
TCCACGAAGG CGAAAGGCCT GACCGTGCTG CTGGCGGGCG TGCGCCCGGA CCTCCTCGCC
GCCCTTGGCA AGATCGGCGT CGCCGATCGC CTGTCCCCCG ATTTCATCTT CATCGAAGAG
GAGCAGGATT ACTCCGCCAC GCTGAAGGCG ATCAGGCGAG CCTATGCGCT CGCCGCAATC
GAAGCGAAAT CCAGGGGCGC GGAGCCGGAA TGGGAGAGTT TCAAGGCCAA CAAGCTCGCC
TATTATCTGG TTTAA
 
Protein sequence
MTNVTLANPA GRRNWFSVAT LNIVRADFLA GLTVAALSLP QSMAYALLAG VDPRFGLYTA 
IVFTAVAAIF GSSRHLINGP TGAVSLVVFS ALAIFDPEAR LDAYEAMFLL TLMMGALQIL
IAVTRLGDLT RYISESVVTG FIIGAASLTI IGQIANALGV KAQGTGHQHV LERLYLTLTQ
DAPINLKAVT ISGGAIALAL VSRRIVKRFK LPQLDMLFVF IAVSVAAYLA GWSTAAPGAK
PAIALIEAIP SSLPGFHIPE VKAAWALDLS ASAAAIAVLG LLEALAIAKA IAQKSGQTLD
YNRQILAEGL GNLVSGFFRG MPGAGSLSRT AINYQAGAIT RFSGLFTAGF VAVTVLTLAP
LASYVPKALL AGLLIVAAAR LFDIERLRYV LRGSRYDAVL LIATAFAAIA INIEFAILIG
AAVSIAWYVT RASRLKAAEL VVTPERVVRG RVSSDPPSQG VLIYDFEGEL FFGAAPDFER
YLETAAKEAD AQGINYIVLR LKRVRNPDVV ALEVLDHFLS STKAKGLTVL LAGVRPDLLA
ALGKIGVADR LSPDFIFIEE EQDYSATLKA IRRAYALAAI EAKSRGAEPE WESFKANKLA
YYLV