Gene Msil_3716 details


Gene Information

Locus tag: Msil_3716
Symbol:
ID: 7093070
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylocella silvestris BL2
Kingdom: Bacteria
Replicon accession: NC_011666
Strand:
Start bp: 4076159
End bp: 4077190
Gene Length: 1032 bp
Protein Length: 343 aa
Translation table: 11
GC content: 65%
IMG OID: 643467002
Product: Bile acid:sodium symporter
Protein accession: YP_002363961
Protein GI: 217979814
COG category: [R] General function prediction only
COG ID: [COG0385] Predicted Na+-dependent transporter
TIGRFAM ID:
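
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration: the variable names are hypothetical, and it assumes the coordinates are 1-based and inclusive and that the CDS ends in a single untranslated stop codon.

```python
# Values copied from the Gene Information fields above.
start_bp = 4076159
end_bp = 4077190
gene_length_bp = 1032
protein_length_aa = 343

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted in the protein length.
codons, remainder = divmod(span_bp, 3)
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp, remainder == 0, expected_protein_aa == protein_length_aa)
```

Under those assumptions the listed gene length and protein length agree with the start and end positions.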


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value: 0.0440669
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTCGACC GGCAGCTCTT GTCCCGCTTG GGGATCGATC CCTATCTTTT CGCGCTCATC 
GCCACGGTGA CGCTCGCGCT GATTTTTCCC GCGCGCGGCG CCGCGGCGGA AGTGGCGGGC
TACGCGGCCT ATGGCGCGGT CTCGCTCTTG TTCTTCCTTT ATGGCGCGCG CCTCGCGCCG
CGTGCCGTCA TCGAGGGATT TTCCCATTGG CGGCTGCAAT CGACAGTGCT GTTCCTGACA
TTTGTTCTGT TTCCGGCGAT CGGTATCGCC CTCACGGCGG CTCTGCGCCC CTTCCTGTCG
CCGCCGCTTG CGGTCGGCCT GCTTTACCTT TGCCTGATGC CCTCGACGAT CCAGTCCTCG
ATCGCCTTCA CCTCAATTGC CCGCGGCAAT GTCGCGGCGG CGCTTTGCAG CGCTTCGGCC
TCCAACGTGC TCGGCGTTTT CATCAGCCCG ATGCTTGTCG CTTTGCTTTT GTCGACGCAG
AGCCACGGCT TCAACGTCGC GGCCGTGGAG GATGTGGCCT TGCAGCTTCT TCTGCCTTTC
GCCCTCGGGC AGCTCGCCCG GCCGCTGATC GGCCGCTGGC TCCTGGCGCA TAAGGTGATG
ACGTCGATCG TCGATCGCGG CTCGATCCTG CTGATCGTCT ATGTCGCCTT CGCCGAGGGG
ACCGCCGCCG GCGTCTGGGC GCAGCTCAGC TGGCAGGGAC TGGCGCTGAT TCTTGCGCTC
GACTGCCTCA TTCTGGCGCT TGTCCTCGTC GCGTCGACGC TCCTCAGCCG CCGGCTTGGT
TTTTCGAAAG AGGATGAGAT CGCCATCGTC TTCTGCGGCT CGAAAAAAAG CATGGCGGGC
GGCGTGCCGA TGGCGAGCAT CCTGTTTCCG GGGCAGCCGC TCGGCCTCAT CGTGCTGCCG
CTGATGCTAT TTCATCAGGT GCAGCTGTTC GCCTGCGCCA TTCTGGCCCA GCGCTATGCC
CGCCGTCCCG CGGCGCCCGT GCGCGCCGAC GCGCAGATTA CGCCGCCAGA GCAGCGTCTG
GCCGCCGAAT AG
 
Protein sequence
MFDRQLLSRL GIDPYLFALI ATVTLALIFP ARGAAAEVAG YAAYGAVSLL FFLYGARLAP 
RAVIEGFSHW RLQSTVLFLT FVLFPAIGIA LTAALRPFLS PPLAVGLLYL CLMPSTIQSS
IAFTSIARGN VAAALCSASA SNVLGVFISP MLVALLLSTQ SHGFNVAAVE DVALQLLLPF
ALGQLARPLI GRWLLAHKVM TSIVDRGSIL LIVYVAFAEG TAAGVWAQLS WQGLALILAL
DCLILALVLV ASTLLSRRLG FSKEDEIAIV FCGSKKSMAG GVPMASILFP GQPLGLIVLP
LMLFHQVQLF ACAILAQRYA RRPAAPVRAD AQITPPEQRL AAE
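
The relationship between the gene sequence and the protein sequence can be spot-checked with a small translation routine. This is a minimal sketch, not the database's own method: `translate_cds` is a hypothetical helper, and it uses the standard codon-to-amino-acid assignments, which translation table 11 shares (table 11 differs in which codons may act as start codons, and alternative starts are not handled here).

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used above) is ignored, and any
    trailing partial codon is skipped.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 20 bases of the gene sequence and its final 12 bases, copied from above.
head = translate_cds("ATGTTCGACC GGCAGCTCTT")
tail = translate_cds("GCCGCCGAAT AG")
print(head, tail)  # MFDRQL AAE
```

The two fragments translate to the first six and last three residues of the listed protein sequence. Passing the full gene sequence to the same function should give the complete protein under the same assumptions.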