Gene Msil_3843 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_3843 
Symbol 
ID7092539 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp4207266 
End bp4208459 
Gene Length1194 bp 
Protein Length397 aa 
Translation table11 
GC content67% 
IMG OID643467128 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_002364087 
Protein GI217979940 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG2807] Cyanate permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.0201188 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCAATC AGCCGCGCCG GTCTTCCAGC GGGCGCGCCG GCGGCCCCTT CGCCGCCATC 
GCCTGCGCGG TCGCGACAGT AGCCATCGTC GGCGTCGGTC TGTCGCTGAC CATGACCCTG
ATCGCCGTCA GGCTCGGCGA ACAGGGTTTT AGCGCGCGCG CAATCGGCAT CAACACCGCG
GCGGCCGGCT TTGCGACGCT CGCAAGCGCC AGTTTCATCC CGGATCTTGC GCGCCGCTTC
GGGGTCGGGC GGCTTTTGTT TGCCGCGCTG ATCTTATGCG TGCTCTGCCT TGCCGCCATG
GCGATCCGCG ACGACTACTG GCTCTGGCTC GGCCTTCGCG CTTTGTTCGG CGTCGGCCTC
ACAGTGCTGT TCGTCCTCAG CGAATATTGG ATCAACGCCG TCGCTCCGCC CGAGCGACGC
GGCGCGATCC TCGGCTTCTA CGCCGCCAGC GTCGCGCTTG GCTTCGCCGC CGGCCCGCTG
ATTCTCGCCT GCGTCGGCAC GGTCGGCTCC GCGCCCTTCC TGATCGCCAT GGCGCTGTTC
GCCGCGGCGG CCTTGCCGAT CGTTATCGGC AGCAAATCGG CGCCGGCGAT CGAAACGCAT
TCCGCGACGC CAGTGCTTGC CTTTCTTCTC GTCGCCCCCG TCGCTACGCT GGCCGGCCTG
CTGCATGGCG CAATCGAGAC GGCAAGCATG GGATTGTTGC CCGTCTTCGC GCTCCGCTCC
GGCCTTGGCG CGGAAACCGG CGCATGGTTC GTCACGCTGT TCGCCCTTGG CAATGTCGCG
TTTCAGTTTC CCGTCGGATT TCTCGCCGAT ATGATCGAGC GTCGCCGCCT GCTCATGATG
ATCGCTCTCG TCAGCCTGAT CGGGGCGATT GCTCTCTCGG CCCTTGAGCC CTCCGCCTCG
CTGCTGTTTG GCGCCCTCCT GCTGATCTGG GGCGGCGTCG CGGGTAGTTT TTACGCCGTC
GCGCTCGGCT ATCTCGGCGC GCGCTACAAA GGGCCGGAGC TCGCGAGCGC GAACGCTGCT
TTTGTCATGC TCTATTCGGG CGGCATGCTG GGCGGGCCTC CGATCATGGG CGCCGGGATG
GACGCGCTCG GGCCGCATGG ATTTTTTCTG GCGATCGCCG CGCTGCTTGC GATCTATCTT
TTGATCGCGC GTCTCGCTGG CCCGCGCGAA GGCTCCGCGA AGCCCCGTTC TTGA
 
Protein sequence
MLNQPRRSSS GRAGGPFAAI ACAVATVAIV GVGLSLTMTL IAVRLGEQGF SARAIGINTA 
AAGFATLASA SFIPDLARRF GVGRLLFAAL ILCVLCLAAM AIRDDYWLWL GLRALFGVGL
TVLFVLSEYW INAVAPPERR GAILGFYAAS VALGFAAGPL ILACVGTVGS APFLIAMALF
AAAALPIVIG SKSAPAIETH SATPVLAFLL VAPVATLAGL LHGAIETASM GLLPVFALRS
GLGAETGAWF VTLFALGNVA FQFPVGFLAD MIERRRLLMM IALVSLIGAI ALSALEPSAS
LLFGALLLIW GGVAGSFYAV ALGYLGARYK GPELASANAA FVMLYSGGML GGPPIMGAGM
DALGPHGFFL AIAALLAIYL LIARLAGPRE GSAKPRS