Gene Msil_2574 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_2574 
Symbol 
ID7093228 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp2808076 
End bp2809152 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content66% 
IMG OID643465889 
Productprotein of unknown function DUF6 transmembrane 
Protein accessionYP_002362859 
Protein GI217978712 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism
[R] General function prediction only 
COG ID[COG0697] Permeases of the drug/metabolite transporter (DMT) superfamily 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value0.839971 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAAGAT CTTCAATGCT TTCGAAAAAT TTGGGACAAA GGTGGCCCGG CGTTCATCTC 
GCGCTTCTAT CGGCGCTCCT GTTTGGCGCG CTGACGCCTC TGTCCAAATT GCTGCTGGGC
TCGCTCGACC CGCAACTGCT CGCCGGCGTC CTCTATCTCG GCGCCGGCGT GGGCCTCGCC
ATCACCCAAA TCGCGCGCGC GTCATTCGGA GCCTCGACGC GTGAGGCCCC TTTGCGCGGC
GCCGATCTGC CGTGGCTTAT TGCCATTGTG CTCTTCGGCG GGTTTCTTGC GCCGCTGGCG
CTGATGCTGG GTCTGGCGCA GACCGACGCG GCCTCGGGCG CCTTGTTGCT CAATCTCGAA
TCCGTCGCGA CGCTGGCGAT CGCCTGGACG CTGTTCCGGG AGAATGTCGA CAGGCGCCTG
CTGCTTGGCG CCTTCGCGAT TCTCGCCGGG GCGGTGCTTT TGTCCTGGAA TGGCGGGAGC
GTCCGGCTCG ATCGCGGGGC CCTGCTGATC GCCGGGGCCT GCCTCGCCTG GGGCCTCGAC
AATAATCTCA CGCGAAAATT GTCGTCGGCG GATCCGGTCC AGATCGCCAT GATCAAGGGC
TTCGCCGCGG GAGGCGCCAA TATCGGCCTC GCGCTGTGGC GCGGCGCGGA GCCGGGCTCG
GCCGGACTCA TGGGCGCCGC CGCTTTGGTC GGCTTTCTCG CCATTGGGGT CAGCCTGGTC
GCTTTCATCC TGGCGCTGCG GCATCTTGGG GCGGCGCGGA CAGGGGCCTA TTTCGCTTTG
GCGCCCTTTA TCGGCGCCCT GCTCGCCGTC CTGCTGCTGC ACGAGCCTCT GACGGCAAAA
CTGGTCGCCG CGGGACTTCT GATGGGCGCT GGCCTGTGGC TGCATCTGGC CGAGCGGCAC
GCGCATGAGC ACACCCATGA GCCGCTGGAG CACGAGCACG CCCATGTCCA TGACGCGCAT
CATAGCCATG GCCATGACGA GCCGATGAGC GAGCCGCACT CGCACTGGCA TAGCCACGCG
CCGCTGACCC ATGCCCACCC GCATTATCCG GATTTGCATC ACCGCCACCG GCATTGA
 
Protein sequence
MARSSMLSKN LGQRWPGVHL ALLSALLFGA LTPLSKLLLG SLDPQLLAGV LYLGAGVGLA 
ITQIARASFG ASTREAPLRG ADLPWLIAIV LFGGFLAPLA LMLGLAQTDA ASGALLLNLE
SVATLAIAWT LFRENVDRRL LLGAFAILAG AVLLSWNGGS VRLDRGALLI AGACLAWGLD
NNLTRKLSSA DPVQIAMIKG FAAGGANIGL ALWRGAEPGS AGLMGAAALV GFLAIGVSLV
AFILALRHLG AARTGAYFAL APFIGALLAV LLLHEPLTAK LVAAGLLMGA GLWLHLAERH
AHEHTHEPLE HEHAHVHDAH HSHGHDEPMS EPHSHWHSHA PLTHAHPHYP DLHHRHRH