Gene Msil_1418 details


Gene Information

Locus tag: Msil_1418
Symbol:
ID: 7091758
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylocella silvestris BL2
Kingdom: Bacteria
Replicon accession: NC_011666
Strand:
Start bp: 1533654
End bp: 1535222
Gene length: 1569 bp
Protein length: 522 aa
Translation table: 11
GC content: 63%
IMG OID: 643464756
Product: transcriptional regulator domain protein
Protein accession: YP_002361745
Protein GI: 217977598
COG category: [S] Function unknown
COG ID: [COG5616] Predicted integral membrane protein
TIGRFAM ID:
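
The start and end coordinates and the two lengths listed above are mutually consistent. The following is a minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon (the record lists the gene as unspliced). The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1533654
end_bp = 1535222

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and contributes no residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1569 522
```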


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 46
Fosmid unclonability p-value:
Fosmid hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCTATC TTTTCGAAGA TTTTGAACTG GACACGGCCA GGGTCGAACT TCGGACGAAT 
GGCGTCGCGA TCGCGATCGA GCCACAGGTC TTTGCGCTGC TGTGTTTCCT CGTCGAAAGC
CGTGACCGGG TCGCGACGAA GGAAGAGATC GTCGCGCGCG TTTGGAACGG GCGGGTCATC
TCCGATTCCG CGATCGCCAG CCGCATCAAA TCGGCGCGCC GCGCGCTCGG TGACGATGGA
GGCGCGCAGC GCCTCATCCG CACAATCCAT GGAATTGGCT TCCGCTTCGT CGCGGATGTC
CGGCTGGCGG CGACGCGCAT CGAGTCCATC GCGCCCACGG CCGAGACGGC GCCGGATACG
GATCGGTCGC AAGCCGTCGA GACGTCGCGG CCGAGCATTG CGGTCCTTCC TTTTCGCCTT
CTCGGCGCGG CGGATCCGCA ATTTTCCATC GGCGACGCGC TTCCTCACGA TCTCATCACC
GAACTGTCGC GGCTACGCTG GCTCTTTGTC ATCGCGAGAG GTTCGTCCTT CCGCTTTCGC
GGCGCGGAGC CGGACGTCGG CCGCGTCCGG ACGGCGCTGA ACGTCCGCTA TTGCCTGTCC
GGCGTCGTAG AGATCCACCA TAGCGCAATG ATCATCTCGG TCGAACTTTC CGACGCCGAG
GACAGCGGCG TCGTTTGGAG CGAGAGATTT CGAACGCAAG CCAGCGCGGT GCATGAGATC
CGCGAAGAGA TCGTCCGCGC CGTGATCAAT GCGCTCGAAT TGCAGATCCC GCTCAACGAG
GCGCGTCGGG CGCGGCTGAA ATCGCCGGAG CGTCTCGACG CATGGTCAGC CTATCATCTC
GGGCTGCATC ATATGTATCG CTTCAACAAA GCCGATAATT CTGTCGCAAC CGCGCTGTTC
GAGCGCGCTG CCGCGATGGA GCCGGGATTT GCGCGCGCCT ATGCCGGCCT TTCGTTCACC
CATTTTCAAA GCGCCTTTCT GAGCTACGCC GACAATGTTT CCGAGGCGGC GAATCTGGCG
CAGCGCGCCG CCGAACAAAG CCTCGAACGC GATCCTGTCG ATCCTTTCGG CAATTTCACC
ATGGGCCGCG CCTTCTGGCT TTGCGGCGAT CTCGACGCCA GTCTTCCCTG GCTGGAGCGC
GCCAATGCGC TCAACCCGAA TTACGCCCAG GCCAAATATT CGCGAGCTTG GGCGCAGGCG
CTGCTCGGCA ACGCTGCGTC GAGTCGCGCG AATGTGGACG AAGCGCTGGC CCTGAGCCCG
CTTGACCCGC TTCTCTATGG CATGTTCGGC GTTCGCGCTT TTTCCCATCT TGTGATGGGA
GAGTCCGCCG AAGCTGCGGA ATGGGCCGAG CGCGCGGCGC GCTCTCCCGG GGCGCACGCC
TTGATCGAGA TGATCGCCGT CCTCGCGCAT GGCCTCAACG GAAACGATGC GCGCGCGAAA
GCGTGGGCCC GTTCCGCGCG CGCTCGGGTT TGCGATCTCA ACAAGGCCGC CTTCCTGCGC
GCTTTTCCGT TTCGCGACCA ACTCATGCTC AAGCGCGTTT CCGACGAGCT TGAGAGGTTC
GGGTTTTAG
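
The GC content field in the record can be recomputed from the gene sequence. The sketch below shows one way to do that, assuming GC content means the percentage of G and C among the A, C, G and T bases, with the spaces and line breaks of the display format ignored. The function name `gc_percent` is hypothetical, and only the first line of the sequence is used here, so the printed value is not the whole-gene figure.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C among the A/C/G/T characters in seq."""
    # Drop spaces, newlines and anything else that is not a plain base.
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc_count = sum(1 for b in bases if b in "GC")
    return 100.0 * gc_count / len(bases)


# First line of the gene sequence only; paste the full sequence to get
# the whole-gene value.
first_line = "ATGATCTATC TTTTCGAAGA TTTTGAACTG GACACGGCCA GGGTCGAACT TCGGACGAAT"
print(round(gc_percent(first_line), 1))
```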
 
Protein sequence
MIYLFEDFEL DTARVELRTN GVAIAIEPQV FALLCFLVES RDRVATKEEI VARVWNGRVI 
SDSAIASRIK SARRALGDDG GAQRLIRTIH GIGFRFVADV RLAATRIESI APTAETAPDT
DRSQAVETSR PSIAVLPFRL LGAADPQFSI GDALPHDLIT ELSRLRWLFV IARGSSFRFR
GAEPDVGRVR TALNVRYCLS GVVEIHHSAM IISVELSDAE DSGVVWSERF RTQASAVHEI
REEIVRAVIN ALELQIPLNE ARRARLKSPE RLDAWSAYHL GLHHMYRFNK ADNSVATALF
ERAAAMEPGF ARAYAGLSFT HFQSAFLSYA DNVSEAANLA QRAAEQSLER DPVDPFGNFT
MGRAFWLCGD LDASLPWLER ANALNPNYAQ AKYSRAWAQA LLGNAASSRA NVDEALALSP
LDPLLYGMFG VRAFSHLVMG ESAEAAEWAE RAARSPGAHA LIEMIAVLAH GLNGNDARAK
AWARSARARV CDLNKAAFLR AFPFRDQLML KRVSDELERF GF
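
The protein sequence corresponds to the gene sequence translated with translation table 11. The sketch below assumes that table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons; this gene begins with ATG, so no start-codon special case is handled. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    # Remove the spaces and line breaks used in the display format.
    seq = "".join(b for b in dna.upper() if b in "ACGT")
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons of the gene sequence.
print(translate("ATGATCTATC TTTTCGAAGA TTTTGAACTG"))  # MIYLFEDFEL
```

Passing the full gene sequence to `translate` should give a string that can be compared with the listed protein sequence once its spaces are removed.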