Gene Msil_3893 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_3893 
Symbol 
ID7092590 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp4270528 
End bp4271748 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content64% 
IMG OID643467178 
Productprotein of unknown function DUF214 
Protein accessionYP_002364136 
Protein GI217979989 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG4591] ABC-type transport system, involved in lipoprotein release, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGGGATCC GCTTCCTGCT CGGAAGCCTT GCGCTGCAGA ATCTCGGCCG GCGCAAGGCC 
CGCACCGTGC TGCTGCTGGC CGCCGTCGCG ATCTGCAGCG GCGCCGTCTT CACCGGCGCC
GTCCTGTTGC GCAGCATTGA AAGCAGCATG CTCGTCGGCT TCACGCGCCT CGGGGCCGAC
ATGCTCGTGG TCCCGCAAGG AACGCTCACC AACTTAACGG CGGCGCTGTT GACGGCCGAG
CCCACCGATC TCACGCTCGA AGACAACATG CTCGGCCGGC TGGCGGCGCT GAAAGGCGTC
CGGCGCATCG GCCCGCAATT GATTTTTCGA ACAGACGCCT CCGGCTACGG GCACGGAGAC
GAGCCGGTCG ATTTGATCGC TTTCGATCCC GCCCGTGATA TCACCGTCCA GCCATGGCTC
GACAGTCGCC TTGATCGGCC CATGCGAGAA GGCGACGTCA TCATCGGCGG GCGCCGCGAG
GAGCCGCTCG GCTCCGAAGT GCTGATCTTC GGCAAACCGC TCATCGTCTA TGGAAAGCTC
GGGAAATCCG CGGTGGGGAC GCACGAGCGC GGGCTTTTCA TCGCTTTTTC GACGCTGAAC
GACCTGCGGG AAATCATGGT GAACATCTGC GGGAAAAAGG CGCCGCTCGA GCCTCATAAG
CTATCCGGGG TCCTCGTCGA ACTCGCGCCC GGCGCCACAA CGCAGCAGGT ACGGTTCGCC
ATCCTGGCGA ACTTTCCCGA TGTCAAGGTC ATTGCCGGCG AATCGATGCT GACCTCCATC
CGTCAAAGTC TCACCATCCT GCTCGACGGC GTGCTCGCAC TCATGCTCGT CATGTTCCTC
AGCACGGCAT TGATGGTCGG CGTGTTGTTT TCGGTGATCA TCACGGAGCG GCGCCGCGAA
CTCGGATTGC TCAAGGCGAT CGGCGCCCGT AGCGGGCAGA TCATCGGGAT GCTGCTCACA
GAGGCGGCGC TCGCGACGGC CGCGGGCGGG CTGATCGGCT GCGCGCTCGG CCTGTTGCTG
CTGCGTGGTT TCGAGCATTC GCTCGTCTAC TATCTCGCGA GCGTCGGAGT CCCATTTGTT
TGGCTGAATA CGGGCGCTGT CATGCTGATC GCGTTCTCCT GCGTTCTGCT GGCTTCCGCG
ACCGGGGCGG CGGGCGCATT CTACCCGGCG TGGCGGACCA GCCGCGAGCA GCCCTATGAT
CTCATTCGAT CCGAAGGCTG A
 
Protein sequence
MGIRFLLGSL ALQNLGRRKA RTVLLLAAVA ICSGAVFTGA VLLRSIESSM LVGFTRLGAD 
MLVVPQGTLT NLTAALLTAE PTDLTLEDNM LGRLAALKGV RRIGPQLIFR TDASGYGHGD
EPVDLIAFDP ARDITVQPWL DSRLDRPMRE GDVIIGGRRE EPLGSEVLIF GKPLIVYGKL
GKSAVGTHER GLFIAFSTLN DLREIMVNIC GKKAPLEPHK LSGVLVELAP GATTQQVRFA
ILANFPDVKV IAGESMLTSI RQSLTILLDG VLALMLVMFL STALMVGVLF SVIITERRRE
LGLLKAIGAR SGQIIGMLLT EAALATAAGG LIGCALGLLL LRGFEHSLVY YLASVGVPFV
WLNTGAVMLI AFSCVLLASA TGAAGAFYPA WRTSREQPYD LIRSEG