Gene Msil_1894 details

Gene Information

Locus tag: Msil_1894
Symbol:
ID: 7091092
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylocella silvestris BL2
Kingdom: Bacteria
Replicon accession: NC_011666
Strand:
Start bp: 2060778
End bp: 2062349
Gene Length: 1572 bp
Protein Length: 523 aa
Translation table: 11
GC content: 68%
IMG OID: 643465221
Product: protein of unknown function DUF323
Protein accession: YP_002362201
Protein GI: 217978054
COG category: [R] General function prediction only; [S] Function unknown
COG ID: [COG1262] Uncharacterized conserved protein; [COG4249] Uncharacterized protein containing caspase domain
TIGRFAM ID:
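
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a single stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(2060778, 2062349)
print(length_bp)                  # 1572
print(protein_length(length_bp))  # 523
```

Both results agree with the Gene Length and Protein Length fields in the record.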


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 43
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCTTTC CCATTTTGCT GCTATTGCTG AGCCTTGTCG CCGCGCTCGC CTCCGGCGTC 
CGCCCGGCGG GGGCGGCGGA GGGCGCCGGC AAGGGCGTCG CGCTTCTCAT CGCCAACGCC
GCCTATTCCG GGGGACCGGC CTCTGCGACA GTCGTCGCCG GCGCGCAAAA TCTGGCGCGC
GAACTGTCCC GCCTCGGTTT TGCCGTGACG ACCCGGTCCG ATCTCGACCG GGCGGCGATG
CGGCGCGCGA TCGACGGCTT CATCGCCCGG ATCACGCCAG GCAGTCCGGC GTTGTTTTTC
TTCGCGGGCT ATGGCGTCGA GGCAAAGGGA TCGAGCTATC TCATTCCAGT TGACGCCGGA
ATATGGACGG AGGCCGACGT TCGCAAGGAA GGCGTCGGCG TTGACGGCGT TCTCGCGGCG
ATGGCCAAGG CCGGCGCAAA GCCGAATGTG ATGATCCTCG ACGCCTCCCG CCGCAACCCG
TTCGAGCGGC GCTTCCGCAG CTTTTCGGCG GGGCTCGGCG TGATCGACCC GCCGCCGCAG
ACGCTGCTTT TTTCCGCCGC CGAGCCGGGC AAGGTCGTCG GCGATCAGGA CAGCGAGAGC
TCCGTTTTCA TCGCCGAATT GATCAAGGAG CTGAGCGCGC CCGACTCCAC CGCCGACGAT
GTTTTCAATC GCACGCGCCT TGGCGTTTCC CGCGCGACCA ACGGGCAGCA GGCCCCGGTG
ATGAAATCGA CGCTGACCGA GCCCTTTTTC CTCTCGGCGC AGAGCGCGGC CGGCGCAGGC
TTCGCCGATG AAGCGCCGGG CCTTGAGGCA GATGGCGCGG CGTTCGGGCC TCCCTTGGAC
AAGCGGCCCG GCGCCGTCTT TCGCGACTGC GCCGCCTGTC CGAGCCTCGT CATTGTTCCC
GCAGGCGCAT TCACCATGGG CTCCGACGAT TTCGAGGCGG AGCAGCCGGC CCATTCCGTC
TCGATCGCAA AGCCTTTCGC GATGGGCCGG TTTGAGGTCA CCAACGCGCA ATGGGACGCC
TGCGTCACTG GCGGCGGATG CGGCGGCTGG CGTCCGCCAG ATCGCGGCGC AAGCGGGGGC
GGCGTCCCCG TCAGCGAGGT GAGCTTTGTC GACGCAGGCC GCTATCTCGA CTGGCTGTCG
CATAAGACCG GCCGCGCCTA TCGCCTGCCG AGCGAAGCCG AATGGGAATA TGCCGCCCGC
GCGGGAACGA CGAGCCGCTT CTGGTGGGGC GATGAGGTGG GGACCGATCA CGCCAATTGC
CGCGGCTGCG GCGGCCCGGG GCGCCCGCTC GCCGCCGGCT CCTATCCGGC CAATCCATTC
GGCCTTTACG ACACCGCCGG CAATATTGCC GAATGGACGG CCGATTGCTG GACCGCCTCC
TATGCCGGCG CGCCGCGCGA CGGCTCGGCC GTCAGGGCCC CCGCGGGCGG GGGGGCGTGC
AAGCAGCGCG TCGTGCGCGG CGGCTCATTC GACGCCGGCG CGCGCTACGT GCGTTCGGCC
TCCCGCTTTC TCTACGATGC GGAGCTTCGC TATTATACGA ATGGCTTTCG CGTCGTGCGC
GATCTTCCCT GA
 
Protein sequence
MRFPILLLLL SLVAALASGV RPAGAAEGAG KGVALLIANA AYSGGPASAT VVAGAQNLAR 
ELSRLGFAVT TRSDLDRAAM RRAIDGFIAR ITPGSPALFF FAGYGVEAKG SSYLIPVDAG
IWTEADVRKE GVGVDGVLAA MAKAGAKPNV MILDASRRNP FERRFRSFSA GLGVIDPPPQ
TLLFSAAEPG KVVGDQDSES SVFIAELIKE LSAPDSTADD VFNRTRLGVS RATNGQQAPV
MKSTLTEPFF LSAQSAAGAG FADEAPGLEA DGAAFGPPLD KRPGAVFRDC AACPSLVIVP
AGAFTMGSDD FEAEQPAHSV SIAKPFAMGR FEVTNAQWDA CVTGGGCGGW RPPDRGASGG
GVPVSEVSFV DAGRYLDWLS HKTGRAYRLP SEAEWEYAAR AGTTSRFWWG DEVGTDHANC
RGCGGPGRPL AAGSYPANPF GLYDTAGNIA EWTADCWTAS YAGAPRDGSA VRAPAGGGAC
KQRVVRGGSF DAGARYVRSA SRFLYDAELR YYTNGFRVVR DLP
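
The sequences above are printed in space-separated groups of ten, so they need to be joined before any analysis. The sketch below shows one way to do that and to compute a GC percentage; the helper names are hypothetical, and only the first and last printed lines of the gene sequence are used as sample input.

```python
def clean_sequence(block: str) -> str:
    """Join a sequence printed in whitespace-separated groups over several lines."""
    return "".join(block.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First and last lines of the gene sequence as printed in the record.
first_line = "ATGCGCTTTC CCATTTTGCT GCTATTGCTG AGCCTTGTCG CCGCGCTCGC CTCCGGCGTC"
last_line = "GATCTTCCCT GA"

head = clean_sequence(first_line)
tail = clean_sequence(last_line)

print(len(head))              # 60
print(head.startswith("ATG")) # True: the CDS begins with a start codon
print(tail.endswith("TGA"))   # True: the CDS ends with a stop codon
```

Passing the full gene sequence block to `clean_sequence` and then to `gc_percent` gives a value to compare against the GC content field.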