Gene Msil_1350 details


Gene Information

Locus tag: Msil_1350
Symbol:
ID: 7091688
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylocella silvestris BL2
Kingdom: Bacteria
Replicon accession: NC_011666
Strand:
Start bp: 1454056
End bp: 1455237
Gene Length: 1182 bp
Protein Length: 393 aa
Translation table: 11
GC content: 60%
IMG OID: 643464688
Product: hypothetical protein
Protein accession: YP_002361677
Protein GI: 217977530
COG category: [E] Amino acid transport and metabolism
COG ID: [COG3591] V8-like Glu-specific endopeptidase
TIGRFAM ID:
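
The coordinate and length fields above are mutually consistent, which a short check can confirm. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, although it agrees with the listed gene length) and that the final codon is a stop codon that adds no residue to the protein. The variable names are illustrative only.

```python
# Values copied from the record above; variable names are illustrative.
start_bp = 1454056
end_bp = 1455237

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the final codon is a stop codon
# and contributes no residue to the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the results agree with the Gene Length (1182 bp) and Protein Length (393 aa) fields.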


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value: 0.172226
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAATTCA GCAAAAGCGG CCTTGCAAAA ATGGCCGGCC CAGCGAGTTT ATCTGTGCTG 
ATTTGCGGAC TTGCCTCGCA AGTCCAGGCG CAGGCGGTGT CGTCGTCGCG CGTGCAGCCC
CTTTCCGGCA ACCCCAATGG CGTCATCGAT TTCGCCAACG CAAAGCCAAT GCCGCTTCCT
CGTATCGACA AGGCCCCCGC GCCAGGAGCG CCGCGCGCTC CCTCCGCGGC AAAAAAATCA
TCGGTTGGCG TGGTTCCCGG CTCTGATTCG GGGGACGGCC AGACGAGCCC AATTCAGCTC
GCGCCGGCGA AATCTTCGCT GAAGGCTGTA ACGCCTTCGC TTGGCGTGAC GCAGGCGAAT
GGAGTGACGT CGCAGGCGAA TGGACAGGCC GCTCCCTATC ACCCTTACAC GATCGCCCAG
GCGAGCGCGC TCAAGGACAA AACGCAGAAT TCCTTTCCCT ATCGCGCAGC AGGGAAGCTG
TTCTTCCTGG ACAATGGCCA AACCTTTGTC TGCTCGGCGT CGCTCATCAA GTCCGGACTT
CTTGTCACCG CTGCGCATTG CGTGGCAAAT TTCGGCACGG GCCAATATTA TTCCGACTGG
GTCTATGTCC CGGCCTACGA TAATGGCAAG GCGCCTTATG GGAAATGGAA AGGCGTTCAG
GCTTTCGCGC TGCCAAGCTA TGTGAATGGG AGCGATTCCT GCGCCACTCC CGGCGTCGTC
TGCGAAAATG ACGTCGCGGT CATCACCCTC TACAAAGGCC GGTCGCTGCC GGGCAAGCGG
ACAGGCTGGC TCGGCTACGC ATATGGCGGA TTTGGCGTAA ACCCATCAGG CCAGGCGCAC
ATCACGCAGC TCGGATATCC CGTCGGCCTC GACAAAGGCG CTGTCATGGA GCGTGGCGAT
TCGCAAAGCT ACATTGATCC GGCTAACTCC AACAACACGA TCATCGGATC CAACATGGAC
GGCGGCTCCA GCGGCGGTCC TTGGATCCTC AATTTCGGCA TCGGCCCAAC GTTTACGGGC
GAAGACGCCG GCGTCGATCC AGATCCCAAC ACCGTCGTCG GCGTGACCAG TTGGGGCTAC
ACCGTCACCT CTCCCAAGGA GCAGGGAGCG TCGCCCTTCA CGAGCAATAA TATCGTCAAC
CTTGTCAACG CCGCTTGCGC CGCCTCGCCG TTGGTGTGCT AG
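
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be cleaned before its length or GC content can be computed. The following is a minimal sketch of that step. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used as a sample; pasting in the full sequence would allow a comparison with the Gene Length and GC content fields.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Return the fraction of bases that are G or C (0.0 if empty)."""
    seq = clean_sequence(text)
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence above, used here as a small sample.
first_line = "ATGAAATTCA GCAAAAGCGG CCTTGCAAAA ATGGCCGGCC CAGCGAGTTT ATCTGTGCTG"

print(len(clean_sequence(first_line)))      # number of bases in the sample
print(f"{gc_fraction(first_line):.0%}")     # GC content of the sample
```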
 
Protein sequence
MKFSKSGLAK MAGPASLSVL ICGLASQVQA QAVSSSRVQP LSGNPNGVID FANAKPMPLP 
RIDKAPAPGA PRAPSAAKKS SVGVVPGSDS GDGQTSPIQL APAKSSLKAV TPSLGVTQAN
GVTSQANGQA APYHPYTIAQ ASALKDKTQN SFPYRAAGKL FFLDNGQTFV CSASLIKSGL
LVTAAHCVAN FGTGQYYSDW VYVPAYDNGK APYGKWKGVQ AFALPSYVNG SDSCATPGVV
CENDVAVITL YKGRSLPGKR TGWLGYAYGG FGVNPSGQAH ITQLGYPVGL DKGAVMERGD
SQSYIDPANS NNTIIGSNMD GGSSGGPWIL NFGIGPTFTG EDAGVDPDPN TVVGVTSWGY
TVTSPKEQGA SPFTSNNIVN LVNAACAASP LVC
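
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is illustrative, not the method used to generate this record. It builds the codon table from the compact base order `TCAG`; translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons. The function `translate` is a hypothetical helper. It does not handle alternative start codons (not needed here, since the gene starts with ATG) and raises `KeyError` on ambiguous bases such as N.

```python
from itertools import product

# Codons enumerated in TCAG order (TTT, TTC, TTA, TTG, TCT, ...), paired
# with the amino acid each one encodes; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA sequence codon by codon, stopping at the first stop."""
    seq = "".join(dna.split()).upper()  # drop block spacing and line breaks
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence and the last 12 bases of the record.
first_line = "ATGAAATTCA GCAAAAGCGG CCTTGCAAAA ATGGCCGGCC CAGCGAGTTT ATCTGTGCTG"
print(translate(first_line))       # MKFSKSGLAKMAGPASLSVL
print(translate("TTGGTGTGCT AG"))  # LVC
```

The two outputs match the first 20 and the last 3 residues of the protein sequence listed above.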