Gene Msil_1016 details

Gene Information

Locus tag: Msil_1016
Symbol:
ID: 7091844
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylocella silvestris BL2
Kingdom: Bacteria
Replicon accession: NC_011666
Strand:
Start bp: 1102480
End bp: 1103700
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 67%
IMG OID: 643464355
Product: O-succinylhomoserine sulfhydrylase
Protein accession: YP_002361347
Protein GI: 217977200
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0626] Cystathionine beta-lyases/cystathionine gamma-synthases
TIGRFAM ID: [TIGR01325] O-succinylhomoserine sulfhydrylase
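
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are inclusive and that the final codon is a stop codon contributing no residue. The record does not state either assumption, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 1102480
end_bp = 1103700
gene_length_bp = 1221
protein_length_aa = 406

# Assumption: coordinates are inclusive, so both end bases are counted.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and adds no residue.
coding_codons = span_bp // 3 - 1

print(span_bp == gene_length_bp)           # True
print(span_bp % 3 == 0)                    # True
print(coding_codons == protein_length_aa)  # True
```

Under these assumptions the three length fields agree: 1221 bp is 407 codons, of which 406 encode amino acids.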


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value: 0.00284664
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGACGGATT CCTTTGGCGG CTCTAAGCCC TTGCGTCCCG CGACGCTGCT GGTTCATGGC 
GGCGGCGTCC GCACCCCCTT CGGCGAGACG TCTGAGGCGC TGTTTTTGAC GCAGGGCTAT
GTGTTCGCCT CGATGGAAGC CTGCGCCGCG CGCTTTGCCG GCGAAGAGCC GGGCTTTGTC
TATTCGCGCT ACGGCAATCC GACGGTCGCG ATGTTCGAGG GCCGCATGGC GCTGCTCGAA
GGCGCCGAGG CTGCGCGGGC GACCGCGACC GGCATGGCCG CAGTGACGGC CTCGGTGATG
TCGCAGGTCC GGGCCGGCGA TCATGTCGTC GCCGCCCGCG CGCTGTTCGG CGCCTGCCGC
TATATCGTCG AGGACCATCT GCCGCGCTAT GGCGTCGCTT CGACCCTCGT CGACGGCGAT
GATCTTGACC AATGGCGCGC TGCCGTGCGG CTGGAAACCA AGGTCTTCTT TCTGGAAAGC
CCGACCAACC CCTGTCTCGA CGTCTATGAC ATCGCCGCGA TCGCAAAGAT CGCTCATGAC
GCCGGCGCGA TCCTCGTCGT CGATAATGTG TTTGCGACGC CGATGCTGCA AAAGCCGTTG
ACGCTCGGCG CCGACCTCGT CGTCTATTCG GCGACGAAGC ACATCGACGG CGGCGGCCGG
TGTCTTGGCG GCGTCATCCT CGGCGCGAAA GCGCTGGTCG AGGGCGATCT ACAGCAGTTC
TTGCGGCAGA CCGGCCCCGC GCTCTCGCCC TTCAACGCCT GGGTGCTGTT GAAGGCCCTC
GAGACGCTGG CGATCCGCGT CGAACGGCAG ACGAAGAGCG CCGCCCGAAT CGCGGATTTC
TTGAGCGAGC AGCCGGCCGT CGCCTTCGTG CGCTACCCTG GCCGCGCCGA TCATCCGCAC
GCCGACATCG CGCGGCGACA GATGTCCGGC GGCGGCACGC TGGTCGCCTT CGAGATCGTC
GGCGGCAAAC CGGCCGCTTT CGCCTTCGGC CGCGCCCTGA AGCTCATCAA GATTTCAAGC
AATCTTGGCG ACGCCAAAAG CCTGATCACC CATCCGGCGA CGACGACGCA TCACCGGCTG
CCCCCGGAGG CGCGGGCGGC GCTCGGCGTC TCGGAGGGGC TGGTGCGTCT GTCGGTCGGG
CTCGAGGACG AAGAGGACCT GATCGACGAT CTCAAGGCGG CGCTCGACGC GCTCCAACGC
CAGGACATCG CCGCGGAGTG A
 
Protein sequence
MTDSFGGSKP LRPATLLVHG GGVRTPFGET SEALFLTQGY VFASMEACAA RFAGEEPGFV 
YSRYGNPTVA MFEGRMALLE GAEAARATAT GMAAVTASVM SQVRAGDHVV AARALFGACR
YIVEDHLPRY GVASTLVDGD DLDQWRAAVR LETKVFFLES PTNPCLDVYD IAAIAKIAHD
AGAILVVDNV FATPMLQKPL TLGADLVVYS ATKHIDGGGR CLGGVILGAK ALVEGDLQQF
LRQTGPALSP FNAWVLLKAL ETLAIRVERQ TKSAARIADF LSEQPAVAFV RYPGRADHPH
ADIARRQMSG GGTLVAFEIV GGKPAAFAFG RALKLIKISS NLGDAKSLIT HPATTTHHRL
PPEARAALGV SEGLVRLSVG LEDEEDLIDD LKAALDALQR QDIAAE
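
The sequences above are printed in blocks of ten characters, so the spaces and line breaks need to be stripped before any analysis. The following minimal sketch shows one way to do that, then compute a GC fraction and translate the gene. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical and not part of any database tool. The codon table is the standard one, which translation table 11 shares for elongation codons. Table 11 differs mainly in the start codons it permits, which does not matter here because this gene begins with ATG.

```python
# Standard codon table built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First row of the gene sequence, copied from the record.
first_row = "ATGACGGATT CCTTTGGCGG CTCTAAGCCC TTGCGTCCCG CGACGCTGCT GGTTCATGGC"
print(translate(first_row))  # MTDSFGGSKPLRPATLLVHG
```

The translated first row matches the first twenty residues of the protein sequence. Passing the full gene sequence to the same functions gives the complete translation and the overall GC fraction to compare with the fields in the record.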