Gene Msil_1021 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_1021 
Symbol 
ID7091849 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp1107758 
End bp1108924 
Gene Length1167 bp 
Protein Length388 aa 
Translation table11 
GC content65% 
IMG OID643464360 
Productsuccinyl-diaminopimelate desuccinylase 
Protein accessionYP_002361352 
Protein GI217977205 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0624] Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases 
TIGRFAM ID[TIGR01246] succinyl-diaminopimelate desuccinylase, proteobacterial clade 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.000463021 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGTCGGATC TCGCCCATAC CGCCGTCGAA TTGTGCCGCG AGCTGTTGCG GCGCCCCTCT 
GTGACGCCGC TCGACGCCGG CGCCCAGGAC TTTCTGGCGG CAAAGCTTCG CGAGGCCGGA
TTTGCCACGC ATAGCGTCGT TTTCTCGGAT GAGAGCACGC CCGATATTCA AAATCTCTAC
GCCCGCGCCG GCGCCGGCGG CCGGCATCTC GTCTTCGCCG GACATACCGA CGTCGTGCCG
CCGGGAGATT CGGCGAGCTG GCGCTTCGAT CCGTTCGGCG GCGAAATGGA GGGTGGCCTG
ATTTTCGGCC GCGGCGCCGT GGACATGAAG GGCGCGATCG CCGCTTTCGC CGCCGCCGCC
ATGGCTTTCG TCGCCGAGGG CGGCGCGCAA AAAGGGTCGA TCAGCTTCCT TATCACGGGC
GACGAGGAAG GTCCGGCGAT CAACGGCACC GACAAGCTGC TGCGCTGGGC GCATCAGCGC
GGCGAGCGCT TCGATCATTG TATTCTCGGC GAGCCGACCA ACCAGCAAGC ACTCGGCGAC
ATGATCAAGA TCGGTCGGCG CGGTTCGCTG AACGGGACGC TGACCGTGAA AGGCGTGCAG
GGCCATGTCG CCTATCCGCA TCGCGCCAAA AATCCCATTC CACACCTGAT GCGGCTGCTG
GCGGCGCTCA CGGCGGAGCC GCTCGACCAG GGCACGGAGC TGTTCGACGC CTCCAATCTC
GAAATCGTCA GCGTCGACGT TGGCAATCCG ACCTTCAACG TCATTCCGGC CGAGGCGCGG
GCGCGCTTCA ACATCCGCTT CAACGATATC TGGACGCCGG ATGCGCTCGC GGCCGAGCTA
CGCGCGCGCG CCGAGAAGGC GGGGGCGGCA GCGGGCGCCG CTTCCGCGCT CCATTTTGAA
CCCTGCAATG CGCTGGCCTT CGTGACGCAG CCCGACGCGT TTACCGATCT TGTCAGCGCG
GCGATCGAAC AGGCGACCGG ACGCAAACCA AAGCTCTCGA CGAGCGGGGG CACGTCCGAC
GCCCGGTTCA TCCGCGCCTA CTGCCCCGTT CTCGAATTCG GCCTTGTCGG CTCGACCATG
CACGCCGTCG ACGAACGCGC GCCGGTCGAG GATATCTCTG CGCTTGCGTC AATTTACGCA
GACATATTGA ATTCTTACTT TAAGTAG
 
Protein sequence
MSDLAHTAVE LCRELLRRPS VTPLDAGAQD FLAAKLREAG FATHSVVFSD ESTPDIQNLY 
ARAGAGGRHL VFAGHTDVVP PGDSASWRFD PFGGEMEGGL IFGRGAVDMK GAIAAFAAAA
MAFVAEGGAQ KGSISFLITG DEEGPAINGT DKLLRWAHQR GERFDHCILG EPTNQQALGD
MIKIGRRGSL NGTLTVKGVQ GHVAYPHRAK NPIPHLMRLL AALTAEPLDQ GTELFDASNL
EIVSVDVGNP TFNVIPAEAR ARFNIRFNDI WTPDALAAEL RARAEKAGAA AGAASALHFE
PCNALAFVTQ PDAFTDLVSA AIEQATGRKP KLSTSGGTSD ARFIRAYCPV LEFGLVGSTM
HAVDERAPVE DISALASIYA DILNSYFK