Gene Msil_2221 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_2221 
Symbol 
ID7091343 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp2400882 
End bp2402501 
Gene Length1620 bp 
Protein Length539 aa 
Translation table11 
GC content68% 
IMG OID643465542 
ProductLeucyl aminopeptidase 
Protein accessionYP_002362517 
Protein GI217978370 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0260] Leucyl aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value0.258541 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGCCAT CCGTAAAAAT TCAGTTCGCG CCTTTGGACA AGCTGGCGTT GGCCGCGCCC 
GGCGCGAGCG GAGCCCCGAG CGATTCGGCC AAATCGGCGC AAACGCTGGT GATCTTCGCC
GGGCCCGATC TGAAGCTCGG GGCGGCAACG CTGAAGCTTA TCGGCGCGGA GGCGGAGGCG
CTGATCCGGC GCGGCGCGGC CACGGCCAAA TTCAAGGGAA AAGTCTCCTC GGCGCTGGAT
CTGATCGCGC CGGCGGGCAT TGCCGCCGAC CGGCTGCTGG TGATCGGCGC GCCAGGCGAG
GAGGCCGCCG AGCAGAAGAC GCCCGAGGCC GGCAAGCCTG AGGCGGCGGC TCCCGCGAAA
CCGGCAGCCC CGCCGACCCT CTCCGATTAC GCCAACCTCG GCGGCGTCGT CGGCGGCAAG
CTTGGACGCG GCGCCGCGGC GACGATTGTG TTCGACCTGC CGCGCGCCCC CGAGGACGCC
GCGGCGGCGG CGGCGGAATT CGCGCTTGGC CTGCAACTGC GCGACTATCG CTTCGACCGC
TACAAGACCA AGAAAAAGGA CGACGCCGAC GAGAATGGCG TCAGCGAGAT CGTCGTCGCG
CTTGCCGATC CCGAGGCGGC GCGTGAAAAG GCCGCAGGCC GGGAAGCGGT CGCCGCGGGC
GTCATCACCG CCCGTTCGCT GGTCAATGAG CCGGCCAATA TTCTCTTTCC CGAAGAATTC
GCCGCGCGCG CCAAGGAGCT CGAAAAGCTG GGCGTCGAGG TCGAGATTCT CGACGAGCCG
GCGATGCAGG CGCTCGGCAT GGGCGCCCTG CTCGGCGTCG GTCAGGGCTC GTCGAGGCAA
AGCCGGGTCG TCGTCATGCG CTGGCGCGGC GCCGGCGAGG GCGGCGACTC GAAGCCGATC
GCCTTCGTCG GCAAAGGCGT CACCTTCGAC ACCGGCGGCA TTTCGATCAA GCCGGCCGCC
GGCATGGAGG ACATGAAGGG CGATATGGCC GGCGCCGCCT GCGTCGTCGG GCTGATCGAG
GCGCTTGCCG CGCGCAAGGC CAAGGTCGAC GCCATCGGCG CCATTGGCCT CGTCGAGAAC
ATGCCGGGGC CGGACGCGCA GCGTCCGGGC GACATCGTCA AATCCATGTC GGGCCAGACC
ATCGAAATCA TCAACACCGA CGCGGAAGGG CGCCTCGTGC TTGGCGACGT GCTCTGGTAT
GTGCAAGACC GCTTCAAGCC GAAATTTATG ATCGACCTTG CGACCTTGAC CGGCGCCGTG
CTCGTCGCGC TCGGCCAAGA GCACGCGGGG CTCTTCACCA ATGACGACGA CCTTGGCGAA
AAGCTCCTCG CCGCCGGCAA GGCGACCGGC GAAAAGCTCT GGCGCCTGCC GCTCGCCCCC
GCATATGACA AGATGATCGA TTCGAAATTC GCCGACATGA AGAACACGGG CGGGCGCCAC
GCCGGCTCGA TCACGGCGGC GCAGTTCCTG CAGCGCTTCG TCAACGGGAC GCCCTGGGCT
CACCTCGATA TCGCCGGCAC GGGCATGAGC TCGCCGTCGA GCGACGTCAA TCAGAGCTGG
GGCTCGGGCT TTGGCGTGCG GCTGCTCGAC CGTCTCGTCT CGGACAATTA CGAATCCTGA
 
Protein sequence
MPPSVKIQFA PLDKLALAAP GASGAPSDSA KSAQTLVIFA GPDLKLGAAT LKLIGAEAEA 
LIRRGAATAK FKGKVSSALD LIAPAGIAAD RLLVIGAPGE EAAEQKTPEA GKPEAAAPAK
PAAPPTLSDY ANLGGVVGGK LGRGAAATIV FDLPRAPEDA AAAAAEFALG LQLRDYRFDR
YKTKKKDDAD ENGVSEIVVA LADPEAAREK AAGREAVAAG VITARSLVNE PANILFPEEF
AARAKELEKL GVEVEILDEP AMQALGMGAL LGVGQGSSRQ SRVVVMRWRG AGEGGDSKPI
AFVGKGVTFD TGGISIKPAA GMEDMKGDMA GAACVVGLIE ALAARKAKVD AIGAIGLVEN
MPGPDAQRPG DIVKSMSGQT IEIINTDAEG RLVLGDVLWY VQDRFKPKFM IDLATLTGAV
LVALGQEHAG LFTNDDDLGE KLLAAGKATG EKLWRLPLAP AYDKMIDSKF ADMKNTGGRH
AGSITAAQFL QRFVNGTPWA HLDIAGTGMS SPSSDVNQSW GSGFGVRLLD RLVSDNYES