Gene Msil_2371 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_2371 
Symbol 
ID7090355 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp2582419 
End bp2584104 
Gene Length1686 bp 
Protein Length561 aa 
Translation table11 
GC content66% 
IMG OID643465693 
Productpeptidase C14 caspase catalytic subunit p20 
Protein accessionYP_002362663 
Protein GI217978516 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value0.0175028 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAAC TCCGCTGTCT TCTTGCGCCG GCGTGGCTTG CGGCGATCCT CACGGGAGTC 
CCGGCGGCTT ATGCGGCGCC GGCGCCGGTC GCCGTGCAAA ATCCAAGCGA AGTCCACGCC
ATTGTGGTCG GCATTGACCA ATATCAGCAC CTTGCCCAGC TTCGGGGCGC CGTGGCCGAC
GCCAAGGATA TCGAAGCCTC GCTGCGCGTC ATGGGCGTCA CGGACATCGC GGCGCTCTAC
GACGACAAGG CCGACCGCGA TTCGATCCTG AAGGCGGTCT ATGACCTCAG CGCGCGGGTC
AAGCGCGGCG ATCTTGTGAT CCTGTCGATT GCCGGCCATG GCGCGCAGGA GCCGGAGCGC
GTCAAGGGAT CGGAGGCGGA CGGCAAGGAC GCCGTCTTTC TGCTTGCCGG ATTCGAGCCG
GCCGGCGCCG GCACGCGCCA GCGCATTCTC GACAAGGAAT TCAATCATCT GATCAAGATC
TTTGAATCGC GCGGCGCGCG CGTCGCCTTC GTCGCCGATT CCTGCTCCGG CGGCGGCCTC
GCTCGCGAAG TCGATCCGCG CGGCGAACAG CTGATCTACC GCGCCGTGCC GACCTACCAG
ATCACCGAAG ACGAGCTGAA GCCGGTGTCG TCGCCGTCCG ACGCCTTTTC GACCGAGCTC
GATTTCGACC GCTCGATTTT CCTCGCCGCC GTCGACAAGC ACTCCAAGGC GCCGGAAGTG
AAGGTCCCGG GCGTCGAGGG CTATCGCGGC GCCTTGAGCT ACGCCTTCGC CCGCGCGCTG
GAAGGCGCGG CCGACGCCAA TGGCGACGGG AAGATCACGG TCGAGGAGCT GTTCGCCTAT
GTGCGGCAGG TGACCTATCA ATTATCTGAC CAGCGTCAGA TGATCGTCTC GGCGAGCCCG
CCGTCCGTCA AGGCGAGCGT CGAGCCGGTG GTGGAGCTGA CGCGCTCTGT CACCTATATC
GCGCCGCCGA GTCCTTCGCC GACGGGAGCC TTGTCCGCCC GCATCGGCGC GGGCGCGGGC
GTCAATCTGC CGCCGAAGAA GCCGACGCTC GCGGTCGAGC GGCCGGTGCG CATTGCGACG
CTCGACGGCT CCAATACGGA GCTGATAGGT CTCGCGCCGC TGGAGGCGAA ATTCGAGATC
GTGTCGCCGC GCCAGAATCC CGAGCTGGTG TGGGATCCGA AATCCGGCGA CGTGATCGCG
GCGGGCGACG TCATCGCGCA TGAGGTCAGC CGCGACGATC TGCCGGCGGT GATCGATCGC
GCTGCCGCCA TCCGCGCCTT GAAGGCAATG GCGACGCGAT CCGTGCAGCC GATCCTGCTC
ATGCCGGACA GCAAGCTACA CCGCCTCGGC GCGCAGGTCG ACGTGTCCAT CGATCAGCTG
AGCGATCGAT CGCTGATCAT GTTCAATATC GCTGGCGACG GCACGGTGCA GCTTCTCTAC
CCGCAGAACG CCGCCGAAGC GGGCCCGATG GAGAAGCCGC AATTCCGCCT GCCGATCAAA
GTGCGGGCGC CCTTCGGCGC CGATCAGATC GTCGCGATCT CGGCGCCGGA TCGCATGCCC
GAGCTCGCCC AGGCGCTCGG CCGGCTGAAC GCGCGCCGCA CCGCGGGTCA ACTCGTCAAA
TTCATCGCGC AGAATTCGGA CGCGGATATG CGCGTTGGTT CGGTCGGCGT GTTCACCTCG
CCCTGA
 
Protein sequence
MKKLRCLLAP AWLAAILTGV PAAYAAPAPV AVQNPSEVHA IVVGIDQYQH LAQLRGAVAD 
AKDIEASLRV MGVTDIAALY DDKADRDSIL KAVYDLSARV KRGDLVILSI AGHGAQEPER
VKGSEADGKD AVFLLAGFEP AGAGTRQRIL DKEFNHLIKI FESRGARVAF VADSCSGGGL
AREVDPRGEQ LIYRAVPTYQ ITEDELKPVS SPSDAFSTEL DFDRSIFLAA VDKHSKAPEV
KVPGVEGYRG ALSYAFARAL EGAADANGDG KITVEELFAY VRQVTYQLSD QRQMIVSASP
PSVKASVEPV VELTRSVTYI APPSPSPTGA LSARIGAGAG VNLPPKKPTL AVERPVRIAT
LDGSNTELIG LAPLEAKFEI VSPRQNPELV WDPKSGDVIA AGDVIAHEVS RDDLPAVIDR
AAAIRALKAM ATRSVQPILL MPDSKLHRLG AQVDVSIDQL SDRSLIMFNI AGDGTVQLLY
PQNAAEAGPM EKPQFRLPIK VRAPFGADQI VAISAPDRMP ELAQALGRLN ARRTAGQLVK
FIAQNSDADM RVGSVGVFTS P