Gene Msil_1434 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_1434 
Symbol 
ID7091775 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp1551613 
End bp1553652 
Gene Length2040 bp 
Protein Length679 aa 
Translation table11 
GC content65% 
IMG OID643464772 
Productexcinuclease ABC, C subunit 
Protein accessionYP_002361760 
Protein GI217977613 
COG category[L] Replication, recombination and repair 
COG ID[COG0322] Nuclease subunit of the excinuclease complex 
TIGRFAM ID[TIGR00194] excinuclease ABC, C subunit 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCCCAA CTGACAAAGA CGCTCCCGAG CCGGAGTCGC CGCCAGGCGC CGATAGCCCG 
CTGCCGCCGG AGTTCGATTT CGCCGTCGAG GATGGCGATG AGGAGATCCA GGACTTCGCC
GATCTCGACC TTCCCGAGGA CGACGCCGCC CCCGCCTCGG TGCGGCGCGG CGCGGCCGTC
ATCCGCAGCT TCTGGCGCCA GGCGCCGCAA GGTCCCGGCG TCTATCGCAT GATCGCCGCC
GACGGAGAGG TGCTCTACGT CGGCAAGGCG AAGAGCGTGC GCAAGCGGAT CGCCAGCTAT
ATGCGCCCGC TCGGCCACAA CAACCGGATC GCGCGGATGA TCGCGCTGAC CGCCTCGATG
GTCTTCATCT CGACCAGCAC CGAGACGGAA GCGCTGCTGC TCGAGACGAA TTACATCAAG
CAGATGAAGC CGCGCTTCAA CGTGCTGATG CGCGACGACA AGTCGTTCCC TTACATCCTT
CTGACAGGCG ACCACGCGGC TCCGCAGATC CTGAAGCATC GCGGCGCGCG CAATCGAAAG
GGCGACTATT TCGGGCCTTT CGCCAGCGTA TGGGCGGTCA ACCGCACCAT GAATGCGCTT
GAGCGCGCCT TTCTCCTGCG CTCCTGCTCG GACAGCTATT ATGAAAACCG CACGCGGCCC
TGTCTGCTGC ACCAGATCAA GCGCTGTTCG GCGCCCTGCA CCGGCGAGAT CGACCTTGAC
GACTATCGCC GGCTGGTCGG CGAGGCGCGC GACTTTCTCT CCGGCAAGAG CCGCGCCGTG
CGCGATCTTC TCGCGACCGA AATGACCAGC GCCTCGGACG CGCTTGAGTT CGAGCGCGCC
GCGCGCCTGC GCGACCGCAT CGCCGCTCTC TCCGCCATCC AGGGCGCCCA GGGCGTCAAT
CCAAAAACCG TCGAGGAGGC CGACGTCTTC GCGATCGTCG AGGAAGCCGG GCAATTCTGC
GTCGAAGCCT TCTTTTTCCG CACCTACCAG AACTGGGGCA ACCGCGCCTA TTTTCCGCGC
GCCGACAAGA GCCTGGCCTC CGCCGAGGTG CTCGACGCGT TTTTGGCGCA GTTTTACGCC
GACAAGCCCG CTCCGCGGCT GATTTTGCTC TCGCATGAGA TCGAAAACGG CGCCGTGCTG
AGCGAGGCTC TCTCCCTCCG CACCGGACAT CGAATCGAAA TCGCGCGGCC GCAGCGCGGG
GAAAAGCATG AGCTCGTCGA ACATGCCTGC CAGAACGCAA GGGAGGCGAT GAGCCGCCGT
CTGTCGGAAA CCGCCTCGCA GGAGAAGCTG CTGGCGGCGC TGGCGGCGGC GCTCGGCCTC
TCCGCCCCTC TTCGGAGGGT CGAAATCTAC GACAATTCCC ATATCATGGG GACGAATGCG
GTCGGAGCAA TGGTCGTCGC CGGCCCTGCC GGCTTCATGA AGGCGCATTA TCGCACCTTC
AACATAAAGG GCGAGGATCT CACGCCGGGC GATGACTACG GTATGATGCG CGAAGTTCTG
CGGCGTCGCT TTCTGAGGCT GGCCAAGGAC GAAGCGGCGG CGGACGGGCC CTCGACGCGA
GACGACGATG AAGACATTTT CCCGCAGCGG CCGGATTTGA TCCTGATCGA CGGCGGCCAG
GGTCAATTCG ACGCCGCCAA CGCCATTCTC GATGAATTAT CCGTGACCGG GGTCGCCGTG
GCCGGGATCG CCAAGGGCGT CGACCGCAAC GCCGGCCGCG AAAGCTTTTT CGTCGCAGGC
AAGGCGCCGT TCCGGCTCTC GCCGCGCGAT CCCGCCCTCT ATTTCGTGCA GAGGCTGCGC
GACGAGGCGC ATCGTTTCGC GATTGGGACG CATCGCGCCC GCCGTAAGAA GGAATTTACC
CGCAGTCCGC TCGACGAGAT CGCCGGCGTC GGCCCGGCGC GCAAGCGCGC CCTGCTGCAC
GCTTTCGGCA CCGCCAAGGC GATTTCAAAG GCCGCTTTAT CCGATCTCGA AAAAGTCGCG
GGCGTCAATG CGGCGACGGC GCGGCTCGTT TATAACTATT TCCACGAGGG CGGCGGCTAA
 
Protein sequence
MVPTDKDAPE PESPPGADSP LPPEFDFAVE DGDEEIQDFA DLDLPEDDAA PASVRRGAAV 
IRSFWRQAPQ GPGVYRMIAA DGEVLYVGKA KSVRKRIASY MRPLGHNNRI ARMIALTASM
VFISTSTETE ALLLETNYIK QMKPRFNVLM RDDKSFPYIL LTGDHAAPQI LKHRGARNRK
GDYFGPFASV WAVNRTMNAL ERAFLLRSCS DSYYENRTRP CLLHQIKRCS APCTGEIDLD
DYRRLVGEAR DFLSGKSRAV RDLLATEMTS ASDALEFERA ARLRDRIAAL SAIQGAQGVN
PKTVEEADVF AIVEEAGQFC VEAFFFRTYQ NWGNRAYFPR ADKSLASAEV LDAFLAQFYA
DKPAPRLILL SHEIENGAVL SEALSLRTGH RIEIARPQRG EKHELVEHAC QNAREAMSRR
LSETASQEKL LAALAAALGL SAPLRRVEIY DNSHIMGTNA VGAMVVAGPA GFMKAHYRTF
NIKGEDLTPG DDYGMMREVL RRRFLRLAKD EAAADGPSTR DDDEDIFPQR PDLILIDGGQ
GQFDAANAIL DELSVTGVAV AGIAKGVDRN AGRESFFVAG KAPFRLSPRD PALYFVQRLR
DEAHRFAIGT HRARRKKEFT RSPLDEIAGV GPARKRALLH AFGTAKAISK AALSDLEKVA
GVNAATARLV YNYFHEGGG