Gene Msil_0683 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_0683 
Symbol 
ID7093764 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp742549 
End bp744129 
Gene Length1581 bp 
Protein Length526 aa 
Translation table11 
GC content64% 
IMG OID643464017 
Product2-isopropylmalate synthase 
Protein accessionYP_002361016 
Protein GI217976869 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0119] Isopropylmalate/homocitrate/citramalate synthases 
TIGRFAM ID[TIGR00973] 2-isopropylmalate synthase, bacterial type 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value0.725586 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAGACC CCGCCTCCCC GTCGCGCTTC GAGTCCGCGC GAAATTCCGA ATCCGTCGTC 
ATCTTCGACA CCACCTTGCG CGACGGCGAG CAATCGCCCG GCGCGACCAT GTATCTTGAA
GACAAGCTGC AGGTCGCCGA AGTGCTCGAC CATATGGGCG TCGACATCAT CGAAGCCGGA
TTCCCAATCG CCTCCGAAGG CGATTTTGAG GCTGTCTCAG CCATTGCGGA ACGCACCAAA
AACGCCGTGA TCGCCGGGCT CGCCCGCGCT ATCGAAGGCG ACATCGCCCG CTGCGGCGAA
GCGGTCCGCA AAGCGCGCCG GCCGCGTATT CATACTTTCG TCTCGACCTC GCCGATTCAT
CTTCAGCATC AGATGAACAA GAGCGAGGCC CAGGTGCTCG AGATCATCGC CCGGACGGTG
ACGCAAGCGC GCAATCTCGT GGACGACGTC GAATGGTCGG CGATGGACGC GACGCGCACC
CCGATCGACT ATCTCTGCCG CTGCGTCGAG GCGGCCATCC GCGCCGGCGC CACGACGATC
AATCTGCCGG ACACCGTCGG CTACGCCTTG CCCGAAGAAT ATGAGGCGAT GTTCCGCCAA
ATCCGTGAGC GCGTGCCGGA CGCCGACAAG GCTGTGTTTT CGGTGCATTG CCACGACGAT
CTCGGGCTCG CCGTCGCCAA TTCGCTGGCG GGCGTGCGCG GCGGCGCGCG CCAGGTCGAA
TGCACCATCA ACGGTCTTGG CGAACGCGCA GGCAACGCCG CGCTCGAAGA AGTCGTCATG
GCGCTGAAGA CGCGCGGCGA TTCGCTGCCC TATCACACGG AAATCGATTC AACCCTGCTG
ACTCGGGCCT CGAAGCTGGT GTCGGCGGTG AGTTCGTCGC CGGTGCAGTT CAACAAGGCG
ATCGTCGGCC GCAACGCCTT CGCCCATGAG AGCGGCATCC ATCAGGACGG GATGCTCAAG
AACGCCCAGA CCTATGAGAT CATGACCCCG GCCAGCGTCG GCGTCAGCAA GACCTCCCTG
GTGATGGGCA AGCACTCCGG CCGCCACGCC TTCAAGGACA AGCTGCGCGA GCTTGGCTAT
GAGCTCGCCG ACAACGCCCT GCACGACGCC TTCGTGCGCT TCAAGGATCT CGCCGACCGC
AAGAAGGTCG TCTACGACGA GGATTTGATG GCGCTCGTCG ACGATGAAAT CGTGCATGCG
CATGACCGCA TCAAGCTCGT CGACCTCACC GTTTTCGCCG GGACCAAGGG ACCGCAGTCG
GCGGCGCTGA CCCTCGACAT CGACGGAACG CATGTGACGC ATCAGGCGAC CGGCAACGGC
CCGGTCGATG CGATCTTTAA CGCCATCCAG GCGCTGGTGC CGCATGGGGC CGTGCTCGAA
CTGTTTCAGG TTCACGCGGT CACCAAGGGA ACCGACGCGC AGGCGGAGGT TTCGGTGCGG
CTCGCCGAGG ACGGCAAGAC GGTGACGGCC AAGGGCGCCG ACCCCGACAC GCTGGTCGCC
GCGGCGAAGG CCTATATCGC CTCTCTGAAC AAGCTGATGA TGAAGCGCGG CAAGTCGCAG
CGGGACGCGC TCGCCGGATG A
 
Protein sequence
MTDPASPSRF ESARNSESVV IFDTTLRDGE QSPGATMYLE DKLQVAEVLD HMGVDIIEAG 
FPIASEGDFE AVSAIAERTK NAVIAGLARA IEGDIARCGE AVRKARRPRI HTFVSTSPIH
LQHQMNKSEA QVLEIIARTV TQARNLVDDV EWSAMDATRT PIDYLCRCVE AAIRAGATTI
NLPDTVGYAL PEEYEAMFRQ IRERVPDADK AVFSVHCHDD LGLAVANSLA GVRGGARQVE
CTINGLGERA GNAALEEVVM ALKTRGDSLP YHTEIDSTLL TRASKLVSAV SSSPVQFNKA
IVGRNAFAHE SGIHQDGMLK NAQTYEIMTP ASVGVSKTSL VMGKHSGRHA FKDKLRELGY
ELADNALHDA FVRFKDLADR KKVVYDEDLM ALVDDEIVHA HDRIKLVDLT VFAGTKGPQS
AALTLDIDGT HVTHQATGNG PVDAIFNAIQ ALVPHGAVLE LFQVHAVTKG TDAQAEVSVR
LAEDGKTVTA KGADPDTLVA AAKAYIASLN KLMMKRGKSQ RDALAG