Gene Msil_1990 details


Gene Information

Locus tag: Msil_1990
Symbol:
ID: 7094188
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylocella silvestris BL2
Kingdom: Bacteria
Replicon accession: NC_011666
Strand:
Start bp: 2160579
End bp: 2161964
Gene Length: 1386 bp
Protein Length: 461 aa
Translation table: 11
GC content: 68%
IMG OID: 643465316
Product: transcriptional regulator, GntR family with aminotransferase domain
Protein accession: YP_002362294
Protein GI: 217978147
COG category: [E] Amino acid transport and metabolism; [K] Transcription
COG ID: [COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs
TIGRFAM ID:
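
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends with a single stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 2160579
end_bp = 2161964
gene_length_bp = 1386
protein_length_aa = 461

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS includes one terminal stop codon that is not translated.
codon_count = span_bp // 3
expected_protein_aa = codon_count - 1

print(span_bp, expected_protein_aa)  # 1386 461
```

Under these assumptions the span matches the stated gene length, is a whole number of codons, and gives the stated protein length.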


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 58
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCTGACT GGACCCCCGA CCTCACCGGC AGCGACAAGC CGCGTTATCT CGCCATCGCC 
GATCTCATCG CCGAGGACCT CCGCAGCGGC CGGCTCTCGA TTGGCGATCG TCTGCCGCCG
CAGCGCCAGC TCGCCGCCCG GCTCGGGGTT GACTTCACCA CAGTCGCGCG CGGCTATGTC
GAGGCGAAAA AGCGCGGTCT TGTTGAATCG CGCGTCGGCC GCGGCACCTT CGTCTGCGCG
CCGCCGCGTC AGGCGCCGCC GCCTCCAAGG CCTGCGCGCA GCGACTTCGT CGATCTGTCG
ATGAATTTGC CTCCCGAACC GGACGATCCG GCCCTGATTG CGCGGATGCA GGATGGCGTG
GCGGAGGTCA GCCGCGATCT CGTTTCGCTG CTACGTTATC AGGCCTTCGG CGGCTCTCCG
GCGGACAAGG ACGCCGCCTC CGCCTGGCTC GGCCGGCGCT CGCTCGTTCC CTCGCAGGAT
CGTCTGTTCG TGACGCCGGG CGCGCATCCG GCCCTGCTTG GAATCTTCAG CATTCTGGCG
GCCCCCGGCG ACGTCGTCCT GTGCGAGGAA TTGACCTATC CCGGCATGCG CGCCATCGCC
GCGCAGCTGC GCCTCAAACT GGTCGGCCTG CCGATGGACG CGGACGGCGT CGACCCTGAC
GCCTTCAAAT CCTCCTGCGA GACGCTGAAG CCGAAGGCGA TCTATCTCAA TCCGACGCTG
CACAATCCGA CGACGCTCAC CATCCCCGCG ACGCGGCGCG TCGCGATCGC CGCGGTTGCG
CGGCGCTACA ATGTCCCGAT TGTCGAGGAC GACGCCTATG GCTTCATTCC CACGCAAGGC
CAGCCGCCGT TTGCGGCCAT TGCGCCCGAC CTCACCTGGC ATGTGGCGGG CCTTGCCAAA
TGCATAGGCG CGGGCCTGCG CGCCGCCTAT GTTGTCGCGC CCGACGCGCG CTCCGGCTGG
CCTTTCGCCG CCGCCATGCG GGCGGCCAAT GTCATGGCCT CGCCGCTGAC GGCGGCCATC
GCCACGCGTT GGATCGAGGA CGGCGCCGCC GACACCATTT TGCGGTTCAT CCGCGCCGAG
ACCGCGGCGC GTCAGCAACT CGCCGCCGAC ATTCTGCCAA AGGGCAGCTT CCGCTCCGAT
CGCCTCAGCT TCAATCTCTG GATGGAGCTG CCAAAACCCT GGACCCGTTC GGCCTTCATC
GGCCACATGG GATCGACCCG GATCGGCGTC GTCGCCAGCG ACGCCTTCAC CGTCGGCGGC
GATCCGATCG AGGCGATCCG CATCTGCATC GGCGGTCCGA CCGGACGCGA AGAGATCCGC
TCAGCGCTGG AATATATCGC GCATGCGCTG GCGCAATCGC CGGCGCATGC GCTGCAATTT
CTGTGA
 
Protein sequence
MPDWTPDLTG SDKPRYLAIA DLIAEDLRSG RLSIGDRLPP QRQLAARLGV DFTTVARGYV 
EAKKRGLVES RVGRGTFVCA PPRQAPPPPR PARSDFVDLS MNLPPEPDDP ALIARMQDGV
AEVSRDLVSL LRYQAFGGSP ADKDAASAWL GRRSLVPSQD RLFVTPGAHP ALLGIFSILA
APGDVVLCEE LTYPGMRAIA AQLRLKLVGL PMDADGVDPD AFKSSCETLK PKAIYLNPTL
HNPTTLTIPA TRRVAIAAVA RRYNVPIVED DAYGFIPTQG QPPFAAIAPD LTWHVAGLAK
CIGAGLRAAY VVAPDARSGW PFAAAMRAAN VMASPLTAAI ATRWIEDGAA DTILRFIRAE
TAARQQLAAD ILPKGSFRSD RLSFNLWMEL PKPWTRSAFI GHMGSTRIGV VASDAFTVGG
DPIEAIRICI GGPTGREEIR SALEYIAHAL AQSPAHALQF L
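
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the database's own method: the helper names (clean, translate, gc_percent) are hypothetical, the input is assumed to contain only the unambiguous bases A, C, G and T, and the blank-separated 10-base blocks used above are stripped before processing. Translation table 11 assigns the same amino acids to codons as the standard code, so the standard codon layout is used here.

```python
# Standard codon assignments in TCAG order (shared by NCBI tables 1 and 11).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate from the first base, stopping at the first stop codon.

    Raises KeyError on ambiguous bases such as N (not handled in this sketch).
    """
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases; the sequence must be non-empty."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGCCTGACT GGACCCCCGA CCTCACCGGC"
print(translate(first_blocks))  # MPDWTPDLTG
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to translate and gc_percent would allow the same comparison against the complete protein sequence and the stated GC content.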