Gene Msil_1371 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_1371 
Symbol 
ID7091709 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp1481318 
End bp1482688 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content61% 
IMG OID643464709 
Producttranscriptional regulator, CarD family 
Protein accessionYP_002361698 
Protein GI217977551 
COG category[K] Transcription 
COG ID[COG1329] Transcriptional regulators, similar to M. xanthus CarD 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value0.401785 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGTCCG TCAAGAAGAC CCTACAGCCG TCAAAAATCA AATCAGCGGC GCCTGCCAAT 
GCATCGAAAC AATTAAAGGC GTCTTCGTCC ACGATTAAGG CGAAGACCGG CCAGGATGCG
GGGGAGCCCA AGATCAAGGC GCGGAGCGCC GCGGCGCCGT CAAAGCGCGC GGCGGCGGCG
GATTCGTCTC GCGCAAAGAG CGCTCCAAAG AGCGCCGCCA AGATCATCGC CGACGCCGCG
GCTGTAACCA AAGATAGCGC GGCGGGCAAA GAAAGAAGAA CTGCGGACCA AAGAAAAGCG
GAGCGGGAGA CGGTCAAGAG AACCGCGCCC GAGGCGAAAG CCGACGCTTC AGTGAAACCC
TCTTCTTCGG ACAAGAGCTT GTCGACTAAA AACACTTCGA TCAAAAACAG TTCAGCTAAA
AACGCTTCGG CCAAGAAAAC GTCAGCCGGA AAGAATTCGG CGAATCCGGC CCCCGAAGAC
GCCGATAAGA AATCGGAGCC TGCGCCGAGC GTCTCGGCGG TTTCCGTCGC CCGCTTTCAG
GCCGCCAGAA CGGCTGCGGC GGCCGCGCAG CAAAAAATTA CGAGCCACGC AGCAGCGCCT
CTTCGCGCCG CGGCGTTTTC ATTGTCGCCC CCGGACCCAG ACGAGCGAGA TATAGTGACA
AAGAAAGTGG AAAAGAAAAC GACCGCCCTT GAAGTAGCGG TCAAGGTCGA GGCTCCTGCC
GAGACCGTGG TGGTCGAGGC CGCTGTCGCC CCGGTTGCGG CCATTGACCC GAAGGCTGCG
CTCGCCAAGC CGGCCAAGGC CTCCGGCTCG CGCCAGAGCG GATTCAAGCC GAACGAATAC
ATCGTTTATC CGGCGCATGG GGTCGGTCAG ATCGTCGCGA TCGAAGAACA GGAAGTGGCT
GGCTTCAAGC TCGAACTGTT CGTCATCAGC TTCGTCAAGG ACAAGATGAT TCTCAAGGTG
CCGACGCCCA AGGTGACGAG CGTCGGGATG CGCAAACTCG CCGAGGCCGA TGTCGTCCGC
CGTTCCCTCG ACACGCTGGC CGGCCGGGCG CGCATCAAGC GCACCATGTG GTCGCGCCGG
GCGCAGGAAT ATGAGGCGAA GATCAATTCG GGCGATTTGA TCGCGATCGC TGAAGTCGTT
CGCGATCTTT ACCGTTCGGA TTCGCAGCCG GAGCAGTCCT ATTCGGAGCG TCAGCTCTAT
GAGGCCGCGC TCGACCGCAT GGCGCGGGAA GTCGTCATCG TGGAGAAGCT GACGGAGACC
GAAGCGCTGA AGGCGATCGA GGCGCAGCTG CAGAAGGGTC CGCGCCGCGG CGGCAAGGCG
GAAGAGATTG AAGTCGACGA CACGGAAACG GAGATCGAGC AAGCCGCCTG A
 
Protein sequence
MRSVKKTLQP SKIKSAAPAN ASKQLKASSS TIKAKTGQDA GEPKIKARSA AAPSKRAAAA 
DSSRAKSAPK SAAKIIADAA AVTKDSAAGK ERRTADQRKA ERETVKRTAP EAKADASVKP
SSSDKSLSTK NTSIKNSSAK NASAKKTSAG KNSANPAPED ADKKSEPAPS VSAVSVARFQ
AARTAAAAAQ QKITSHAAAP LRAAAFSLSP PDPDERDIVT KKVEKKTTAL EVAVKVEAPA
ETVVVEAAVA PVAAIDPKAA LAKPAKASGS RQSGFKPNEY IVYPAHGVGQ IVAIEEQEVA
GFKLELFVIS FVKDKMILKV PTPKVTSVGM RKLAEADVVR RSLDTLAGRA RIKRTMWSRR
AQEYEAKINS GDLIAIAEVV RDLYRSDSQP EQSYSERQLY EAALDRMARE VVIVEKLTET
EALKAIEAQL QKGPRRGGKA EEIEVDDTET EIEQAA