Gene Information |
Locus tag | Msil_0462 |
Symbol | |
ID | 7091193 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylocella silvestris BL2 |
Kingdom | Bacteria |
Replicon accession | NC_011666 |
Strand | - |
Start bp | 510917 |
End bp | 513226 |
Gene Length | 2310 bp |
Protein Length | 769 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643463792 |
Product | RNA binding S1 domain protein |
Protein accession | YP_002360798 |
Protein GI | 217976651 |
COG category | [K] Transcription |
COG ID | [COG2183] Transcriptional accessory protein |
TIGRFAM ID | [TIGR00426] competence protein ComEA helix-hairpin-helix repeat region |
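The coordinate and length fields in this table can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that Gene Length includes the stop codon. Neither is stated on the page, but both are consistent with the listed values. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 510917
END_BP = 513226


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    # The stop codon occupies one codon but encodes no residue.
    return codons - 1 if includes_stop else codons


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length, protein_length(length))
```

Under these assumptions the result agrees with the Gene Length (2310 bp) and Protein Length (769 aa) fields.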
| |
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 51 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCATCCG CCAACGACCG CATTCTGAGC GCCATCGCCA CCGAAATTTC CGCCCGGCCG GAACAGGCGA AGGTCGCGGT CGATCTCATC GATTCCGGCG CAACGGTTCC CTTCATCGCG CGCTATCGCA AGGAGGCGAC CGGCGGCCTC GACGACAACC AGCTTCGCCT GCTCGAAGAA CGCCTTGTCT ATCTGCGCGA GCTTGAGGCG CGTCGCGCCT CCATCCTCGA CCAGATCAAA ACGCAGGGAA AATTGACCGA GGCGCTGGAG CAAAAAATCG CCGCAGCCGC GACCAAGGCG GAACTTGAGG ATATTTATCT CCCCTTCCGT CCCAAACGCC GCACCCGCGC GGAAATCGCC AGGGAGCGCG GCCTTGGGCC CCTGGCCGAG CGCATTCTTG TCGATCGCGC CTGCGTCCCC ACCCAACTCG CGCAAGGCTT TCTCTCTGCG GACGTCCCCG ACATTAAAAG CGCGCTCGAT GGAGCCCGCG ATATTCTGGC CGAAACTTTT AGCGAAAACG CCGATCTCGT TGGCGAGCTT CGCGCCTATA TGCAGCGCCG CGCGGTGCTG CGCGCCCGCG TGATCGAGGG CAAGGAAGAG GCCGGCGCCA AATTTTCGGA TTATTTCGCC CATAGCGAGC GCTGGGCGAC GACGGCCGGC CATCGCGCCC TCGCCATGCT GCGCGGGCGC AACGAGGAAT TTTTGTCCCT CGACATCGAG GTCGACGCCG ATATTGAGGC CCCGATCAGG CCGGTCGAGG AGATCATCGC CCGCCATTAC GCCATCGACG CCAAGGCCGG CGCGGCGGAT TCCTGGCTGA TGGAGGTCGC GCGCTTTGCT TGGCGCGCCA AACTTTCGCT GCATCTGTCG CTGGATCTTA TGAGCGAGCT GCGCGAACGG GCCGAGGCCG AGGCGATCGA CGTTTTCGCT CGCAATCTCA AGGATCTGCT GCTGGCGGCC CCGGCCGGTC CGCGCGCCAC CATGGGGCTC GACCCCGGCA TTCGCACCGG CGTCAAAGTC GCGGTGATCG ACGCCACCGG GAAACTGCTC GACACATCGA CGGTCTATCC GTTCCAGCCG CGCAATGATG TGCGGGGCGC AGAGGCGGAA CTCGCCCGGC TCATCCGCAA ACATGGCGTC GAACTGATCG CCATCGGCAA TGGCACGGCC AGTCGCGAGA CCGAGCGTCT CGCCGCCGAA ATCATCAAGC AGCTGCCGGC GCCGCAGCCC ACCAAAGTCG TCGTCAGCGA GGCCGGCGCC TCGGTCTATT CGGCCTCCGC GCGGGCCGCC GCGGAAATGC CGGACCTCGA CGTTTCGCTG CGCGGCGCGG TCTCGATCGC ACGGCGTCTG CAGGACCCGC TCGCCGAACT CGTCAAGATC GAACCAAAGG CGATCGGCGT CGGTCAATAT CAACATGACG TCAATCAGTC GCGCCTCGCC CGCGCCCTCG ACGCCGTCGT CGAGGACGCC GTCAACGCCG TCGGCGTCGA TCTCAACATG GCCTCGGCGC CGTTGCTGTC GCGCGTCTCG GGGCTCAGCG CTTCGCTTGC CGAGGCGATC GTCGGCCATC GCGATCAACA CGGAGCCTTC AAGACGCGCC GAGCGCTCCT CGAAGTGCCG CGTTTGGGAC CCCGCGCCTT TGAGCTCAGC GCCGGATTTT TGCGCATCCC GAACGGCGAC GAGCCGCTCG ACGCGTCATC CGTTCATCCC GAGGCCTATG GCGTCGCGCG CAAAATCGTC GCGGCCTGCG GGCGCGATCT GCGCGCGCTG ATGGGCGACG GCGCGGCGCT GAAAGCGTTG 
AATCCGGCCC GTTTCATCGA CGAGCGCTTT GGCCTGCCGA CGGTGCGCGA CATTCTCCTC GAATTGGAAA AGCCCGGTCG CGATCCGCGT CCCGAATTCA AGACGGCGGT CTTCGCGGAA GGGATCGATG AGATCGCCGC GCTGAAGCCC GGCATGGTCC TCGAAGGCAC GGTGACCAAT GTCGCCAATT TCGGCGCCTT CGTCGACATC GGCGTGCATC AGGATGGGCT TGTCCATGTG TCCCAATTAG CCGACCGCTT CGTCAAGGAC CCGGCCGAAG TCGTCAAGGC GGGCGCCGTC GTCAAGGTCC GCGTGCTGGA AGTCGATCTG AAACGGAAGC GGATCGCTTT GTCGATGCGC AAGGAGGATC TTGCTAAAGG GTCTGGCGCC GCCCCGCCGC CGCCGAGCGA CGCGTTTCGG CCGCAGGACC GACGTCCGAA AGCTGTGGAT CGAGGCGCGT CAGAGGGCGC GCTCGGCGCC GCTCTCGCCG AAGCGCTTCG CCGTAAATAA
|
Protein sequence | MSSANDRILS AIATEISARP EQAKVAVDLI DSGATVPFIA RYRKEATGGL DDNQLRLLEE RLVYLRELEA RRASILDQIK TQGKLTEALE QKIAAAATKA ELEDIYLPFR PKRRTRAEIA RERGLGPLAE RILVDRACVP TQLAQGFLSA DVPDIKSALD GARDILAETF SENADLVGEL RAYMQRRAVL RARVIEGKEE AGAKFSDYFA HSERWATTAG HRALAMLRGR NEEFLSLDIE VDADIEAPIR PVEEIIARHY AIDAKAGAAD SWLMEVARFA WRAKLSLHLS LDLMSELRER AEAEAIDVFA RNLKDLLLAA PAGPRATMGL DPGIRTGVKV AVIDATGKLL DTSTVYPFQP RNDVRGAEAE LARLIRKHGV ELIAIGNGTA SRETERLAAE IIKQLPAPQP TKVVVSEAGA SVYSASARAA AEMPDLDVSL RGAVSIARRL QDPLAELVKI EPKAIGVGQY QHDVNQSRLA RALDAVVEDA VNAVGVDLNM ASAPLLSRVS GLSASLAEAI VGHRDQHGAF KTRRALLEVP RLGPRAFELS AGFLRIPNGD EPLDASSVHP EAYGVARKIV AACGRDLRAL MGDGAALKAL NPARFIDERF GLPTVRDILL ELEKPGRDPR PEFKTAVFAE GIDEIAALKP GMVLEGTVTN VANFGAFVDI GVHQDGLVHV SQLADRFVKD PAEVVKAGAV VKVRVLEVDL KRKRIALSMR KEDLAKGSGA APPPPSDAFR PQDRRPKAVD RGASEGALGA ALAEALRRK
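The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is a minimal illustration and not the database's own pipeline. It assumes the gene sequence is listed 5' to 3' on the coding strand (it begins with ATG even though the gene lies on the minus strand), and it uses the fact that translation table 11 has the same amino acid assignments as the standard code, differing only in its permitted start codons. The `translate` helper is a hypothetical name.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# Amino acid assignments are those of the standard code, which
# translation table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string; whitespace between blocks is ignored."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


if __name__ == "__main__":
    # First three 10-nt blocks of the gene sequence on this page.
    print(translate("ATGTCATCCG CCAACGACCG CATTCTGAGC"))
    # Last four codons of the gene sequence, ending in the stop codon.
    print(translate("CGCCGTAAATAA"))
```

The first call yields the opening ten residues of the listed protein (MSSANDRILS). The second yields its final three residues followed by `*` for the TAA stop codon.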