Gene Information |
Locus tag | Msil_2156 |
Symbol | |
ID | 7093377 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylocella silvestris BL2 |
Kingdom | Bacteria |
Replicon accession | NC_011666 |
Strand | + |
Start bp | 2328416 |
End bp | 2330293 |
Gene Length | 1878 bp |
Protein Length | 625 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643465480 |
Product | protein of unknown function DUF882 |
Protein accession | YP_002362456 |
Protein GI | 217978309 |
COG category | [S] Function unknown |
COG ID | [COG3108] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
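The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based, inclusive coordinates (an assumption about the record's convention, which the listed gene length supports) and a CDS that ends in a single stop codon not counted in the protein length. The function names are illustrative only and are not part of any database API.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the Msil_2156 record above.
length = gene_length(2328416, 2330293)
print(length)                  # 1878
print(protein_length(length))  # 625
```

Both results agree with the Gene Length (1878 bp) and Protein Length (625 aa) fields of the record.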
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 56 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCGACGAG CCCGCAGCCG TTTCGTCCCC GCAAGGCAGG CGCTGCGCCT CGCCGCGGTT TCGGCTGCCG CGCTGCTCTG CAGCCTTTTT GCCGGCTCCT CGACCGAAAC CGCCGAAGCC AATGGCGATA CGCGCACGCT GAACCTCTAT CATTCGCACA CAGGCGAATC GATCCAGGCG ACCTTCCGGG TCAATGGCTC CTATGACCCT GCGGTTCTGG AAAAGCTGAA TTATTTCCTG CGCGACTGGC GCAACAACGA CCGCACGCGG ATGGACCCGC GCCTGTTCGA TACGGTGTGG GAAGTCTATC GCACGGCGGG CGCGACGCAG CCCATCGTGA TCTTCTCCGC CTATCGCTCG CCCGAGACCA ACGCCATGCT GCGCCGGCGT TCGAGCGCGG TCGCCGAATA TTCGCAGCAC ATGCTCGGCA AGGCGATGGA CACCACCATG CCCGGCATGT CGATGGAGCA GATCCGCGAG ATCGGCATCA AGATGCAACG CGGCGGCGTC GGCTTCTATT CGCGCGAGAA TTTCGTGCAT CTCGATGTCG GCGGCGTGCG CAGCTGGCCG CGGCTCAGCT ATGAGCAGCT CGCCCGGCTG TTCCCCGACG GCAAGAGCGC GCATCTCGCC TCGAACGGCC GCTCGCTGCC GCGCTATGAG GAAGCCCGCG CCGAAATCGC TTCGCGCGGC GGCGTCGTCA GCGACGCGCC GCAAGCCATG GCCGGCGGCG GCTTCTTCGG CTGGCTGTTC GGCGGCGGTC GCGACCGCCA GGAGGAGGCC GAGGCCGTCC GCGGCGCAAT GCCCGGCCGC GGCCCGGCGC CGTCCGGCGC GACGCAGGTC GCCGCGCTCG AAAAACCCGC TCCCGCCGTA AGGATGACGC GCGCCGAACG TCGCGCCGCC GAAAAAGCCG CCAAAGCCGC GCCAGCCCTC GCCGCAATCG ACGCCGCCGA TCCGCCGGAC GCCAACGCCA ATCGCTCGCT CGTCGCGGCT CTCGCGCCGC CGCCCGCGCC CATCCCCGCC GCGCAGCCGC GGCTCGCAGC CGCGGTTCCG GCGCCAACCG CGCCGACGCC GCCCCCGCGC GCCGCCGAGC AGGCCGCTGC CCCGGATGCG CCCGTCAAGG ACGCCGCGAA AGACAGGGAC AACGAAAAAG ATGCGGCGAA GGAAAAGCCA GACGCCGTAT CGAAGCAGTT GATGGCTGCA GTCCCCTTGC CGCCGGCTCG GCCGGATGAC CTCATCGCCT ATGCCGACGT GCCGCTGCCG CCGTCGCGCG CCGAGCATCT GATCCAGACC GCGTCGCTGA CTGCCGCACC GATGACGACG CCCCAAGTCG TGGCAGATGC CGTTACAAAG CCGGTCCCGA CGACAGTCGC GGCGAAGATG GACGCGGCCC GAGCCGAGGC GGCCAAATCC GACCCCGCCA AAGCCGGCAT GGCGATGACG CTCGCCCGCG CCGCAAGCCT TCCCGTCGTC ATCACGCGCG GCCCGAAAGA TCAGCAAGTG ATGCCCGCGA GCGTCCTTGG CTATGCCGCG AGCTCCGGCG CAGGACCGCG TCAGCGCATC GCAAGACTCG GCAAAGATGG CGACGCGCCC GACATCGTTT CGGCGCGGCT CGACCGGTCT AATTTCAGCG ATCTCACCAG CGAAACGCCG ACGGCCGAAG CGCCTCCCGC CTCCCTTCTC GGCCAGGCCC TGACCGGCCT GCGGCAGGCG GCCCGGATTG TCCCGGACGC CCTCGCAGCC GCGCCATCGG CCAGCTATAA ATTGGCGTTC GGCGCGGCCT CGGGGATTCT CAACTACGCC 
AGCTTCACCA AACCGCAGCC GCCCAAAGAG GCGTCGCATG CGGATAAGGT CAGCATCGTC GACGCTTCGG CGAAATAA
|
Protein sequence | MRRARSRFVP ARQALRLAAV SAAALLCSLF AGSSTETAEA NGDTRTLNLY HSHTGESIQA TFRVNGSYDP AVLEKLNYFL RDWRNNDRTR MDPRLFDTVW EVYRTAGATQ PIVIFSAYRS PETNAMLRRR SSAVAEYSQH MLGKAMDTTM PGMSMEQIRE IGIKMQRGGV GFYSRENFVH LDVGGVRSWP RLSYEQLARL FPDGKSAHLA SNGRSLPRYE EARAEIASRG GVVSDAPQAM AGGGFFGWLF GGGRDRQEEA EAVRGAMPGR GPAPSGATQV AALEKPAPAV RMTRAERRAA EKAAKAAPAL AAIDAADPPD ANANRSLVAA LAPPPAPIPA AQPRLAAAVP APTAPTPPPR AAEQAAAPDA PVKDAAKDRD NEKDAAKEKP DAVSKQLMAA VPLPPARPDD LIAYADVPLP PSRAEHLIQT ASLTAAPMTT PQVVADAVTK PVPTTVAAKM DAARAEAAKS DPAKAGMAMT LARAASLPVV ITRGPKDQQV MPASVLGYAA SSGAGPRQRI ARLGKDGDAP DIVSARLDRS NFSDLTSETP TAEAPPASLL GQALTGLRQA ARIVPDALAA APSASYKLAF GAASGILNYA SFTKPQPPKE ASHADKVSIV DASAK
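The gene sequence begins with TTG while the protein begins with M, which is what translation table 11 (listed in the Gene Information table) would be expected to produce for an initiating TTG codon. The following is a minimal sketch of that translation using only the Python standard library. It assumes that the amino-acid assignments of table 11 match the standard genetic code, and the set of start codons shown is a deliberately small, illustrative subset rather than the full list for that table. The names `CODON_TABLE`, `START_CODONS`, and `translate_cds` are hypothetical helpers introduced here.

```python
from itertools import product

# Standard codon table in TCAG order (third base varies fastest).
# Assumption: table 11 uses these same amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Illustrative subset of initiation codons; not the complete table 11 list.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna):
    """Translate a CDS, reading an initiating start codon as M and stopping at a stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiating codon is read as methionine
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":
            break  # stop codon ends translation
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in the record above.
print(translate_cds("TTGCGACGAG CCCGCAGCCG TTTCGTCCCC"))  # MRRARSRFVP
```

The output matches the first ten residues of the protein sequence in the record. Translating the full gene sequence the same way should yield the listed 625-residue protein, provided the assumptions above hold.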