Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msil_1361 |
Symbol | |
ID | 7091699 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylocella silvestris BL2 |
Kingdom | Bacteria |
Replicon accession | NC_011666 |
Strand | + |
Start bp | 1469441 |
End bp | 1470451 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643464699 |
Product | nuclease (SNase domain protein) |
Protein accession | YP_002361688 |
Protein GI | 217977541 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 0.048766 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGATAA GTCGAAGCAA AAAGCAGGCC TTGCGCGCGG CGAGGCCGCC AGGCTCCGGA CCCGCGCCCG CCAAGCGGGA AGCCGAGGCT CGCGTCACGG GCCTGATCGC AGCGCTCGTC GCGGGTTTTG CGCTCAACGC CGCGCCAGGG AGGGCGTCGG CGGAGGGGAT TCAAGCCGCG CAAGCGGCGC AGGCCAATCA AGCGGCGAAC TCGGCTGAGG ACTGCGCCAT TCCGCCGGCA AGCCAGGGCC GGCGGCGCGT CGACCTCGAA AAAGCCCGTG TCAAAAGCGT CGACGAACGA CTGGAGCTGA CGCTCGACGA CGGCCGCAGG CTGAAGATCG CCGGCCTCGA TCCGCCGCGC CCGACGCCGG GCGCCCCCGA GCTCGACATT GAAACGGGAC AAAAGCTCGG CGCCTGGCTC TCCGGCAAGG CGGTGATCTT CCGGCCCGTC TTCACCGCGC CGGATCGCTG GGGGCGCATC AATGCCGAAG TGTTCGCGCC GGCCGGCGAG GACGACGGGT CGCCGCCGAT CTCCGTCGCG GGCGCGGCGC TCGACGCCGG ACTGGCGCGA TTTGAGTCCG GCGCGGCGGG ACGGCGGTGC CGCGCCTTTC TGCTTGCCGC CGAAGGCGCG GCGCGCGCCG CCGCGCTTGG CCTTTGGGGC GATCCATATT ATGCGGTAAT CGCCGCCAGC GACCGCGCCT CCTTTGCTGA AAAGACCGGG ACAAGCGTCA TCGTCGAGGG CCGCGTCGCC CGTGTCGAGG AGGATAAATT CCGTACCCTG CTGCTGTTTG GCGAGCGCCG CGGCTGGGAT TTTTCCGTTA CAATATTGCA ACGCAATCGG AAACTCTTTA CGGCCGCGGG GTTGGATGTC TCGACGCTGA AAGACAAAAC AGTGAGGGTG CGAGGCTTGC TCGATATGCG GTTCGGGCCG CAGATCGAGA TTGAACGCCC TGACGAAATC GAAGCGATGA CGCCAGGGCA GGACGCCGCC GCCGCTTCGT CCCGGCGGTA A
|
Protein sequence | MKISRSKKQA LRAARPPGSG PAPAKREAEA RVTGLIAALV AGFALNAAPG RASAEGIQAA QAAQANQAAN SAEDCAIPPA SQGRRRVDLE KARVKSVDER LELTLDDGRR LKIAGLDPPR PTPGAPELDI ETGQKLGAWL SGKAVIFRPV FTAPDRWGRI NAEVFAPAGE DDGSPPISVA GAALDAGLAR FESGAAGRRC RAFLLAAEGA ARAAALGLWG DPYYAVIAAS DRASFAEKTG TSVIVEGRVA RVEEDKFRTL LLFGERRGWD FSVTILQRNR KLFTAAGLDV STLKDKTVRV RGLLDMRFGP QIEIERPDEI EAMTPGQDAA AASSRR
|
| |