Gene Information |
Locus tag | Msil_3043 |
Symbol | |
ID | 7092720 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylocella silvestris BL2 |
Kingdom | Bacteria |
Replicon accession | NC_011666 |
Strand | + |
Start bp | 3356107 |
End bp | 3357414 |
Gene Length | 1308 bp |
Protein Length | 435 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 643466353 |
Product | protein of unknown function DUF264 |
Protein accession | YP_002363315 |
Protein GI | 217979168 |
COG category | [S] Function unknown |
COG ID | [COG5323] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR01547] phage terminase, large subunit, PBSX family |
|
|
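The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes the stop codon; neither convention is stated in the record itself, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the stop codon is
    counted in the gene length but not in the protein length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(3356107, 3357414)
print(length, protein_length_aa(length))  # prints "1308 435"
```

Under these assumptions the result agrees with the Gene Length (1308 bp) and Protein Length (435 aa) fields.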
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 67 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTGACAT TCAGCCCGGG CCAGGATGCG GCGCGCCGCT TGCTGGAGGG ACCGCAGCGT TACACTTGTC TTGCCGGAGG CACGCGCTCT GGCAAGACTT TTCTGATCAT CCGCGCAATC ATTATACGCG CTCTCCAGGC TGAAGAGACC CGGCACGCGG TTCTGCGCTT CCATGCCAAT GCGGCTCGCG CATCCATAGC GCTGGACACG CTGCCGCGCG TCATGCGCCT CTGCTTTCCG GACGCGACGT TGCGCGAGCG GCGGCAGGAC GGATATTTCG AACTTGGGAA TGGATCGCGA ATCTGGATCG GCGGCCTCGA CGACAAAGAC CGCGTGGAGA AAATACTCGG ACTGGAATAT GCGACAATTT TTTTGAATGA GGCGTCACAG ATCCCCTATT CGTCGGCTTT GATCGCTTTC ACGCGGCTCG CGCAGGTCGC GCCGCGGATT GATCAACGGG CTTTCGTCGA TCTAAACCCT GTCGGCAAGA CACATTGGAC CAATCAGCTG TTTGGAGAAA AGCGCGACCC GGTGTCGAGA CGACCGCTGC CAGACCCGGA GAGCTACCGC CGCGCCTTCC TCAACCCGCC CGACAACAAA GCGAATCTAT CGCGCGAATT TCTGGCAAGT CTCTCTCATT TGCCGGAAAA GCAACGCAAG CGCTTTCTTG ACGGCGTGTA TGTGGATGAA GTCGACGGCG CGCTTTGGAC CTATGCCGGA ATCGATGCAG GACGATGCGC GGCTGAGCGC ATATCCGTGG ATAAAAGAGC TGCGGTCGTT GTCGCTGTGG ACCCATCGGG AGCGGCGGGC CGGGACGATC TTGGAGCCGA TGAGATCGGC ATAATCGTCG CCGCCAGGGG CGTCGATGGC GACGCCTATA TTCTGGAGGA TCTATCGTGC AGGGATGCGC CAGCCGTTTG GGGCAGGCGG GCAGTGGTGG CCTTCCATAG ATATCAAGCC GACAGCATCG TCGCGGAAAG CAATTTTGGC GGTGAAATGG TCCGGGCGAC GATACAGGCG GCGGATCGGA ATGTTCCGGT AAAGCTCGTC ACTGCGAGTC GCGGCAAGGC CGTGCGCGCC GAACCGATCT CGGTGCGCTA CGCTCAAGGA CAGGTCCATC ATGTCGGTAG ATTTCCCAAG CTGGAAGACC AGCTCTGCGC CTTTTCAAGC GCCGGCTATA ACGGCGGCGG CAGCCCCGAT CATGCCGATG CGGCGATCTG GGCGCTGACG CATCTGTTTG GCGCAGACGA CGGGACCGGA ATCATCGAGT TTTATCGCCG CGAAGCTGAA ATCAAGCGTC GCTCCTGA
|
Protein sequence | MVTFSPGQDA ARRLLEGPQR YTCLAGGTRS GKTFLIIRAI IIRALQAEET RHAVLRFHAN AARASIALDT LPRVMRLCFP DATLRERRQD GYFELGNGSR IWIGGLDDKD RVEKILGLEY ATIFLNEASQ IPYSSALIAF TRLAQVAPRI DQRAFVDLNP VGKTHWTNQL FGEKRDPVSR RPLPDPESYR RAFLNPPDNK ANLSREFLAS LSHLPEKQRK RFLDGVYVDE VDGALWTYAG IDAGRCAAER ISVDKRAAVV VAVDPSGAAG RDDLGADEIG IIVAARGVDG DAYILEDLSC RDAPAVWGRR AVVAFHRYQA DSIVAESNFG GEMVRATIQA ADRNVPVKLV TASRGKAVRA EPISVRYAQG QVHHVGRFPK LEDQLCAFSS AGYNGGGSPD HADAAIWALT HLFGADDGTG IIEFYRREAE IKRRS
|
| |
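The sequences above are printed in space-separated blocks of ten characters. A minimal sketch for working with them follows: it strips the spacing, computes GC content, and translates the coding sequence. The function names are hypothetical. The codon table is the standard amino-acid assignment, which translation table 11 shares for internal codons; the sketch does not handle alternative start codons, which is not an issue here because this gene begins with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.
    Trailing bases that do not form a full codon are ignored."""
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# The first four codons of the gene sequence give the first four
# residues of the protein sequence.
print(translate(clean_sequence("ATGGTGACAT TC")))  # prints "MVTF"
```

Passing the full Gene sequence field through `clean_sequence` and then `gc_percent` or `translate` allows the GC content and Protein sequence fields to be checked against the nucleotide data.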