Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msil_1107 |
Symbol | |
ID | 7093869 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylocella silvestris BL2 |
Kingdom | Bacteria |
Replicon accession | NC_011666 |
Strand | + |
Start bp | 1193718 |
End bp | 1195802 |
Gene Length | 2085 bp |
Protein Length | 694 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 643464447 |
Product | hypothetical protein |
Protein accession | YP_002361438 |
Protein GI | 217977291 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAAA TGGCTCTTTC CGGCGCTGTA TTTTTGGCGG GCGGCCTGTC GTTACTTGGG GCAAACTCTC TCCTTGCCGC CGAAATCCCG GGGACTGTTC CAAAGACCCT GCCTCGACCT TCCACGGTCG CGCCGGATGC GAATAATGCG GCGCTCGCCG CATCGGGCCT CAAGGGATTT TATGAGAAAT ATGGCAAGTA TCAGGTTTCG GCCGCGGCGG CCGGAAGCAA CAACGTCAGC CACGTCGCAA CTGTAACGAA ACCCAGCAGG GACGCCGTTG TCGACAAGGC CTTTGTGATG GCCGCCTCGA ATAATTTTGT GCCTATCGCC AATGGCGACA TCACCATCGA CGGCAATTCC ATCACCTGGA TCGAAGCCGT AACGAACGAT ATCATTGGAT TTCCAAGCTA CTTCAATAAC GTTCGCGCCG ATGTCACGGC GTTCGTAAAG CCGAAGTTGG ACGCGGCGGC GCCAGGCGGC GTTGGTTTTA CTTTTTCCGA AATCCGGAAC AACGCAAATA TTGACGGCGA GGTCCTCGTG GTCGTCTTCA AGGCGCCCTC GACAAAAGGT TCGCGGACGA TTTCATTACT GTTCGGCGGC CAGAAATTGT CCGGCGACCG CTTCGAACTT GTGCTCGCAA ATGCGATCGA CCCGAAAAAG GCCGGCGTCG TCGCCAATAT GGGCCTTGGA ATCTCATACA GTTTCCAATC CAACGGCATC CAACAATATA GTTCGATCGA CGTGAATGGA AAGCGCCTGT CGACCGCCGC CGGCGGAGAG GACGACGGCA AGCCGTTCAA CGGCGCGTTG ATTACCGTCG GAGGGGTTGG CGACGCGATC GACAATCCTG CCAACCCCCT GGCGACGCCC ACCAACCCGC GCAGCGACGA CGAGCTTTAT TCCTTGCTCC CCTATCTCAA AAAGACCGAC AAGATTATCC GGATCGACAC GAAAAACCCC TCGAACGACG ACAATATCTT CTTCGCTTAT TTCGACCTTT CGGCCAACGC CAGCGTCAAC AAGGATACCG ACGAAGACGG ACTGCTCGAC GCCTGGGAGC TCTATGGATA TGACGCCGAC GGCGACGGCA AAATCGATGT CGACCTTCCC AAGCTCGGCG CCAATTATAA GAAGAAGGAT ATCTTCATCG GCTACGCCTG GATGAATGCG GGGCCGGCGG AAGGAGCGTC ACATCAGCCC TCGGCGGCCG TGCTCACGGC GGTGCAGAAG GCCTTCGCCG CCGCTCCCGT CTCAAATCCC GGCGGGTCCA AAGGCATTGC CGTCCATTTC GTCAGCAAAG GCGCCGTCCC TCACAAGGAT AATCTGGATC CGGTGTGGAC CGATTTTGAC ACGATCATGG ATCCGCTGTT CACCAGCGCA GAACGTCATG TCTTTCATCG CCTGTTGTGC GCCCATAAAT ACAGCGGCGG GACCAGCAGC GGGCTTTCGC GCGGCATCCC GGAGAGCGAT TTCATCGAGA CTCTGGGCGG GTGGGGCTCC AACCCGGGGA CGTTCAAAGA GCAGGCTGGC ACCATCATGC ATGAACTTGG GCACAATCTT GGCCTGCGTC ATGGCGGCGT CGATCACGAG AACTACAAGC CCAATCATTT GAGCGTCATG AACTACAATT ATCAGGTCAA TTGGCTGTCC AAGTCCGGCA AGGATCTTTT GGATTATGAG CGCTTCGACC TCGGTGCCCT GGACGAGAAC AAGCTGAAAG AAAAGAACGG CCTTGGCAGC CCCTTGGTCG CAGCCTACGG CTTGCGCTGG TTCAGTGGCG GCATCTCAAA AATAAAGGCC AACGGCGCCA ACAAGAATGT CGACTGGAAT GCGAATGGCG GGATCGACAA CAATGACGTT GCGGTCGACC TCAACAATTC CGGCGCAAAG AGCGTCCTCA ACGCAAATTT TATCGAGTGG AATGAAATTG TCTTCGACGG CGGCGCCATC GGCGACGGCG CCAACGCCAC GCCAAGCGCC GCCAGAGCGC GTAAATCCAA CATCATCACG AAGCCGCAGG ATCTTCAGGA GTTGACCTAT GAAGAGTTCG TCAGAAAACA GGCGAACCCT GTCCTGGTGA AATGA
|
Protein sequence | MKKMALSGAV FLAGGLSLLG ANSLLAAEIP GTVPKTLPRP STVAPDANNA ALAASGLKGF YEKYGKYQVS AAAAGSNNVS HVATVTKPSR DAVVDKAFVM AASNNFVPIA NGDITIDGNS ITWIEAVTND IIGFPSYFNN VRADVTAFVK PKLDAAAPGG VGFTFSEIRN NANIDGEVLV VVFKAPSTKG SRTISLLFGG QKLSGDRFEL VLANAIDPKK AGVVANMGLG ISYSFQSNGI QQYSSIDVNG KRLSTAAGGE DDGKPFNGAL ITVGGVGDAI DNPANPLATP TNPRSDDELY SLLPYLKKTD KIIRIDTKNP SNDDNIFFAY FDLSANASVN KDTDEDGLLD AWELYGYDAD GDGKIDVDLP KLGANYKKKD IFIGYAWMNA GPAEGASHQP SAAVLTAVQK AFAAAPVSNP GGSKGIAVHF VSKGAVPHKD NLDPVWTDFD TIMDPLFTSA ERHVFHRLLC AHKYSGGTSS GLSRGIPESD FIETLGGWGS NPGTFKEQAG TIMHELGHNL GLRHGGVDHE NYKPNHLSVM NYNYQVNWLS KSGKDLLDYE RFDLGALDEN KLKEKNGLGS PLVAAYGLRW FSGGISKIKA NGANKNVDWN ANGGIDNNDV AVDLNNSGAK SVLNANFIEW NEIVFDGGAI GDGANATPSA ARARKSNIIT KPQDLQELTY EEFVRKQANP VLVK
|
| |