Gene Information |
Locus tag | Msil_2386 |
Symbol | |
ID | 7093938 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylocella silvestris BL2 |
Kingdom | Bacteria |
Replicon accession | NC_011666 |
Strand | + |
Start bp | 2600754 |
End bp | 2601899 |
Gene Length | 1146 bp |
Protein Length | 381 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643465708 |
Product | protein of unknown function DUF201 |
Protein accession | YP_002362678 |
Protein GI | 217978531 |
COG category | [R] General function prediction only |
COG ID | [COG2232] Predicted ATP-dependent carboligase related to biotin carboxylase |
TIGRFAM ID | |
|
|
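The coordinate and length fields above agree with one another, and the relationship can be checked with a little arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon not counted in the protein length. Neither convention is stated in the record itself, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp rows of this record.
length = gene_length(2600754, 2601899)
print(length)                  # 1146
print(protein_length(length))  # 381
```

Under those assumptions the results match the Gene Length (1146 bp) and Protein Length (381 aa) rows.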
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 49 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGTCCGGCC TGAAAACGCA CGCAGGCGCC GCGGTTCTGA TCGCGGCGTC CTCCGGCCGA GCTCTCGCCG CCGCGGCGCG GCGCGCGGGC TTTCGTCCCC TCGTCGCCGA TCTTTTTGAC GATTGCGACA CGCACAGCCT CTGCGCGGCA AGCCTGATTG CGGGGGATTG GCGCGCGGGG TTTTCCCGCG ATCCTCTGAT CGCCGCACTG GAAACACTGG CGAAAGCGGC CTCGCCGATC GGCCTTGTCT ACGGCGCGGG ATTCGAGGAT CGGCCACTCC TTTTGGAGGA GATCGCCGGG CGCTGGCCAG TTTTCGGCAA TCCGCCCGAG CGGCTGCGAC GCGCCAAGGA CCCGATGGCG CTCGCCGCGC TTTGCCACGC GCTTGGCGTT CCCCATCCGG AGATCCGCCT CGGCTTGCCG AACCCCTGCG GCGGCTGGCT CGTCAAAAGC GTCGGCGGCG CCGGCGGCTC CCATGTCGCC CCCGCGGGCT CCGCGCGACC TGAAAACGAA AGCATTTATT TTCAACGGCT TGCCCCTGGG CAGCCGATCT CCGTCCAATG CCTTTGCGAC GGAAGCCGGG CCATCTCGCT CGGCCTCAGC CGGCAATGGA CGTCGCCCGC GCCGGACGAA CCCTTCCGCT ATGGCGGCTG CGTGCGCCCC GCCGGACTTT CCTCCGATCT CGAAACGCGC CTCGCTGACG CCGCCTGCGC GATCGTCGGC GCGCAAGGGC TCGTCGGGCT GAACAGCGTC GACTTTCTTG TCGACGAGAA CGACTTTTAT CTGATCGAGG TCAATCCGCG GCCGGGCGCG GCGCTCGATA TTTTCGAAGA CCGCGAAGGC CGTCTGTTTC AGGCGCACAT CGACGCATGT CTGGGGCGGC TTCCGGTCCG GCCGCTCGAA TTCGAAGCAG CGACGGCCGC TGCAATCGCC TATGCGCAAA GAGATATTGC GGCGATGCCT GAGCTCGACT GGCCGGATTG GACGGCCGAC CGGCAAAAGC CGCAAAGCGC CGTGGGGTTG TATGATCCGC TCTGCACCAT TAAAGCCTGC GCAGCGCAGA CGTCCGCCGC GCGCGCCCTG GTTGAGGCGC GCGCCGGCGC TTTGTTCGAC GCCATAAATT GTAAACTGGG GGGAGAAGCA TCGTGA
|
Protein sequence | MSGLKTHAGA AVLIAASSGR ALAAAARRAG FRPLVADLFD DCDTHSLCAA SLIAGDWRAG FSRDPLIAAL ETLAKAASPI GLVYGAGFED RPLLLEEIAG RWPVFGNPPE RLRRAKDPMA LAALCHALGV PHPEIRLGLP NPCGGWLVKS VGGAGGSHVA PAGSARPENE SIYFQRLAPG QPISVQCLCD GSRAISLGLS RQWTSPAPDE PFRYGGCVRP AGLSSDLETR LADAACAIVG AQGLVGLNSV DFLVDENDFY LIEVNPRPGA ALDIFEDREG RLFQAHIDAC LGRLPVRPLE FEAATAAAIA YAQRDIAAMP ELDWPDWTAD RQKPQSAVGL YDPLCTIKAC AAQTSAARAL VEARAGALFD AINCKLGGEA S
|
| |
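The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch of that clean-up together with a GC-fraction calculation; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first three blocks of the gene sequence are used as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for an empty string)."""
    if not dna:
        return 0.0
    gc = sum(1 for base in dna if base in "GC")
    return gc / len(dna)


# First three blocks of the gene sequence shown in this record.
sample = "TTGTCCGGCC TGAAAACGCA CGCAGGCGCC"
dna = clean_sequence(sample)
print(len(dna))  # 30
print(f"GC fraction of sample: {gc_fraction(dna):.2f}")
```

Applied to the full gene sequence, the same two steps could be used to compare the result against the Gene Length and GC content rows. Note that the gene sequence begins with TTG while the protein begins with M, which is consistent with TTG being an alternative start codon under translation table 11.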