Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msil_1075 |
Symbol | |
ID | 7091904 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylocella silvestris BL2 |
Kingdom | Bacteria |
Replicon accession | NC_011666 |
Strand | + |
Start bp | 1164268 |
End bp | 1165515 |
Gene Length | 1248 bp |
Protein Length | 415 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643464415 |
Product | protein of unknown function DUF900 hydrolase family protein |
Protein accession | YP_002361406 |
Protein GI | 217977259 |
COG category | [S] Function unknown |
COG ID | [COG4782] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 0.290615 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGCTTA CGCGCGCGGC GCGAACCATT TCCGTCCTTG CCCGGCCGGC GCTGCTGATC GCCGGCCTGC TGGCGCTTGC GGGCTGCGGC GGCGGAATGA CGGATCTCGA CGGCGCGGGC GCCGGGCCGC GCCCGACCTC GGTGTTTGTC GTCTCGACGC GCAAGGGCGA AAGCGGTCCC GCCAGCGAAC TCAGCAATGA TGGAAACGAG CGCTATTCGC TGCAGATGAT CGGCGCGCCG CTCAATCACC AGATCGGACA ATTGGAGCGC CCTTCCATCG GCAGTCCCGA TCCCGCGCGT CATTTTGCGC TTCAGACGCG CCGCGCGCTC GACGAGGATG GTTTTACGGC CGCGCTCGCC ACGCATCTGT CGGGGCGCAT CGGCTCCAAC CGCGACGTTC TCCTTTATGT GCACGGCTTC AACACGAGCT ATGACGAGTC GCGGTTCCGG CTCGCGCAGA TCGTCGCGGA CGGCCGCTTT GGCGGCGTCG CCGTCCTGTT CACCTGGCCG TCGACGAATA ATCTGCTCGA CTATGGCGCG GCGAAAGAGA ATGCGACGAT CTCGCGGGAC GCGCTGGCGA AGCTGATCCG GCAGCTGACG GATGCGCCCG ACGTCGGGCG CGTGCACATC CTCGCCCATT CGATGGGGGC CTGGCTGACC ATGGAGGCGC TGCGCCAGGA TTTTATCGCG GGCGGCGCGC GGCTGAACGA CAAGCTGGGC GATATCATGC TGGCCGCTCC CGACATCGAT CTGAATGTTT TCCGCCAGCA GATCAGCCGC CTCGACGCGT CGCACATCTT CGTGCTCGTT GCAGCCAATG ATCGCGCCTT GTCGCTCTCG CGCACGCTGA CCAGTGATCG GCCGCGCCTC GGCGCGCTCG ATCCGAAGAA CCCGGCCGAC CGATCGGCGC TCGAGACGCT CGGCGTCAGG GTTTATGATC TGAGCCGGGA GGCTGATATA TTCATCGGCC ACGGCGCCTA TGCGGACGCG CCCGACGCGC TGCGCACCAT CGGCGCGCAG ATCGCCGCGC CGCGGCCGCA AGACTCCAAT GTTCAGGCGG TTCTCGGCGA AAACCCCATC GACGACCGCA TTCACGCCAC GCCCTTGCCG CCGCCGGCCG CTGCAGCGCC TGGCGCGCCC GCCGCCGCTC CGGCCCGCCC CGAGGCGCCG ATCAGTGCGG TGGTCCCGCT TGCGACGGCG ACGCCGGGTT CCGCGACGCC GGCGTCTTCC GCCGCGCCGA CGCCCTGA
|
Protein sequence | MRLTRAARTI SVLARPALLI AGLLALAGCG GGMTDLDGAG AGPRPTSVFV VSTRKGESGP ASELSNDGNE RYSLQMIGAP LNHQIGQLER PSIGSPDPAR HFALQTRRAL DEDGFTAALA THLSGRIGSN RDVLLYVHGF NTSYDESRFR LAQIVADGRF GGVAVLFTWP STNNLLDYGA AKENATISRD ALAKLIRQLT DAPDVGRVHI LAHSMGAWLT MEALRQDFIA GGARLNDKLG DIMLAAPDID LNVFRQQISR LDASHIFVLV AANDRALSLS RTLTSDRPRL GALDPKNPAD RSALETLGVR VYDLSREADI FIGHGAYADA PDALRTIGAQ IAAPRPQDSN VQAVLGENPI DDRIHATPLP PPAAAAPGAP AAAPARPEAP ISAVVPLATA TPGSATPASS AAPTP
|
| |