Gene Information |
Locus tag | Msil_1754 |
Symbol | |
ID | 7090866 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylocella silvestris BL2 |
Kingdom | Bacteria |
Replicon accession | NC_011666 |
Strand | + |
Start bp | 1908534 |
End bp | 1909460 |
Gene Length | 927 bp |
Protein Length | 308 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 643465077 |
Product | transcriptional regulator, AraC family |
Protein accession | YP_002362062 |
Protein GI | 217977915 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
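The coordinate and length fields in the record above are mutually consistent, which a short sketch can confirm. The function names are hypothetical and not part of the source database. The sketch assumes that Start bp and End bp are 1-based, inclusive coordinates and that the unspliced CDS ends in a single stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record for Msil_1754.
length = gene_length(1908534, 1909460)
print(length)                  # 927
print(protein_length(length))  # 308
```

Both results match the Gene Length (927 bp) and Protein Length (308 aa) fields listed for this locus.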
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 54 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTGGAG AGGAGTTCCT TCGGACGTCC TCTCAGGAAG CTCACCTTTC ATGGGGGGCA GGCTCGGTTC TCAGACGAAC TCAGGACGTG GACGCGCGAT GTGAGCGCAA TATGAAGCGC GACGATGTCG CAATTGCGAT CTGCTTTCAG GACGCCGGCA GCGAGGTGAA TTGGCGTCTC GACGGCAAAC AGACCCTGGC GAAAACCTAC GCCTCAGCGA TTGAATCGCG CGACATGTTC ATCGCCCCTC CAGGCTGCGA GATCGCCGTG CGTTGCCGCG GCAAGGGCGA AGGCCTGTGG CTTTTTCTTG AGCCGGAATT CGTCAATTCA GATCGCCGCA TTAAGTCCCT GGTCGAAAAA GCGGCTGTCT TCGACCATTC CTGGACGAAG GATCGGATGT GCTGGATGGT CGCCTCGGAG CTCCGAAAAG AATGCCAGAA CGGCTTTCCG CGGGGGCCGA TGTTCTTTGA AAGCGCAGCG ACGGTTTTCG TCGCGCAGCT CGCCTATTTT CTTCATGACG CCGATCTGAC GCCAGAACCG ACCCGCTCGC TTTCCGACGC CAAGCTGCGG CTGGTGACCG GTTACATCGA AGCCAATCTT CACCGCAATA TTACGCTGTC GGAGCTTTCG GCGCTGGTCG ATCTGACGCC GCGCTATTTT TGCGGCGCCT TCAAGGAGGC CACAGGCCGG CCGCCGCATC AATACCAGAT CGAGCAGCGC GTGGAGCAGG CGAAAAAGCT GCTCGCCGAG GCCGGATCTT CGCTGATCGA AATCGCCCTG ACGGTCGGCT TCAGCAGCCA AAGCCATCTC AATGAATATT TTCGCCGGAT CGTCGGAATG ACCCCTGCGC GCTACCGCAA CGAGGAGCTT CAAGTCCGCA GCGGCGCGTC GGCGCGCAGC CGGAGGCCGA ACTCGACCGC CAGCTAA
|
Protein sequence | MGGEEFLRTS SQEAHLSWGA GSVLRRTQDV DARCERNMKR DDVAIAICFQ DAGSEVNWRL DGKQTLAKTY ASAIESRDMF IAPPGCEIAV RCRGKGEGLW LFLEPEFVNS DRRIKSLVEK AAVFDHSWTK DRMCWMVASE LRKECQNGFP RGPMFFESAA TVFVAQLAYF LHDADLTPEP TRSLSDAKLR LVTGYIEANL HRNITLSELS ALVDLTPRYF CGAFKEATGR PPHQYQIEQR VEQAKKLLAE AGSSLIEIAL TVGFSSQSHL NEYFRRIVGM TPARYRNEEL QVRSGASARS RRPNSTAS
|
| |
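The sequences in this record are printed in space-separated groups of ten characters, so they need cleaning before any computation. The following is a minimal sketch of how the gene sequence could be cleaned, checked for GC content, and translated. All names (`clean`, `gc_content`, `translate`) are hypothetical helpers written for illustration, not tools provided by the source database. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code and differs only in the permitted start codons.

```python
# Codon table built in the conventional T, C, A, G base order.
# Assumption: table 11 uses the standard codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the grouping whitespace and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Raises KeyError on codons containing characters other than A, C, G, T.
    """
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups of the gene sequence in the record.
prefix = "ATGGGTGGAG AGGAGTTCCT TCGGACGTCC"
print(translate(prefix))  # MGGEEFLRTS
```

The printed residues match the first ten residues of the protein sequence listed above. Passing the full gene sequence to `translate` and `gc_content` would allow the same comparison against the complete protein sequence and the listed GC content of 60%.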