Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msil_3648 |
Symbol | |
ID | 7092921 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylocella silvestris BL2 |
Kingdom | Bacteria |
Replicon accession | NC_011666 |
Strand | + |
Start bp | 4007651 |
End bp | 4009381 |
Gene Length | 1731 bp |
Protein Length | 576 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 643466936 |
Product | transcriptional regulator, NifA, Fis Family |
Protein accession | YP_002363895 |
Protein GI | 217979748 |
COG category | [K] Transcription [T] Signal transduction mechanisms |
COG ID | [COG3604] Transcriptional regulator containing GAF, AAA-type ATPase, and DNA binding domains |
TIGRFAM ID | [TIGR01817] Nif-specific regulatory protein |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 0.4281 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTCATG CACAGACACC TACGGCGCCA CAGCCGATCG GCCGCGCCGC CGATCCGGGG GAAACGACTC TCGTCGGCAT TTACGAAATT TCGAAGCTTC TTGCGTCGGT AAACAGGCTT GAGGTGTCCC TGGCCGGCGT GTTGACGTTG CTTTCCAGCT TTCTCGAGAT GCGGCATGGG CTGATTGCCC TCCTGGATAA AAACGGCAAG CCGGAAATCG TCGTCGGCTC CGGTTGGTCG GAACAGAATG CGAAGCTTTA TTTCGACCGC CTGCCGGAAC GGGCGATCGG CCAGATCGTC ACCACCAAAA TGCCGCTGGT CGTTGAAAAT GTTTACGCTT CGCCCCTGTT CGAAGGGTCG GACCTCACCG GCTGGGGACG GGCGGATGGC GAGCCCTTCG CTCTGATCGG CGTTCCAATC AAGGATGGCG ATGAAGTTAT AGGCGCCTTG ACTGTCGATC GTAACAATAC AAACCGCACC AGCGTCAGAT TCGATCATGA TGTGCGTTTT TTGACCATGA TCGCCAATCT GGTCGGCCAG ACCCTCCGCC TGCAAAAGCT CGTCGCGCGC GATCGCGAGC GGCTGATGCA GGAAAATGCG CGGCTCGAAA AAAGCGCCCG TCCTCGCTCG CCCGAGACGC GATTCAGCGG CATCGAGGGC ATCGTCGGCG ACAGCCCGGC CGTGCGCGCC GTCGTCAAGA AGATCCGGAT CGTCGCAAAG AGCCGTTCGA CCGTGCTGCT GCGCGGCGAA TCCGGCACCG GCAAGGAGCT TTTTGCGGCC GCGATCCATA ATCTGTCGCC GCGCAGCGGC AAGCCTTTCA TCAAGCTCAA TTGCGCGGCT CTGCCGGAGA GCGTGCTGGA GTCGGAGCTG TTCGGTCACG AACGCGGCGC GTTCACGGGG GCCCTCGCGA CGCGCAAGGG CCGGTTCGAA CTGGCTGACG GCGGCACGCT GTTCCTCGAC GAGATCGGCG ATATTTCACC CGCGTTCCAG GTCAAATTGC TGCGCGTGCT GCAGGAGGGC GAATTCGAAC GGGTCGGCGG CGCGCGTCCG CTGAAGGTCG ATGTGCGGCT GGTTTGCGCC ACCAACAAAA ACCTCGAGGA CGCCGTCAAG CGCGGCGAAT TCCGCGCCGA CCTTTATTAT CGCATCACGG TGGTTCCGAT CTTTCTGCCG CCGCTGCGCG AGCGCGAAGG CGACATTCTG CCGCTCGCCA ATGAGTTTCT GCACCGCTTC AACAGCGAGC AGAAAACCGA CCTTGTGTTG ACGGCCTCTG CGATCGCCGT GCTGAAGGAA TGCAAATTCC CCGGCAATAT TCGCGAACTC GAAAATTGCG TGCGCCGGAC GGCGACGATG GCGCCGGGCG ACGAGATCGA GCAAAACGAT TTCGCCTGCC ACAATGATGG TTGCCTGTCA GCGATCTTGT GGAAAGGGTC CGACGCGCCG CAAGTCAGCC ACAGGCACGT CGAGGCTCCT GTAGGCCCGG CGCGACTGCC GCCGGTCGAG ACGGCGCGCG ACATCCGCCC GCCCGACGAC GCCGCAGCGC CTCCCCATTT GGCCGATGGC GCCTTGCCGC CGGCGGGAGA GGGGGCGTTC CGGTCGGACC GCGAGCGGAT CGTCGACGCT ATGGAGCGCG CCGGCTGGGT CAAGGCCAAA GCCGCGCGCG TGCTGGGTAT TACGCCAAGG CAGATCGGCT ATGCGCTGAG AAAGCACAAT ATACGCGTGA AGAAATTCTA A
|
Protein sequence | MIHAQTPTAP QPIGRAADPG ETTLVGIYEI SKLLASVNRL EVSLAGVLTL LSSFLEMRHG LIALLDKNGK PEIVVGSGWS EQNAKLYFDR LPERAIGQIV TTKMPLVVEN VYASPLFEGS DLTGWGRADG EPFALIGVPI KDGDEVIGAL TVDRNNTNRT SVRFDHDVRF LTMIANLVGQ TLRLQKLVAR DRERLMQENA RLEKSARPRS PETRFSGIEG IVGDSPAVRA VVKKIRIVAK SRSTVLLRGE SGTGKELFAA AIHNLSPRSG KPFIKLNCAA LPESVLESEL FGHERGAFTG ALATRKGRFE LADGGTLFLD EIGDISPAFQ VKLLRVLQEG EFERVGGARP LKVDVRLVCA TNKNLEDAVK RGEFRADLYY RITVVPIFLP PLREREGDIL PLANEFLHRF NSEQKTDLVL TASAIAVLKE CKFPGNIREL ENCVRRTATM APGDEIEQND FACHNDGCLS AILWKGSDAP QVSHRHVEAP VGPARLPPVE TARDIRPPDD AAAPPHLADG ALPPAGEGAF RSDRERIVDA MERAGWVKAK AARVLGITPR QIGYALRKHN IRVKKF
|
| |