Gene Msil_3648 details


Gene Information

Locus tag: Msil_3648
Symbol:
ID: 7092921
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylocella silvestris BL2
Kingdom: Bacteria
Replicon accession: NC_011666
Strand:
Start bp: 4007651
End bp: 4009381
Gene Length: 1731 bp
Protein Length: 576 aa
Translation table: 11
GC content: 62%
IMG OID: 643466936
Product: transcriptional regulator, NifA, Fis Family
Protein accession: YP_002363895
Protein GI: 217979748
COG category: [K] Transcription; [T] Signal transduction mechanisms
COG ID: [COG3604] Transcriptional regulator containing GAF, AAA-type ATPase, and DNA binding domains
TIGRFAM ID: [TIGR01817] Nif-specific regulatory protein
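
The start, end, gene length, and protein length fields above are related by simple arithmetic. The sketch below checks that relationship, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon. The function names are illustrative and are not part of the database.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
START_BP = 4007651
END_BP = 4009381


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with inclusive start and end coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1  # drop the stop codon


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # prints: 1731 576
```

Both values agree with the Gene Length and Protein Length fields of the record.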


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Fosmid covering clones (Num covering fosmid clones): 38
Fosmid unclonability p-value: 0.4281
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTCATG CACAGACACC TACGGCGCCA CAGCCGATCG GCCGCGCCGC CGATCCGGGG 
GAAACGACTC TCGTCGGCAT TTACGAAATT TCGAAGCTTC TTGCGTCGGT AAACAGGCTT
GAGGTGTCCC TGGCCGGCGT GTTGACGTTG CTTTCCAGCT TTCTCGAGAT GCGGCATGGG
CTGATTGCCC TCCTGGATAA AAACGGCAAG CCGGAAATCG TCGTCGGCTC CGGTTGGTCG
GAACAGAATG CGAAGCTTTA TTTCGACCGC CTGCCGGAAC GGGCGATCGG CCAGATCGTC
ACCACCAAAA TGCCGCTGGT CGTTGAAAAT GTTTACGCTT CGCCCCTGTT CGAAGGGTCG
GACCTCACCG GCTGGGGACG GGCGGATGGC GAGCCCTTCG CTCTGATCGG CGTTCCAATC
AAGGATGGCG ATGAAGTTAT AGGCGCCTTG ACTGTCGATC GTAACAATAC AAACCGCACC
AGCGTCAGAT TCGATCATGA TGTGCGTTTT TTGACCATGA TCGCCAATCT GGTCGGCCAG
ACCCTCCGCC TGCAAAAGCT CGTCGCGCGC GATCGCGAGC GGCTGATGCA GGAAAATGCG
CGGCTCGAAA AAAGCGCCCG TCCTCGCTCG CCCGAGACGC GATTCAGCGG CATCGAGGGC
ATCGTCGGCG ACAGCCCGGC CGTGCGCGCC GTCGTCAAGA AGATCCGGAT CGTCGCAAAG
AGCCGTTCGA CCGTGCTGCT GCGCGGCGAA TCCGGCACCG GCAAGGAGCT TTTTGCGGCC
GCGATCCATA ATCTGTCGCC GCGCAGCGGC AAGCCTTTCA TCAAGCTCAA TTGCGCGGCT
CTGCCGGAGA GCGTGCTGGA GTCGGAGCTG TTCGGTCACG AACGCGGCGC GTTCACGGGG
GCCCTCGCGA CGCGCAAGGG CCGGTTCGAA CTGGCTGACG GCGGCACGCT GTTCCTCGAC
GAGATCGGCG ATATTTCACC CGCGTTCCAG GTCAAATTGC TGCGCGTGCT GCAGGAGGGC
GAATTCGAAC GGGTCGGCGG CGCGCGTCCG CTGAAGGTCG ATGTGCGGCT GGTTTGCGCC
ACCAACAAAA ACCTCGAGGA CGCCGTCAAG CGCGGCGAAT TCCGCGCCGA CCTTTATTAT
CGCATCACGG TGGTTCCGAT CTTTCTGCCG CCGCTGCGCG AGCGCGAAGG CGACATTCTG
CCGCTCGCCA ATGAGTTTCT GCACCGCTTC AACAGCGAGC AGAAAACCGA CCTTGTGTTG
ACGGCCTCTG CGATCGCCGT GCTGAAGGAA TGCAAATTCC CCGGCAATAT TCGCGAACTC
GAAAATTGCG TGCGCCGGAC GGCGACGATG GCGCCGGGCG ACGAGATCGA GCAAAACGAT
TTCGCCTGCC ACAATGATGG TTGCCTGTCA GCGATCTTGT GGAAAGGGTC CGACGCGCCG
CAAGTCAGCC ACAGGCACGT CGAGGCTCCT GTAGGCCCGG CGCGACTGCC GCCGGTCGAG
ACGGCGCGCG ACATCCGCCC GCCCGACGAC GCCGCAGCGC CTCCCCATTT GGCCGATGGC
GCCTTGCCGC CGGCGGGAGA GGGGGCGTTC CGGTCGGACC GCGAGCGGAT CGTCGACGCT
ATGGAGCGCG CCGGCTGGGT CAAGGCCAAA GCCGCGCGCG TGCTGGGTAT TACGCCAAGG
CAGATCGGCT ATGCGCTGAG AAAGCACAAT ATACGCGTGA AGAAATTCTA A
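
The gene sequence is printed in space-separated blocks of 10 bases, so it needs to be cleaned before any calculation. The following is a minimal sketch of how the GC content field could be recomputed from the listing. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the demo uses only the first two blocks rather than the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks, and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two 10-base blocks of the gene sequence, used as a small demo.
demo = clean_sequence("ATGATTCATG CACAGACACC")
print(len(demo), gc_fraction(demo))  # prints: 20 0.45
```

Passing the complete listing to `clean_sequence` and then to `gc_fraction` would give the value to compare against the GC content field.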
 
Protein sequence
MIHAQTPTAP QPIGRAADPG ETTLVGIYEI SKLLASVNRL EVSLAGVLTL LSSFLEMRHG 
LIALLDKNGK PEIVVGSGWS EQNAKLYFDR LPERAIGQIV TTKMPLVVEN VYASPLFEGS
DLTGWGRADG EPFALIGVPI KDGDEVIGAL TVDRNNTNRT SVRFDHDVRF LTMIANLVGQ
TLRLQKLVAR DRERLMQENA RLEKSARPRS PETRFSGIEG IVGDSPAVRA VVKKIRIVAK
SRSTVLLRGE SGTGKELFAA AIHNLSPRSG KPFIKLNCAA LPESVLESEL FGHERGAFTG
ALATRKGRFE LADGGTLFLD EIGDISPAFQ VKLLRVLQEG EFERVGGARP LKVDVRLVCA
TNKNLEDAVK RGEFRADLYY RITVVPIFLP PLREREGDIL PLANEFLHRF NSEQKTDLVL
TASAIAVLKE CKFPGNIREL ENCVRRTATM APGDEIEQND FACHNDGCLS AILWKGSDAP
QVSHRHVEAP VGPARLPPVE TARDIRPPDD AAAPPHLADG ALPPAGEGAF RSDRERIVDA
MERAGWVKAK AARVLGITPR QIGYALRKHN IRVKKF
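
The protein sequence can be cross-checked against the gene sequence by translating codons. The sketch below builds the codon table from the compact NCBI-style amino acid string. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares (table 11 differs only in the alternative start codons it permits). The function name `translate` is illustrative, and ambiguous bases such as N are not handled.

```python
BASES = "TCAG"
# Amino acids for codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG ("*" = stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(raw: str) -> str:
    """Translate a nucleotide listing, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()  # drop block spacing and line breaks
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]  # KeyError on non-ACGT bases
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues.
print(translate("ATGATTCATG CACAGACACC TACGGCGCCA"))  # prints: MIHAQTPTAP
```

The output matches the first block of the protein sequence listed above.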