Gene Msil_3629 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_3629 
Symbol 
ID7092902 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp3991820 
End bp3993268 
Gene Length1449 bp 
Protein Length482 aa 
Translation table11 
GC content60% 
IMG OID643466917 
Productnitrogenase MoFe cofactor biosynthesis protein NifE 
Protein accessionYP_002363876 
Protein GI217979729 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID[TIGR01283] nitrogenase molybdenum-iron cofactor biosynthesis protein NifE 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value0.205847 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCAGTT TAGCGAACAA GATCCAGGAC GTTTTCAACG AGCCAGGCTG CGACAAGAAT 
CAGGGCAAAT CCGACAAGGA ACGCAAGAAG GGCTGCACCA AGCAATTGCA GCCGGGGGCC
GCTGCCGGCG GCTGCGCCTT CGACGGCGCC AAAATCGCGC TGCAGCCGAT CACCGACGTC
GCCCACCTCG TTCACGGCCC GATCGCCTGC GAGGGCAATT CATGGGACAA CCGAGGCGCC
AAATCCTCTG GATCTCAGCT TTACCGAACC GGCTTTACGA CCGACATCAA CGAGACGGAC
GTCATCTTCG GCGGCGAGAA GCGGCTGTTC AAAGCGATCA AGGAAATTAT CGACAAATAT
GATCCGCCGG CCGTGTTCGT CTATCAGACC TGCGTGCCCG CGATGATCGG CGATGATATC
GGCGCCGTTT GCAAGGCCGC CGCCGCCAAA TTCAACAAGC CCGTCATCCC CGTCATTTCG
CCAGGTTTCG TCGGCCCGAA AAATCTCGGC AACAAGCTCG CCGGCGAGGC CATCCTCGAT
CATGTGATCG GCACGATGGA GCCCGAGTAC ACGACGCCCT ACGACATCAA CATCATTGGC
GAATATAATC TCTCCGGCGA ATTGTGGCAG GTGAAGCCGC TGTTCGACGA ACTCGGCATT
CGCATTTTGT CCTGCATCTC GGGCGACGCC AAATATAAGG AAGTCGCCTG GTCGCATCGC
GCCAAAGCCT CGATGATGGT CTGCTCCAAG GCGATGATCA ACGTCGCCCG CAAGATGGAG
GAGCGCTACG ACATTCCCTT CTTCGAGGGC TCTTTCTACG GCATCGAGGA CACCAGCGAC
TCGCTGCGCG AGATCGCCCG TCTGCTGATC GAAAAAGGCG CCCCGGCCGA GCTGATGGAG
CGCACCGAGG CGGTGATCGC CCGCGAGGAA GCGCTCGCCT GGAAGAGCAT CGAGCCCTAT
CGGGCGCGGC TCGCCGGCAA GCGCGTGCTG CTCATCACGG GCGGCGTCAA ATCCTGGTCG
GTCGTCGCCG CGCTGCAGGA AGCCGGATGT GAAATCGTCG GCACCAGCGT CAAGAAGTCG
ACCAAAGAGG ACAAGGAAAA GATCAAGGAG TTGATGGGCC AGGACGCCCA TATGATCGAC
GATATGACGC CGCGCGAAAT GTACAAGATG CTGAAGGACG CGAAAGCCGA CATCATGCTT
TCCGGCGGCC GTTCGCAGTT CATCTCGCTG AAGGCGAAAA TGCCCTGGCT CGACATCAAC
CAGGAGCGCC ACCACGCCTA TATGGGCTAT GTCGGCATGG CCGAGCTCGT CAAGGAGATC
GACAAGGCGC TCTACAATCC CGTGTGGGAA CAGGCGCGCC GCGCCGCCCC CTGGGAGACG
AAGCCTTCGG AAATGTTTTC GGAGCCCGAG CCGGAACTTG CGGCGCCAAC AGCGCTCGCG
GCGGAATAG
 
Protein sequence
MTSLANKIQD VFNEPGCDKN QGKSDKERKK GCTKQLQPGA AAGGCAFDGA KIALQPITDV 
AHLVHGPIAC EGNSWDNRGA KSSGSQLYRT GFTTDINETD VIFGGEKRLF KAIKEIIDKY
DPPAVFVYQT CVPAMIGDDI GAVCKAAAAK FNKPVIPVIS PGFVGPKNLG NKLAGEAILD
HVIGTMEPEY TTPYDINIIG EYNLSGELWQ VKPLFDELGI RILSCISGDA KYKEVAWSHR
AKASMMVCSK AMINVARKME ERYDIPFFEG SFYGIEDTSD SLREIARLLI EKGAPAELME
RTEAVIAREE ALAWKSIEPY RARLAGKRVL LITGGVKSWS VVAALQEAGC EIVGTSVKKS
TKEDKEKIKE LMGQDAHMID DMTPREMYKM LKDAKADIML SGGRSQFISL KAKMPWLDIN
QERHHAYMGY VGMAELVKEI DKALYNPVWE QARRAAPWET KPSEMFSEPE PELAAPTALA
AE