Gene Msil_3325 details


Gene Information

Locus tag: Msil_3325
Symbol:
ID: 7090821
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylocella silvestris BL2
Kingdom: Bacteria
Replicon accession: NC_011666
Strand:
Start bp: 3656076
End bp: 3659147
Gene Length: 3072 bp
Protein Length: 1023 aa
Translation table: 11
GC content: 62%
IMG OID: 643466632
Product: formate dehydrogenase, alpha subunit
Protein accession: YP_002363593
Protein GI: 217979446
COG category: [C] Energy production and conversion
COG ID: [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
            [TIGR01553] formate dehydrogenase, alpha subunit, proteobacterial-type
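
The start and end coordinates, the gene length, and the protein length in this record are mutually consistent, which a little arithmetic confirms. The sketch below assumes the coordinates are 1-based and inclusive (the record does not state the convention) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the gene record.
start_bp = 3656076
end_bp = 3659147
gene_length_bp = 3072
protein_length_aa = 1023

# Assumption: coordinates are 1-based and inclusive.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons = span // 3
residues = codons - 1

print(span, codons, residues)  # 3072 1024 1023
```

Under these assumptions the 3072 bp span gives 1024 codons, one of which is the terminal stop, matching the listed 1023 aa.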


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value: 0.723924
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACGTCA GTCGTCGCGG CTTTATCAAG CTCACCGGCG CCGGGCTAGC GGCGTCAAGC 
CTCGGGGCGC TTGGCTTCGA CCTCGCCGGC GCAGCGCTCG CCGCCGCAGT GCGGCCGTTC
AAGCTCACGG CGACGACCGA GACGCGAAAC ACCTGCACCT ATTGCTCGGT CGCCTGCGGC
ATTCTCATCT ATAGCATGGG CGACCGCGCC AAGAATGCGC GCTCGGAGAT CATGCACATC
GAGGGCGATC CGGATCATCC GGTCAATCGC GGCACGCTCT GCCCCAAGGG ATCGGCGCTG
CTCGATATCG TGCATTCGCC GGGACGCCTG ACATCGCCTC AATATCGCGC CCCCGGCGCC
AGCGCCTTCA GGCCGGTTTC TTGGGATTTC GCGCTCGGCC GCATCGCCAC GCTGATGAAA
GAGGACCGTG ACGCCAATTT CGTCGCCAAG AATGCGGCGG GAACCACGGT CAACCGCTGG
CTCACGACCG GCATGCTGGC GGCGTCCGCG TCGTCGAGCG AGACGGCGAT GCTGACGTGG
AAAGTCGCCC GATCGTTCGG AATGCTCGTC TTCGACAATC AGGCGCGCGT CTGACACGGA
CCAACGGTGG CCAGTCTGGC TCCAACATTT GGTCGCGGCG CAATGACGAA CACCTGGCAG
GACATCAAGA ACGCTGATGT CGTGCTGGTG ATGGGAGGCA ATGCGGCCGA AGCGCATCCC
TGCGGCTTCA AATGGGTGAT CGAAGCCAAG CTTGAGAACA ACGCCAAGCT GGTGGTCGTC
GATCCGCGCT TTACGCGTAC GGCCTCGGTC GCGGATTTCC ACGCCCCGAT CCGGCCCGGC
ACGGACATCG CCTTCCTCAA CGGCGTCATC CGCTATCTGC TCGAAAAGGA TGCGATCCAG
CACGATTACG TGCGCGCCTA CACCAGCGCC AGCCTGATCG TGAAGGATGG CTTTGGCTTC
GAGGACGGGC TTTTCACCGG CTATAAGGAG GAGACGCGCA GCTACGACAA ATCGAGCTGG
AACTATGATC TCGACGAGCA GGGCTTCGCC CGGATCGACG ACACGTGGCA GGATCCGCGC
TGCGTCATCA ATCTGCTGCG CAAGCATGTC GACAGATATA CGCCCGAGAC GGTGTCGCGA
ATCTGCGGCA CGCCGCAGGA CAAATATCTG AAAGTCTGCG AAATGATCGC CGCGACGGCG
GCGCCGGACA AGGCGCTGAC CAGCCTGTTC GCGCTCGGCT GGACACAGCA TTCGGTTGGC
GCCCAGAACA TCCGGGCGAT GGCGATGGTC CAGCTGCTGC TCGGTAATAT TGGCGTCGCC
GGCGGCGGCA TGAACGCCTT GCGCGGCCAT TCCAATATTC AGGGCCTGAC CGACGTCGGC
CTGCTGTCGA ACCAGATGCC CGGCTATATG ACGCTGCCGA GCGACAAGGA GCTCACCTTC
GAAGACTATA TGAAAACGCG GCAGTTCAAG CCGCTGCGTC CGGCCCAGAC CAGCTATTGG
CAGAACTACC GCAAATTCTT TGTGAGCTTC CAGAAGGCGG TCTATGGCGA GGCGGCGCGC
GCCGACAATG ATTGGGCCTA TGACTGGCTG CCGAAGCTCG ACGTGCCGAT GTATGACATC
ATCCGCGCCT TCGAGATGAT GGCGAACGGT CAGATGAACG GCTACATCTG CCAGGGCTTC
AATCCGCTGC AGGCGTTCCC CGACAAGGGC AAGATCCGCA GGGGGCTGAG CAAGCTGAAG
TTCCTCGTGA CGATGGATCC GCTCGACACC GAGACGTCGC GGTTTTGGGA GAATTTCGGT
CCGCAGAACC CGTCCGACCC GGCCAGCATC GCGACGGAAG TGTTCCAGCT GCCGACGACC
TGCTTCGCCG AGGAAAACGG TTCGCTCGTG AATTCGGCGC GCTGGCTGCA ATGGCACTGG
AAGGCGGCGG ACGGCCCGGG CGAGGCCAAA TCCGATCTTT GGATTATGTC TGGCATCTTC
CATCGCATGC GCGAAATGTA CCGCAAGGAC GGCGGCGCAT TTGCGGATCC GATCTTGAAC
CTGACATGGG ACTATGTCGA TCCGGTCGAG CCGAATCCGG AAGAGCTTGC GAAGGAGATG
AACGGCAAGG CGCTGACCGA GGTGAAGGAT GCGTCGGGCG CCGTCACGCT GAAGGCCGGG
CAGCTGCTCG ACGGCTTCGC GCAATTGCGC GACGACGGGA CGACGGCGTC TGGCTGCTGG
ATCTTTTCGG GGTGCTATAC CGAGAAGGGC AATCAGATGG CGCGCCGCGA CGCGAGCGAC
CCGCGCGAGC AGGGCATTGC GCCAAACTGG GCCTGGGCGT GGCCGGCCAA CCGGCGCATT
CTTTATAATC GCGCCAGCGC CGACGTCGCC GGCAAGGCGT GGAATCCGCA AAAGCCGATC
ATCGAATGGA ACGGCAGCCG ATGGGCGGGC ATCGACGTGC CCGATTATGG CCCGACGGTG
AAGCCGTCGG ACGGCGTCGG TCCGTTCATC ATGAATGCGG AAGGGGTTGG GCGCTTGTTT
GCGCGCGACC AGATGGCGGA GGGACCGTTC CCGGAGCATT ACGAGCCCTT CGAATCGCCT
TCGCCGAATA TTCTGCATCC GAAGGTGCGC TCGAACCCGG CGGCGCGGGT GTTCGCCGAC
GACAGGGCCG CCTTCGGCGA AGCTTCGGAG TTCCCCTATG TGGCGACCAC CTACCGGTTG
ACGGAGCATT TCCACTACTG GACCAAACAT GCGCTGATCA ACGCCATCCT GCAGCCGGAA
GAGTTTATCG AAATCGGCGA AGTTCTGGCG AAGGAAAAAG GCATCGAGCA GGGCGGCTGG
GTCCGTCTTG CCTCGAAGCG CGGCGTCGTC GTTTGCAAGG CCTATGTGAC CAAACGCATC
AAGCCGATGC TGGTCGACGG AAAACCAACG CATGTAATCG GGGTGCCCAT CCATTGGGGC
TTCACGGGGC AGGCCCGTAA GGGCTACGGC GCGAATACGC TGACGCCCTC CGTCGGCGAC
GCCAATACGC AGACGCCGGA ATTCAAGGCG TTCCTTGTCA GTATCGAAAG AACAACCGCC
CCTGTGGCCT GA
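
The gene sequence is printed in space-separated blocks of ten bases, so the whitespace has to be removed before the sequence is used for anything else, such as recomputing the GC content reported in the record. The following is a minimal sketch of that step; the helper names `clean_sequence` and `gc_fraction` are hypothetical and not part of any database tool.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks and return the sequence in upper case."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Return the fraction of G and C bases in a whitespace-formatted sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# Tiny illustration: 6 of the 8 bases are G or C.
print(gc_fraction("ATGC GGCC"))  # 0.75
```

Passing the full gene sequence text to `gc_fraction` should give a value that can be compared with the GC content field; the database may round or compute the figure differently.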
 
Protein sequence
MNVSRRGFIK LTGAGLAASS LGALGFDLAG AALAAAVRPF KLTATTETRN TCTYCSVACG 
ILIYSMGDRA KNARSEIMHI EGDPDHPVNR GTLCPKGSAL LDIVHSPGRL TSPQYRAPGA
SAFRPVSWDF ALGRIATLMK EDRDANFVAK NAAGTTVNRW LTTGMLAASA SSSETAMLTW
KVARSFGMLV FDNQARVUHG PTVASLAPTF GRGAMTNTWQ DIKNADVVLV MGGNAAEAHP
CGFKWVIEAK LENNAKLVVV DPRFTRTASV ADFHAPIRPG TDIAFLNGVI RYLLEKDAIQ
HDYVRAYTSA SLIVKDGFGF EDGLFTGYKE ETRSYDKSSW NYDLDEQGFA RIDDTWQDPR
CVINLLRKHV DRYTPETVSR ICGTPQDKYL KVCEMIAATA APDKALTSLF ALGWTQHSVG
AQNIRAMAMV QLLLGNIGVA GGGMNALRGH SNIQGLTDVG LLSNQMPGYM TLPSDKELTF
EDYMKTRQFK PLRPAQTSYW QNYRKFFVSF QKAVYGEAAR ADNDWAYDWL PKLDVPMYDI
IRAFEMMANG QMNGYICQGF NPLQAFPDKG KIRRGLSKLK FLVTMDPLDT ETSRFWENFG
PQNPSDPASI ATEVFQLPTT CFAEENGSLV NSARWLQWHW KAADGPGEAK SDLWIMSGIF
HRMREMYRKD GGAFADPILN LTWDYVDPVE PNPEELAKEM NGKALTEVKD ASGAVTLKAG
QLLDGFAQLR DDGTTASGCW IFSGCYTEKG NQMARRDASD PREQGIAPNW AWAWPANRRI
LYNRASADVA GKAWNPQKPI IEWNGSRWAG IDVPDYGPTV KPSDGVGPFI MNAEGVGRLF
ARDQMAEGPF PEHYEPFESP SPNILHPKVR SNPAARVFAD DRAAFGEASE FPYVATTYRL
TEHFHYWTKH ALINAILQPE EFIEIGEVLA KEKGIEQGGW VRLASKRGVV VCKAYVTKRI
KPMLVDGKPT HVIGVPIHWG FTGQARKGYG ANTLTPSVGD ANTQTPEFKA FLVSIERTTA
PVA
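
The protein sequence contains a `U` (selenocysteine) at the position where the gene sequence has an in-frame TGA codon, in line with the COG annotation "typically selenocysteine-containing". The sketch below shows one way to reproduce that translation. It uses the standard codon assignments, which translation table 11 shares for elongation codons, and it makes a simplifying assumption: every TGA that is not the final codon is read as selenocysteine. In a real cell, recoding depends on sequence context, so this rule is only valid for a record already known to contain selenocysteine. The function name `translate` and its parameter are hypothetical, and alternative start codons are not handled (this gene starts with ATG).

```python
from itertools import product

# Standard codon assignments in TCAG order (shared by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str, internal_tga: str = "U") -> str:
    """Translate a coding sequence, reading internal TGA codons as `internal_tga`.

    Assumption: any TGA before the last codon encodes selenocysteine.
    Translation stops at the first other stop codon or at a terminal TGA.
    """
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    codons = [seq[i:i + 3] for i in range(0, usable, 3)]
    protein = []
    for index, codon in enumerate(codons):
        aa = CODON_TABLE[codon]
        if aa == "*":
            if codon == "TGA" and index < len(codons) - 1:
                protein.append(internal_tga)
                continue
            break
        protein.append(aa)
    return "".join(protein)


# Start of the gene, and a short in-frame fragment with TGA plus a terminal TGA.
print(translate("ATGAACGTCA GTCGTCGCGG C"))  # MNVSRRG
print(translate("GTCTGACACGGATGA"))          # VUHG
```

The first call reproduces the first seven residues of the protein sequence, and the second mirrors the `VUHG` stretch around the selenocysteine.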