Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msil_3325 |
Symbol | |
ID | 7090821 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylocella silvestris BL2 |
Kingdom | Bacteria |
Replicon accession | NC_011666 |
Strand | + |
Start bp | 3656076 |
End bp | 3659147 |
Gene Length | 3072 bp |
Protein Length | 1023 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 643466632 |
Product | formate dehydrogenase, alpha subunit |
Protein accession | YP_002363593 |
Protein GI | 217979446 |
COG category | [C] Energy production and conversion |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence [TIGR01553] formate dehydrogenase, alpha subunit, proteobacterial-type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 0.723924 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGTCA GTCGTCGCGG CTTTATCAAG CTCACCGGCG CCGGGCTAGC GGCGTCAAGC CTCGGGGCGC TTGGCTTCGA CCTCGCCGGC GCAGCGCTCG CCGCCGCAGT GCGGCCGTTC AAGCTCACGG CGACGACCGA GACGCGAAAC ACCTGCACCT ATTGCTCGGT CGCCTGCGGC ATTCTCATCT ATAGCATGGG CGACCGCGCC AAGAATGCGC GCTCGGAGAT CATGCACATC GAGGGCGATC CGGATCATCC GGTCAATCGC GGCACGCTCT GCCCCAAGGG ATCGGCGCTG CTCGATATCG TGCATTCGCC GGGACGCCTG ACATCGCCTC AATATCGCGC CCCCGGCGCC AGCGCCTTCA GGCCGGTTTC TTGGGATTTC GCGCTCGGCC GCATCGCCAC GCTGATGAAA GAGGACCGTG ACGCCAATTT CGTCGCCAAG AATGCGGCGG GAACCACGGT CAACCGCTGG CTCACGACCG GCATGCTGGC GGCGTCCGCG TCGTCGAGCG AGACGGCGAT GCTGACGTGG AAAGTCGCCC GATCGTTCGG AATGCTCGTC TTCGACAATC AGGCGCGCGT CTGACACGGA CCAACGGTGG CCAGTCTGGC TCCAACATTT GGTCGCGGCG CAATGACGAA CACCTGGCAG GACATCAAGA ACGCTGATGT CGTGCTGGTG ATGGGAGGCA ATGCGGCCGA AGCGCATCCC TGCGGCTTCA AATGGGTGAT CGAAGCCAAG CTTGAGAACA ACGCCAAGCT GGTGGTCGTC GATCCGCGCT TTACGCGTAC GGCCTCGGTC GCGGATTTCC ACGCCCCGAT CCGGCCCGGC ACGGACATCG CCTTCCTCAA CGGCGTCATC CGCTATCTGC TCGAAAAGGA TGCGATCCAG CACGATTACG TGCGCGCCTA CACCAGCGCC AGCCTGATCG TGAAGGATGG CTTTGGCTTC GAGGACGGGC TTTTCACCGG CTATAAGGAG GAGACGCGCA GCTACGACAA ATCGAGCTGG AACTATGATC TCGACGAGCA GGGCTTCGCC CGGATCGACG ACACGTGGCA GGATCCGCGC TGCGTCATCA ATCTGCTGCG CAAGCATGTC GACAGATATA CGCCCGAGAC GGTGTCGCGA ATCTGCGGCA CGCCGCAGGA CAAATATCTG AAAGTCTGCG AAATGATCGC CGCGACGGCG GCGCCGGACA AGGCGCTGAC CAGCCTGTTC GCGCTCGGCT GGACACAGCA TTCGGTTGGC GCCCAGAACA TCCGGGCGAT GGCGATGGTC CAGCTGCTGC TCGGTAATAT TGGCGTCGCC GGCGGCGGCA TGAACGCCTT GCGCGGCCAT TCCAATATTC AGGGCCTGAC CGACGTCGGC CTGCTGTCGA ACCAGATGCC CGGCTATATG ACGCTGCCGA GCGACAAGGA GCTCACCTTC GAAGACTATA TGAAAACGCG GCAGTTCAAG CCGCTGCGTC CGGCCCAGAC CAGCTATTGG CAGAACTACC GCAAATTCTT TGTGAGCTTC CAGAAGGCGG TCTATGGCGA GGCGGCGCGC GCCGACAATG ATTGGGCCTA TGACTGGCTG CCGAAGCTCG ACGTGCCGAT GTATGACATC ATCCGCGCCT TCGAGATGAT GGCGAACGGT CAGATGAACG GCTACATCTG CCAGGGCTTC AATCCGCTGC AGGCGTTCCC CGACAAGGGC AAGATCCGCA GGGGGCTGAG CAAGCTGAAG TTCCTCGTGA CGATGGATCC GCTCGACACC GAGACGTCGC GGTTTTGGGA GAATTTCGGT CCGCAGAACC CGTCCGACCC GGCCAGCATC GCGACGGAAG TGTTCCAGCT GCCGACGACC TGCTTCGCCG AGGAAAACGG TTCGCTCGTG AATTCGGCGC GCTGGCTGCA ATGGCACTGG AAGGCGGCGG ACGGCCCGGG CGAGGCCAAA TCCGATCTTT GGATTATGTC TGGCATCTTC CATCGCATGC GCGAAATGTA CCGCAAGGAC GGCGGCGCAT TTGCGGATCC GATCTTGAAC CTGACATGGG ACTATGTCGA TCCGGTCGAG CCGAATCCGG AAGAGCTTGC GAAGGAGATG AACGGCAAGG CGCTGACCGA GGTGAAGGAT GCGTCGGGCG CCGTCACGCT GAAGGCCGGG CAGCTGCTCG ACGGCTTCGC GCAATTGCGC GACGACGGGA CGACGGCGTC TGGCTGCTGG ATCTTTTCGG GGTGCTATAC CGAGAAGGGC AATCAGATGG CGCGCCGCGA CGCGAGCGAC CCGCGCGAGC AGGGCATTGC GCCAAACTGG GCCTGGGCGT GGCCGGCCAA CCGGCGCATT CTTTATAATC GCGCCAGCGC CGACGTCGCC GGCAAGGCGT GGAATCCGCA AAAGCCGATC ATCGAATGGA ACGGCAGCCG ATGGGCGGGC ATCGACGTGC CCGATTATGG CCCGACGGTG AAGCCGTCGG ACGGCGTCGG TCCGTTCATC ATGAATGCGG AAGGGGTTGG GCGCTTGTTT GCGCGCGACC AGATGGCGGA GGGACCGTTC CCGGAGCATT ACGAGCCCTT CGAATCGCCT TCGCCGAATA TTCTGCATCC GAAGGTGCGC TCGAACCCGG CGGCGCGGGT GTTCGCCGAC GACAGGGCCG CCTTCGGCGA AGCTTCGGAG TTCCCCTATG TGGCGACCAC CTACCGGTTG ACGGAGCATT TCCACTACTG GACCAAACAT GCGCTGATCA ACGCCATCCT GCAGCCGGAA GAGTTTATCG AAATCGGCGA AGTTCTGGCG AAGGAAAAAG GCATCGAGCA GGGCGGCTGG GTCCGTCTTG CCTCGAAGCG CGGCGTCGTC GTTTGCAAGG CCTATGTGAC CAAACGCATC AAGCCGATGC TGGTCGACGG AAAACCAACG CATGTAATCG GGGTGCCCAT CCATTGGGGC TTCACGGGGC AGGCCCGTAA GGGCTACGGC GCGAATACGC TGACGCCCTC CGTCGGCGAC GCCAATACGC AGACGCCGGA ATTCAAGGCG TTCCTTGTCA GTATCGAAAG AACAACCGCC CCTGTGGCCT GA
|
Protein sequence | MNVSRRGFIK LTGAGLAASS LGALGFDLAG AALAAAVRPF KLTATTETRN TCTYCSVACG ILIYSMGDRA KNARSEIMHI EGDPDHPVNR GTLCPKGSAL LDIVHSPGRL TSPQYRAPGA SAFRPVSWDF ALGRIATLMK EDRDANFVAK NAAGTTVNRW LTTGMLAASA SSSETAMLTW KVARSFGMLV FDNQARVUHG PTVASLAPTF GRGAMTNTWQ DIKNADVVLV MGGNAAEAHP CGFKWVIEAK LENNAKLVVV DPRFTRTASV ADFHAPIRPG TDIAFLNGVI RYLLEKDAIQ HDYVRAYTSA SLIVKDGFGF EDGLFTGYKE ETRSYDKSSW NYDLDEQGFA RIDDTWQDPR CVINLLRKHV DRYTPETVSR ICGTPQDKYL KVCEMIAATA APDKALTSLF ALGWTQHSVG AQNIRAMAMV QLLLGNIGVA GGGMNALRGH SNIQGLTDVG LLSNQMPGYM TLPSDKELTF EDYMKTRQFK PLRPAQTSYW QNYRKFFVSF QKAVYGEAAR ADNDWAYDWL PKLDVPMYDI IRAFEMMANG QMNGYICQGF NPLQAFPDKG KIRRGLSKLK FLVTMDPLDT ETSRFWENFG PQNPSDPASI ATEVFQLPTT CFAEENGSLV NSARWLQWHW KAADGPGEAK SDLWIMSGIF HRMREMYRKD GGAFADPILN LTWDYVDPVE PNPEELAKEM NGKALTEVKD ASGAVTLKAG QLLDGFAQLR DDGTTASGCW IFSGCYTEKG NQMARRDASD PREQGIAPNW AWAWPANRRI LYNRASADVA GKAWNPQKPI IEWNGSRWAG IDVPDYGPTV KPSDGVGPFI MNAEGVGRLF ARDQMAEGPF PEHYEPFESP SPNILHPKVR SNPAARVFAD DRAAFGEASE FPYVATTYRL TEHFHYWTKH ALINAILQPE EFIEIGEVLA KEKGIEQGGW VRLASKRGVV VCKAYVTKRI KPMLVDGKPT HVIGVPIHWG FTGQARKGYG ANTLTPSVGD ANTQTPEFKA FLVSIERTTA PVA
|
| |