Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msil_3656 |
Symbol | |
ID | 7092929 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylocella silvestris BL2 |
Kingdom | Bacteria |
Replicon accession | NC_011666 |
Strand | + |
Start bp | 4014674 |
End bp | 4017520 |
Gene Length | 2847 bp |
Protein Length | 948 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643466944 |
Product | formate dehydrogenase, alpha subunit |
Protein accession | YP_002363903 |
Protein GI | 217979756 |
COG category | [R] General function prediction only |
COG ID | [COG3383] Uncharacterized anaerobic dehydrogenase |
TIGRFAM ID | [TIGR01591] formate dehydrogenase, alpha subunit, archaeal-type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 0.480327 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTTTGA TCACCGAGAT CGATTTCGGC ACTCCCGCCC CAAAATCCGA GAAAATGGTC ACGCTGACGA TCGACGGGGT CGAAGTCAGC GCGCCCGAGG GGACCTCCAT CATGCGCGCG GCGATGGAGA TGGGCACGGA AATCCCAAAA CTTTGCGCGA CCGATTCGAT TGAGGCGTTC GGCTCGTGCC GGCTCTGCCT TGTCGAAATT GAGGGGCGCA ACGGCACGCC GGCCTCCTGC ACGACGCCGG TCGCGCCCGG CATGTCGGTC AAGACGCAGA CGCCGCGGCT GAAGGCCCTG CGCAAGGGCG TGATGGAGCT TTATATCTCC GACCATCCGC TCGACTGCCT AACCTGCTCG GCCAATGGCG ACTGCGAATT GCAGACGCAG GCTGGCGCGG TCGGTCTGCG CGACGTCCGC TACGGCTATG AGGGCGCCAA TCATTTCGAC AGCGTCAAGG ATCTGTCCAA TCCCTATTTC GCCTATGACC CGGCCAAATG CATTGTCTGC AACCGCTGCG TGCGCGCCTG CGAGGAAGTG CAGGGCACCT TCGCGCTGAC CATTGACGGG CGCGGCTTCG GCAGCCGCGT CGCCGCCGGC CAGGACGAGC CTTTCCTTGA GTCGGAATGC GTCTCCTGCG GCGCCTGCGT TCAAGCTTGC CCCACCGCGA CGCTGATGGA AAAATCGGTG ATCGAGATCG GCCTGCCGGA GCATTCGGCG GTGACGACCT GCGCCTATTG CGGCGTCGGC TGCACCTTCA AGGCGGAAAT GCGCGGCGAG GAGCTCGTCC GCATGGTCCC GTATAAGGAC GGCAAGGCCA ATCACGGCCA TTCCTGCGTC AAGGGACGGT TCGCCTATGG CTATGCGACG CATCGCGATC GCATCCTGAA GCCGATGATC CGCGCGTCGA TCGAGGATCC GTGGCGGGAG GTGTCCTGGG ACGAGGCGAT CGGCTACGCC GCGTCACAGT TCAGGCGCAT CCAGGCGGAA CACGGCAAGC AGGCGATCGG CGGCATCACC TCCTCGCGCT GCACCAATGA AGAGACCTAT CTCGTCCAGA AGCTGATCCG CGCCGGCTTC GGCAACAACA ATGTCGACAC CTGCGCGCGC GTCTGTCATT CGCCGACAGG CTATGGCCTC AAGACCGCCT TCGGCACCTC GGCGGGCACG CAGGATTTCG ATTCGATCGA AGAGACGGAC GTGATGCTGG TGATCGGCGC CAATCCGACC GACGGGCATC CGGTGTTCGG TTCGCGCATG AAGAAGCGTC TGCGCGCCGG CGCGAAGCTG ATCGTCATCG ATCCGCGCCG CATCGATCTG GTTGAAACGC CGCATGTCAA GGCGGATTTC CATTTGCCGC TGCAGCCGGG CTGCAATGTT TCAGTCGTGA CCTCGCTCGC CCATGTCATT GTGACCGAAG GCCTCGTCAA CGAAGCCTTC GTGCGCGAGC GCTGCGACTG GTCGGAATTT CAGGATTGGG CGGCTTTCGT CTCCGAGCCG CGCCATAGCC CCGAAGAGGT CGAGAAGATT TCAGGCGTGC CCGCCGGTCT CATCCGCGGC GCCGCCAGGC TTTACGCGAC CGGCGGCAAC GCCGCGATCT TCTATGGCCT CGGCGTCACG GAGCACAGCC AGGGATCGAC GACCGTGCTG GCGATCGCAA ACCTCGCCAT GGCGACCGGA AATCTCGGCC GGCCGGGCGT CGGCGTCAAT CCGCTGCGCG GCCAGAACAA TGTTCAGGGC GCATGCGACA TGGGCTCGTT CCCGCATGAA TTCGCGGGCT ATCGCCATGT GTCGGACACG GCCGCGCGCG ACGTTTATGA AAGCCTCTGG GGCGTCGAAC TCGAGCACGA GCCGGGCCTG CGCATCCCCA ATATGCTCGA CGCCGCCGTC GAGGGGACGT TCCGCGGCAT CTACATCCAG GGCGAGGACA TTCTGCAGTC GGACCCCGAC ACGCATCACG TCGCCTCGGG CCTCAAGGCG ATGGACTGCG TCATCGTCCA GGATCTTTTC CTGAACGAGA CGGCGAATTA CGCCCATGTC TTCCTGCCGG GCTCGACCTT CCTCGAAAAG GACGGCACCT TCACCAACGC CGAGCGCCGC ATCCAGCGCG TGCGCAAGGT GATGAGCCCG AAGAACGGCT ATGGCGATTG GGAGATCACC ATGCTGCTCT CGAACGCCAT GGGCTTCAAG ATGTCCTACA AGCATCCAAG CGAGATCATG GACGAGATTG CGCTGTCGAC GCCGAGCTTT GCTGGCGTCT CCTACGCCAA GCTCGACGAG ATGGGCTCGG TGCAATGGCC CTGCAATGCG GCGGCGCCCG AAGGTACGCC AGTCATGCAT ATCGGCGGCT TCGCCGGCGG CAAGGGCAAA TTCGTCATCA CCGAATATGT CGCCACCGAC GAGCGGACGG GGCCGCGCTT CCCGCTGCTG CTGACCACCG GACGCATCCT CAGCCATTAT AATGTCGGGG CGCAGACGCG GCGGACCGCC AATGTCGCCT GGCATCCGGA GGATCTTCTT GAGATCCATC CCCATGACGC GGAGCTGAGG GGCGTCAAGG ACGGCGACTG GGTGAGGCTG CATAGCCGCG CCGGCGAAAC GACGTTGCGC GCGCTGGTTA CCGATCGCGT TGCGCCCGGC GTCGTCTATA CGACTTTCCA TCACCCGATG ACGCAGGCCA ATGTCGTGAC GACGGATTTT TCCGACTGGG CGACCAATTG TCCGGAATTC AAGGTGACGG CGGTTCAGAT TTCGCCCTCG AACGGCCCGT CCGAATGGCA GGAAAGCTAT CGCCAGCAGG CCGAGCAGAG CCGCCGCATC CTGCAGGAGG CGGACGCGGC GGAATGA
|
Protein sequence | MALITEIDFG TPAPKSEKMV TLTIDGVEVS APEGTSIMRA AMEMGTEIPK LCATDSIEAF GSCRLCLVEI EGRNGTPASC TTPVAPGMSV KTQTPRLKAL RKGVMELYIS DHPLDCLTCS ANGDCELQTQ AGAVGLRDVR YGYEGANHFD SVKDLSNPYF AYDPAKCIVC NRCVRACEEV QGTFALTIDG RGFGSRVAAG QDEPFLESEC VSCGACVQAC PTATLMEKSV IEIGLPEHSA VTTCAYCGVG CTFKAEMRGE ELVRMVPYKD GKANHGHSCV KGRFAYGYAT HRDRILKPMI RASIEDPWRE VSWDEAIGYA ASQFRRIQAE HGKQAIGGIT SSRCTNEETY LVQKLIRAGF GNNNVDTCAR VCHSPTGYGL KTAFGTSAGT QDFDSIEETD VMLVIGANPT DGHPVFGSRM KKRLRAGAKL IVIDPRRIDL VETPHVKADF HLPLQPGCNV SVVTSLAHVI VTEGLVNEAF VRERCDWSEF QDWAAFVSEP RHSPEEVEKI SGVPAGLIRG AARLYATGGN AAIFYGLGVT EHSQGSTTVL AIANLAMATG NLGRPGVGVN PLRGQNNVQG ACDMGSFPHE FAGYRHVSDT AARDVYESLW GVELEHEPGL RIPNMLDAAV EGTFRGIYIQ GEDILQSDPD THHVASGLKA MDCVIVQDLF LNETANYAHV FLPGSTFLEK DGTFTNAERR IQRVRKVMSP KNGYGDWEIT MLLSNAMGFK MSYKHPSEIM DEIALSTPSF AGVSYAKLDE MGSVQWPCNA AAPEGTPVMH IGGFAGGKGK FVITEYVATD ERTGPRFPLL LTTGRILSHY NVGAQTRRTA NVAWHPEDLL EIHPHDAELR GVKDGDWVRL HSRAGETTLR ALVTDRVAPG VVYTTFHHPM TQANVVTTDF SDWATNCPEF KVTAVQISPS NGPSEWQESY RQQAEQSRRI LQEADAAE
|
| |