Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SO_4513 |
Symbol | |
ID | 1172105 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella oneidensis MR-1 |
Kingdom | Bacteria |
Replicon accession | NC_004347 |
Strand | + |
Start bp | 4705625 |
End bp | 4708477 |
Gene Length | 2853 bp |
Protein Length | 950 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 637346233 |
Product | formate dehydrogenase, alpha subunit |
Protein accession | NP_720033 |
Protein GI | 24375990 |
COG category | [C] Energy production and conversion [R] General function prediction only |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing [COG3383] Uncharacterized anaerobic dehydrogenase |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGTTAA CTCGCAAGTC CGATGTCGCC CAAGTGGCCA ACAAACCGAC GTTAGGTATT AGCCGTCGTC AATTTATGAA GCAAGCAGGT ATTGCTACAG GTGGTATCGC CGCCGCTTCT TTAATGGGTA CTGGCATGAT GCGCCGCGCA GAAGCCAAAG ATGTGCCATA TGATGCTCCA ATCGAGATCA AACGTACTAT TTGTAGTGCG TGTGCTGTCG GTTGTGGTCT GTATGCCGAA GTACAAAACG GTGTGTGGAC AGGTCAAGAG CCTGCATTCG ATCACCCATT CAATGCCGGT GGCCACTGTG CTAAAGGTGC TGCGCTACGT GAACACGGCC ACGGTGAAAA ACGCCTGAAA TACCCAATGA AATTGGTTGA TGGCAAGTGG AAAAAAATCT CTTGGGAAGA CGCGATTAAC GAAGTGGGCG ACCAAATGCT CAACATTCGT AAAGAATCAG GCCCAGACTC TGTGTACTTT ATGGGTAGTG CGAAGTTCTC TAACGAAGGT TGTTATGCAT ACCGCAAACT GGCGGCCATG TGGGGCACCA ACAACGTCGA CCACTCTGCT CGTATTTGTC ACTCTACCAC GGTAGCTGGT GTTGCTAACA CTTGGGGCTA CGGTGCGCAA ACTAACTCTT TCAACGACAT TCAGAATGCC AATGCCATCT TCCTGATCGG TGCAAACCCT GCGGAAGCGC ATCCAGTTTC TATGCAACAT ATTCTGATCG CCAAAGAGAA AAACAACGCG AAAATCATCG TCGTTGATCC TCGTTTCTCT CGTACTGCAG CACACTCAGA TCTGCACTGC GCGATTCGTC CAGGTACCGA TATTCCGTTT ATTTACGGTA TGTTATGGCA CATTTTCGAA AACGGTTGGG AAGATAAAAC CTTTATCCAA CACCGCGTAT TCGAGATGGA AACCATCCGT GCAGAAGTGA AAAAATTCCC ACCTAAAGAA GTGGCGAATA TCACAGGCGT AAGCGAAGAA GTGGTTTATC AAGCCGCGAA ACTGATGGCA GAAAACCGTC CAGGTACCGT TATTTGGTGT ATGGGTGGTA CTCAGCATCA CGTTGGTAAC GCTAACACCC GCGCATACTG TATTCTGCAA TTAGCCTTAG GCAACATGGG CGTATCTGGC GGCGGTACTA ACATTTTCCG TGGTCACGAT AACGTTCAAG CAGCAACGGA CTTAGGTCTG CTATTTGATA ACTTACCAGG TTACTACGGT CTGACTACTG CGGCATGGGA ACATTGGACT AACGTGTGGG AACTGGATAT GGAGTGGATG AAGAGTCGCT TCGATCACGG AACCTACCTT GGTCGTGAGC CAATGACCAC ACCTGGTATG CCATGTTCTC GCTGGTTTGA CGGCGTGTTA TTAGAAAAAG ATAAACTGGC ACAGAAAGAC AATATCCGCA TGGCTTTCTT CTGGGGACAA TCGGTTAACA CTGGCACCCG CCAACGTGAT GTCCGCGATG CCTTAGATAA ATTAGATACG GTTGTGGTCG TGGATCCATT CCCAACCATC GCAGGTATCA TGCACAGCCG TAAAAACGGC GTGTACCTGT TACCAGCCTG TACTCAATTT GAAGCCTCTG GCGCAGTATC TAACTCAGGC CGTTCAATTC AGTGGCGTGA GCAGGTGATT CAGCCGCTAT TCGAATCTAA AAACGATATC GAAATTATGT ACATGCTTGC GAAAAAAGTG GGCATTTCTG AGCAATGGGC TAAGCGCTGG AACATCGCAG GCAATATGCC AGTGGTTGAA GATATCTCTC GCGAAATCAA CCGCGGTATG TGGACTATCG GTTACACCGG TCAAAGCCCA GAGCGTATCA AACAGCACAC CCAAAACTGG GGTACCTTTA GCAACAAGAC GCTTGAAGCT GCAGGTGGTC CATGTAAAGG TGAAACCTAT GGCTTACCTT GGCCATGTTG GGGCACACCA GAAGCCAAGC ACCCTGGTAC CCAAATTCTG TATAACCAAG ACAAGCACGT TAAAGACGGC GGCGGTAACT TCCGTGCCCG TTACGGCGTT GAATACAATG GTAAAAACCT ACTGGCGGAA GGCACCTTCT CTAAGGGCGC TGAAATCCAA GACGGTTACC CAGAATTCTC TGACAAACTG CTTAAGCAAC TCGGTTGGTG GGATGATCTG ACTGCAGAAG AAAAAGCGGC TGCAGAAGGT AAAAACTGGA AAACTGATTT GTCAGGCGGC ATTGTGCGCG TGGCGATCAA GCACGGTTGT ATTCCGTTTG GTAATGCGAA AGCCCGTTGT ATTGTTTGGA CTTTCCCAGA CCAAGCCCCT GTACACCGCG AGCCAATTTA TACAGCCCGC CGTGACTTAG TGGCTAAATA CCCAACCTAT GACGATATGC AAGTACATCG TCTACCAACA CTGTATAAGA CGCTCCAAGA GAACGACTTA AGCGGTAAAT ATCCATTAGT CATGACCTCT GGTCGTTTAG TGGAATACGA AGGTGGTGGT GAAGAATCTC GTTGTAACCC ATGGCTGGCT GAGCTTCAAC AGGAAATGTT CATTGAGATC AGCCCTGCTG ACGCTGCTGA CCGAGCTATC CGCAACGGTG AGTTTGTATG GGTAGAAGGT GCTGAAGGCG GTCGTATCAA GGTACAAGCC ATGGTGACGC CACGCGTAGC ACCGGGTGTA ACCTTTATGC CATACCACTT TGCGGGTGTG ATGCACGGAG AAAGTTTAGC GGCTAACTAT CCAGAAGGCA CTGTGCCTTA CGTTATCGGT GAATCCTGTA ACACGGCACT GACCTATGGT TATGACCCTG TGACCCAAAT GCAGGAAACC AAAGCGTCGC TCTGTCAGAT CGTTAAAGCA TAA
|
Protein sequence | MKLTRKSDVA QVANKPTLGI SRRQFMKQAG IATGGIAAAS LMGTGMMRRA EAKDVPYDAP IEIKRTICSA CAVGCGLYAE VQNGVWTGQE PAFDHPFNAG GHCAKGAALR EHGHGEKRLK YPMKLVDGKW KKISWEDAIN EVGDQMLNIR KESGPDSVYF MGSAKFSNEG CYAYRKLAAM WGTNNVDHSA RICHSTTVAG VANTWGYGAQ TNSFNDIQNA NAIFLIGANP AEAHPVSMQH ILIAKEKNNA KIIVVDPRFS RTAAHSDLHC AIRPGTDIPF IYGMLWHIFE NGWEDKTFIQ HRVFEMETIR AEVKKFPPKE VANITGVSEE VVYQAAKLMA ENRPGTVIWC MGGTQHHVGN ANTRAYCILQ LALGNMGVSG GGTNIFRGHD NVQAATDLGL LFDNLPGYYG LTTAAWEHWT NVWELDMEWM KSRFDHGTYL GREPMTTPGM PCSRWFDGVL LEKDKLAQKD NIRMAFFWGQ SVNTGTRQRD VRDALDKLDT VVVVDPFPTI AGIMHSRKNG VYLLPACTQF EASGAVSNSG RSIQWREQVI QPLFESKNDI EIMYMLAKKV GISEQWAKRW NIAGNMPVVE DISREINRGM WTIGYTGQSP ERIKQHTQNW GTFSNKTLEA AGGPCKGETY GLPWPCWGTP EAKHPGTQIL YNQDKHVKDG GGNFRARYGV EYNGKNLLAE GTFSKGAEIQ DGYPEFSDKL LKQLGWWDDL TAEEKAAAEG KNWKTDLSGG IVRVAIKHGC IPFGNAKARC IVWTFPDQAP VHREPIYTAR RDLVAKYPTY DDMQVHRLPT LYKTLQENDL SGKYPLVMTS GRLVEYEGGG EESRCNPWLA ELQQEMFIEI SPADAADRAI RNGEFVWVEG AEGGRIKVQA MVTPRVAPGV TFMPYHFAGV MHGESLAANY PEGTVPYVIG ESCNTALTYG YDPVTQMQET KASLCQIVKA
|
| |