Gene Information |
Locus tag | SO_4509 |
Symbol | |
ID | 1172101 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella oneidensis MR-1 |
Kingdom | Bacteria |
Replicon accession | NC_004347 |
Strand | + |
Start bp | 4700477 |
End bp | 4703326 |
Gene Length | 2850 bp |
Protein Length | 949 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 637346229 |
Product | formate dehydrogenase, alpha subunit |
Protein accession | NP_720029 |
Protein GI | 24375986 |
COG category | [C] Energy production and conversion [R] General function prediction only |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing [COG3383] Uncharacterized anaerobic dehydrogenase |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
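The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is a minimal illustration, not part of the record. It assumes that Start bp and End bp are 1-based and inclusive, and that Gene Length counts the terminal stop codon (the gene sequence listed under Sequence ends in TAA). The function names are hypothetical helpers introduced only for this example.

```python
# Values copied from the Gene Information table for SO_4509.
START_BP = 4700477
END_BP = 4703326
GENE_LENGTH_BP = 2850
PROTEIN_LENGTH_AA = 949


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS, assuming its last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(START_BP, END_BP)
computed_protein_length = expected_protein_length(computed_gene_length)

print("gene length matches record:", computed_gene_length == GENE_LENGTH_BP)
print("protein length matches record:", computed_protein_length == PROTEIN_LENGTH_AA)
```

Under these assumptions both comparisons agree with the values in the table. A mismatch on another record would more likely point to a different coordinate convention or a spliced gene than to a data error.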
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGCGATTAA CCCGCAAAAC AGACCCAGTT GTCGAGCCCA AAGTGCCCGC CCTCGGTCTC AATCGTCGCC AATTTTTAAA ATCCGCAGGT CTTGCCACGG GTGGTATTGC TGCGGCTTCA ATGCTTGGCA CAGGCATGAT GCGTAAAGCA GAAGCTAAAG ATATCCCCCA CAATGCGCCA ACTGAAGTCA AGCGTACCAT TTGCTCTCAC TGCTCAGTAG GTTGTGGTGT GTATGCAGAA GTGCAAAATG GCGTGTGGAC AGGCCAAGAA CCTGCATTTG ACCACCCCTT CAACCAAGGC GGTCACTGTG CTAAAGGGGC TGCGCTGCGT GAACACGGCC ATGGTGAAAA ACGCCTTAAA TACCCAATGA AATTAGAAGG CGGCAAGTGG AAGAAGATCT CTTGGGATCA GGCCATCAAT GAAGTAGGCG ATAAAATGAT GGCGATTCGT CAAGAATCAG GTCCTGACTC TATCTACTTT ATGGGTAGTG CTAAGTTTTC TAACGAACAG GCTTATTTAT ATCGCAAATT CGCGGCACTG TGGGGCACAA ACAACGTCGA CCACTCAGCC CGCATTTGTC ACTCTACCAC AGTAGCCGGT GTTGCAAACA CATGGGGCTA CGGTGCGCAA ACCAACTCAG TTAACGATAT TCGCCACTCA AAATGTATCT TATTTGTTGG TTCTAACCCC AGTGAAGCGC ACCCTGTCGC TATGCAACAC ATTTTGGTAG CTAAAGAGCG CGGCGCCAAA ATTATCGTAG TTGATCCTCG TTTCACCCGT ACTGCAGCGA AATCTGACGA GTACGTGCAT ATCCGCCCAG GTACCGATAT TCCGTTTATC TATGGCCTGT TATGGCACAT TTTTGAAAAT GGCTGGGAAG ATAAAGACTT TATCAAGCGC CGCGTTTACG GCATGGAACG TATCCGCGAA GAAGTGAAAA AGTATACTCC AGAAGAAGTT GAAAACGTGG TGGGCGCGCC TAAGGCACAA ATGTATCGCG TCGCTAAGAT GATGGCTGAA ACGAAACCCG GTAGCATTGT ATGGTGCATG GGTGGCACTC AACACCACGT TGGCAACGCT AACACCCGCT CGTACTGTAT TTTACAGTTA GCGCTGGGCA ACATGGGTGT TACAGGTGGT GGCACTAACA TTTTTCGTGG TCACGATAAC GTCCAAGGCG CTTCAGACTT TGGCCTATCG TTTGACGACT TACCCGGCTA CTTCGGTTTA ACGTCTGGTT CCTGGGCTCA CTGGGCAAAC GTATGGGACT TAGATCCTAA GTGGGTCACA AGCCGTTTTG ATCAAGGTGA GTACTTAGGT CAGTCTCCCC AAACCTCTCC TGGGATCCCA TGTTCTCGCT GGCACGATGG TGTACTGGAA GATAAAACCA AGATCGCACA GAAGGATAAT ATCCGTTTAG CGTTCTTCTG GGGCCAATCG GTCAACACCG AAACTCGTGG CCGCGAAGTA CGTCAAGCAT TGGATAAAAT GGATACAGTG GTAGTTGTCG ACCCATTCCC AACCATGGCT GGTGTAATGC ACCAACGTAA AGATGGTGTG TATCTGTTAC CTGCTGCGAC CCAATTTGAA ACCTATGGCT CAGTCTCTGC AACTAACCGC TCAATTCAAT GGCGCTCTAA AGTGATTGAG CCACTGTTTG AATCACTGCC AGATCACGTG ATTATGTGCA AACTTGCTAA AAAGGTGGGG ATTGATAAAG AGCTATTCAA GCATATTAAG GTCAATGGTG AAGAACCGCT GATTGAAGAT ATTGTCCGTG AATACAACAG AGGTATGTGG 
ACAATTGGCT ACACAGGTCA AAGCCCTGAG CGTTTAAAAA TGCACCAAGA AAACTGGGGC ACATTTAATG TTGATAGCTT AGAAGCACCT GGTGGCCCAG CCAAAGGTGA GACCTATGGT TTGCCTTGGC CATGCTGGGG TACGCCAGAG ATGAAACACC CTGGCTCCCA CATTTTGTAT AACGAAACAA AGCATGTTAA AGACGGTGGT GGCTCTTTCC GCGCCCGCTT TGGTGTCGAG CGAAATGGCG TCAATCTTCT TTCAGAAGAA GCTTACTCTG CGGGTAGTGA AATCCAAGAT GGTTATCCTG AATTTACCGC TGACATGCTC AAGCAATTGG GTTGGTGGGA TGATTTAACC GAAGACGAGA AAAAATACGC TGAAGGAAAA AACTGGAAAA CAGATATTTC TGGCGGTATT CAACGTGTTG TCATTAAGCA TGGCTGTATT CCCTATGGCA ATGGTAAGGC CCGCGCGGTG GTATGGAATT TCCCCGATGA TATCCCGCTG CACCGCGAAC CACTTTACAC TCCTCGTCGA GACTTAGTCG CTAAGTACCC AACCTACGAA GACCGTATGG TTGCACGTCT ACCGACGCTT TATAAGTCCA TTCAGGACAA GGACTTTGCC AAAGACTTCC CACTGGCGCT CACATCGGGT CGTTTAGTGG AATACGAAGG CGGTGGTGAA GAAACTCGCT CTAACCCTTG GTTAGCTGAG CTACAACAAG AAATGTTTAT CGAAATAAGC CCGGCAGACG CTGCTGACCG TGGTATCCGT GACGGTGACA ATGTCTTTGT TCATAGCCCT GAAGGCGCCA AGATCACAGT GAAGGCAATG GTGACACCAC GGGTTGTTCC GGGTGAATGT TTTATGCCGT ACCACTTCGC AGGGGTATTT GAAGGTGAAA GCCTTGCGAA AAATTATCCT GAAGGCACAG TGCCTTACGT GATAGGCGAA TCAGCAAACA CCATTTTAAC TTATGGCTAT GACGTTGTGA CTCAGATGCA AGAAACTAAG TCCAGCCTCT GCCAAATAAG CAAAGCCTAA
|
Protein sequence | MRLTRKTDPV VEPKVPALGL NRRQFLKSAG LATGGIAAAS MLGTGMMRKA EAKDIPHNAP TEVKRTICSH CSVGCGVYAE VQNGVWTGQE PAFDHPFNQG GHCAKGAALR EHGHGEKRLK YPMKLEGGKW KKISWDQAIN EVGDKMMAIR QESGPDSIYF MGSAKFSNEQ AYLYRKFAAL WGTNNVDHSA RICHSTTVAG VANTWGYGAQ TNSVNDIRHS KCILFVGSNP SEAHPVAMQH ILVAKERGAK IIVVDPRFTR TAAKSDEYVH IRPGTDIPFI YGLLWHIFEN GWEDKDFIKR RVYGMERIRE EVKKYTPEEV ENVVGAPKAQ MYRVAKMMAE TKPGSIVWCM GGTQHHVGNA NTRSYCILQL ALGNMGVTGG GTNIFRGHDN VQGASDFGLS FDDLPGYFGL TSGSWAHWAN VWDLDPKWVT SRFDQGEYLG QSPQTSPGIP CSRWHDGVLE DKTKIAQKDN IRLAFFWGQS VNTETRGREV RQALDKMDTV VVVDPFPTMA GVMHQRKDGV YLLPAATQFE TYGSVSATNR SIQWRSKVIE PLFESLPDHV IMCKLAKKVG IDKELFKHIK VNGEEPLIED IVREYNRGMW TIGYTGQSPE RLKMHQENWG TFNVDSLEAP GGPAKGETYG LPWPCWGTPE MKHPGSHILY NETKHVKDGG GSFRARFGVE RNGVNLLSEE AYSAGSEIQD GYPEFTADML KQLGWWDDLT EDEKKYAEGK NWKTDISGGI QRVVIKHGCI PYGNGKARAV VWNFPDDIPL HREPLYTPRR DLVAKYPTYE DRMVARLPTL YKSIQDKDFA KDFPLALTSG RLVEYEGGGE ETRSNPWLAE LQQEMFIEIS PADAADRGIR DGDNVFVHSP EGAKITVKAM VTPRVVPGEC FMPYHFAGVF EGESLAKNYP EGTVPYVIGE SANTILTYGY DVVTQMQETK SSLCQISKA