Gene Information |
Locus tag | Sama_3609 |
Symbol | |
ID | 4605856 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella amazonensis SB2B |
Kingdom | Bacteria |
Replicon accession | NC_008700 |
Strand | - |
Start bp | 4246748 |
End bp | 4249600 |
Gene Length | 2853 bp |
Protein Length | 950 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 639783030 |
Product | formate dehydrogenase, alpha subunit |
Protein accession | YP_929481 |
Protein GI | 119776741 |
COG category | [C] Energy production and conversion; [R] General function prediction only |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing; [COG3383] Uncharacterized anaerobic dehydrogenase |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
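The coordinate and length fields above can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 4246748
end_bp = 4249600

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# Assumption: one codon per residue, plus one terminal stop codon
# that does not contribute an amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 2853 950
```

Both results agree with the Gene Length (2853 bp) and Protein Length (950 aa) fields in the record.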
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAAACTTA CCCGTACCTC AGACGTGGCC GCCAAGGTCG AGGCCAAGCC ACTGGGCATG AGCCGCCGTC AATTTATGAA AACGGCCGGT ATTGCCACCG GCGGCATTGC TGCAGCCTCC ATGCTTGGCA CTGGTATGAT GCGCCGCGCC GAGGCCAAAG ACGTACCCCA TGATGCGCCG ATTGAAATCA AGCGCACCAT TTGCAGTGCC TGCGCCGTGG GTTGTGGCCT GTATGCCGAA GTGCAAAATG GCGTCTGGAC CGGCCAGGAG CCGGCATTCG ATCACCCCTT CAACGCCGGT GGTCACTGTG CCAAGGGTGC TGCGCTGCGT GAGCACGGCC ATGGTGAAAA ACGCCTCAAG TATCCGATGA AGCTTGAAGG CGGTAAGTGG AAGCGTATCA GCTGGGAGCA GGCGATTAAC GAAGTGGGCG ACAAGATGCT CAACATTCGT CAGGAGTCAG GCCCGGATTC TGTGTACTTC ATGGGCTCGG CCAAGTTCTC CAACGAAGGC TGCTATATGT ACCGCAAGCT CGCGGCCATG TGGGGTACCA ACAACGTCGA CCACTCGGCC CGTATTTGTC ACTCTACCAC TGTAGCCGGT GTGGCCAACA CCTGGGGCTA TGGTGCGCAA ACCAACTCTT TCAACGACAT TCAGAATGCC CGCGCCATCT TTTTTATCGG CGCCAACCCT GCCGAGGCAC ACCCTGTGTC CATGCAGCAC ATTCTGACTG CCAAAGAGCG CAACAACGCC AAGATAATCG TGGTCGACCC GCGCTTCTCC CGCACAGCGG CCCATGCCGA CCTGCACGTG GGCATTCGCC CCGGCACTGA TATTCCGTTT ATTTACGGCA TGTTGTGGCA CATTTTTGAA AACGGCTGGG AAGACAAAAC CTTTATCGAC CAGCGCGTAT TTGGTATGGA CAAGATTCGC GAAGAAGCCA AAAAATTTCC GCCAAAAGAA GTGGCCGATA TCACGGGTGT CTCTGAAGAG GCCATTTATC AGGCTGCCAA ACTGATGGCC GATAACCGCC CCGGCACCGT GGTGTGGTGT ATGGGCGGTA CCCAGCACCA TGTGGGTAAC GCAAATACCC GTGCCTACAG TATTCTGCAG CTGGCGCTGG GTAATATGGG CGTATCCGGT GGTGGTACCA ACATCTTCCG AGGTCACGAC AACGTACAGG GCGCCACCGA CCTGGGCCTG CTGTTTGACA ACCTGCCCGG CTACTACGGC CTGACCAGCG CCGCCTGGCA GCATTGGACT CACGTGTGGG ATCTGGATCT GGAATGGGTC AAGGGGCGTT TCGACCACGG CACCTACCTG GGCCGCGAGC CCATGACCAC CCCCGGTATT CCTTGCTCCC GCTGGCACGA TGGTGTGCTG GAAGACAAGG CCAAGCTGGC GCAGAAAGAC AATATCCGTC TGGCGTTTTT CTGGGGGCAG TCGGTTAACA CCGAAACCCG TCAGCGCGAA GTCCGTGATG CACTGGACAA GATGGACACA GTGGTCGTGG TTGACCCCTT CCCAACCATG GCCGGTGTTA TGCACCGCCG TAAAGACGGC GTTTACCTGC TGCCTGCAGC TACCCAGTTT GAGACCCGTG GCTCCATCTC CAACTCGGGC CGCTCTATCC AGTGGCGTGA ACAGGTTATC GAGCCGCTGT TTGAGTCCAA GACCGACATC GAAATCATGT ACCGTCTGGC GGAAAAGCTG GGTATTGCCG AGCAATACAC CAAGCGCATC AGCAAAGAAA ATGGTGTGCC GCTGATTGAA GACATCACCC GCGAAATCAA CCGCGGCATG 
TGGACCATCG GCATGACAGG TCAGAGCCCT GAGCGTATCA AGGCCCACAC CATGAACTGG GGCACCTTCT CGCAGAAGAG CCTCGAAGCC GAAGGTGGCC CATGTAAGGG CGAAACCTAC GGTCTGCCAT GGCCATGTTG GGGCACGCCT GAGATGAAAC ACCCGGGCAC CCAGATTCTC TATAACACAA GCAAACATGT GAAAGACGGC GGCGGCAACT TCCGTGCCCG TTATGGCGTT GAGTATCAGG GCCAAAATCT GCTGGCCGAG GGCTCCTTCT CCAAGGGCGC CGAGATTGAA GACGGCTATC CCGAATTTAC CGCCGATATG CTCAAGCAAT TGGGCTGGTG GGACGACCTG ACCGCGGCCG AAAAAGCCGA GGCCGAAGGC AAAAACTGGA AGACGGATCT GTCCGGTGGC ATCGTGCGCG TTGCCATCAA GCACGGCTGT ATTCCTTTCG GTAACGCCCG TGCCCGCTGC CTGGTGTGGA CCTTCCCCGA TGCTGTGCCT GTGCACCGTG AACCGCTGTA TACAGCCCGT CGCGATCTGG TGGCCAAGTA CCCCACCTAC GACGACATGC AGGTGCACCG TTTGCCTACC CTGTACAAAT CGATTCAGGA AAAAGACTTC AGCGGCAGCT ACCCGCTGGT GCTCACCTCC GGTCGTTTGG TGGAGTACGA GGGTGGCGGC GAAGAGTCCC GCTCCAACCC CTGGCTGGCC GAGCTGCAGC AGGAGATGTT TGTGGAAATC AATCCGGGCG ACGCTGCCGA CCGGGGTATT CGCAACAATG ACAATGTCTG GCTGGAAGGC CCAGAGGGTG GCCGCATCCT CATCAAGGCA CTGGTGACAC CCAGGGTAAA ACCCGGCGTG ACCTTTATGC CTTACCACTT CGCCGGTGTG ATGCATGGCG AGAGTCTGGC CCCCAACTAT CCGGAAGGCA CCGTGCCCTA CGTCATCGGT GAATCCGCCA ACACGGCGCT GACCTATGGC TACGACCCTG TGACCCAAAT GCAGGAAACC AAGTCGAGCC TGTGTCAGAT AGTAAAAGCC TGA
|
Protein sequence | MKLTRTSDVA AKVEAKPLGM SRRQFMKTAG IATGGIAAAS MLGTGMMRRA EAKDVPHDAP IEIKRTICSA CAVGCGLYAE VQNGVWTGQE PAFDHPFNAG GHCAKGAALR EHGHGEKRLK YPMKLEGGKW KRISWEQAIN EVGDKMLNIR QESGPDSVYF MGSAKFSNEG CYMYRKLAAM WGTNNVDHSA RICHSTTVAG VANTWGYGAQ TNSFNDIQNA RAIFFIGANP AEAHPVSMQH ILTAKERNNA KIIVVDPRFS RTAAHADLHV GIRPGTDIPF IYGMLWHIFE NGWEDKTFID QRVFGMDKIR EEAKKFPPKE VADITGVSEE AIYQAAKLMA DNRPGTVVWC MGGTQHHVGN ANTRAYSILQ LALGNMGVSG GGTNIFRGHD NVQGATDLGL LFDNLPGYYG LTSAAWQHWT HVWDLDLEWV KGRFDHGTYL GREPMTTPGI PCSRWHDGVL EDKAKLAQKD NIRLAFFWGQ SVNTETRQRE VRDALDKMDT VVVVDPFPTM AGVMHRRKDG VYLLPAATQF ETRGSISNSG RSIQWREQVI EPLFESKTDI EIMYRLAEKL GIAEQYTKRI SKENGVPLIE DITREINRGM WTIGMTGQSP ERIKAHTMNW GTFSQKSLEA EGGPCKGETY GLPWPCWGTP EMKHPGTQIL YNTSKHVKDG GGNFRARYGV EYQGQNLLAE GSFSKGAEIE DGYPEFTADM LKQLGWWDDL TAAEKAEAEG KNWKTDLSGG IVRVAIKHGC IPFGNARARC LVWTFPDAVP VHREPLYTAR RDLVAKYPTY DDMQVHRLPT LYKSIQEKDF SGSYPLVLTS GRLVEYEGGG EESRSNPWLA ELQQEMFVEI NPGDAADRGI RNNDNVWLEG PEGGRILIKA LVTPRVKPGV TFMPYHFAGV MHGESLAPNY PEGTVPYVIG ESANTALTYG YDPVTQMQET KSSLCQIVKA
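Although the gene lies on the minus strand, the gene sequence is listed in coding orientation (it begins with ATG and ends with TGA), so it can be translated directly without reverse-complementing. The following is a minimal sketch of that translation, not the database's own pipeline. It assumes the amino acid assignments of translation table 11 are the same as those of the standard genetic code (the tables differ in permitted start codons), and the names `CODON_TABLE` and `translate` are hypothetical helpers introduced for illustration.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace is ignored, so the 10-base blocks used in the record
    can be pasted in unchanged.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGAAACTTA CCCGTACCTC AGACGTGGCC"))  # MKLTRTSDVA

# Last 18 bases of the gene sequence, including the TGA stop codon.
print(translate("CAGAT AGTAAAAGCC TGA"))  # QIVKA
```

The two outputs match the first ten and last five residues of the protein sequence listed in the record.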
|