Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Shewmr4_3720 |
Symbol | |
ID | 4254283 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella sp. MR-4 |
Kingdom | Bacteria |
Replicon accession | NC_008321 |
Strand | + |
Start bp | 4444941 |
End bp | 4447793 |
Gene Length | 2853 bp |
Protein Length | 950 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 638120365 |
Product | formate dehydrogenase alpha subunit |
Protein accession | YP_735840 |
Protein GI | 113972047 |
COG category | [C] Energy production and conversion [R] General function prediction only |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing [COG3383] Uncharacterized anaerobic dehydrogenase |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.933728 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGATTAA CCCGCAAAAC AGACCAAGTC GTCGAGCCAA AAGTGCCCGC CCTCGGTCTC AATCGTCGCC AATTCTTAAA ATCTGCAGGT CTTGCCACTG GTGGTATCGC CGCCGCGTCT ATGCTTGGCA CAGGCATGAT GCGTAAAGCA CAGGCGCAGG AACATATCCC CCATAATGCA CCGACTGAAG TCAAACGTAC TATTTGCTCT CACTGCGCTG TGGGTTGTGG TATCTATGCT GAAGTGCAAA ACGGTGTGTG GACAGGTCAA GAACCCGCGT TCGATCATCC ATTTAACCAA GGCGGCCACT GCGCGAAAGG GGCTGCACTG CGTGAACACG GCCACGGTGA GAAACGCCTG AAATACCCAA TGAAGTTAGA AGGCGGTAAG TGGAAGAAGA TCTCTTGGGA TCAAGCCATC AATGAAGTGG GTGATAAAAT GACTGCGATT CGTCAAGAAT CGGGTCCTGA CTCTATCTAC TTTATGGGTA GCGCTAAGTT CTCTAACGAA CAAGCCTATT TATATCGTAA ACTCGCGGCA CTGTGGGGCA CAAACAACGT CGACCACTCA GCCCGTATTT GTCACTCTAC CACGGTAGCC GGTGTTGCTA ACACTTGGGG CTACGGTGCG CAAACCAACT CGTTGAACGA TATCCGCAAC TCTAAGTGCG TCATGTTTGT GGGTTCAAAC CCAAGTGAAG CGCACCCAGT CGCCATGCAA CACATTTTGG TGGCAAAAGA GCGCGGCGCT AAGATTATCG TTGTTGATCC ACGTTTCACC CGTACTGCAG CTAAGTCTGA CGAGTACGTG CATATCCGCC CAGGTACCGA TATCCCCTTC ATCTATGGTC TGTTATGGCA CATTTTTGAA AACGGCTGGG AAGATAAAGA GTTCATCAAG CAACGTGTTT ACGGCATGGA ACGTATTCGC GATGAAGTGA AAAAATATAC GCCTGAAGAA GTCGAAAACG TTGCTGGCGT GCCTAAGGCG CAAATGTACC GTATCGCTAA AATGTTAGCC GAAACCAAAC CTGGCACTAT CGTATGGTGT ATGGGCGGTA CTCAGCACCA CGTCGGTAAC GCCAACACCC GTTCATACTG TATTTTACAG TTAGCGCTGG GCAACATGGG CGTATCAGGC GGCGGTACTA ACATTTTCCG TGGTCATGAT AACGTGCAAG GCGCGACTGA CTTTGGTCTG TTATTCGACA ACTTACCCGG TTACTACGGT TTAACTTCTG GCGCTTGGGC TCACTGGTCT AACGTTTGGG ACTTAGATCC AAAATGGGTT GCAGGCCGTT TCGACCAAGG CGAGTACCTA GGTCAAACAC CTCAAACCTC AACGGGTATT CCCTGCTCTC GCTGGCACGA TGGTGTACTA GAAGATAAAA CCAAGATCGC GCAAAAGGAC AACATCCGTC TGGCGTTCTT CTGGGGTCAA TCTGTCAACA CCGAAACCCG TGGCCGCGAA GTACGTGAAG CACTGAACAA GTTAGATACT GTGGTTGTTG TCGACCCAAT CCCAACCATG GCCGGTGTTA TGCACCAGCG TAAAGATGGG GTGTATCTGC TCCCTGCGTC GACTCAATTT GAAACCTACG GCTCAGTGTC TGCCACTAAC CGTTCGATTC AATGGCGCTC TAAAGTGATC GAGCCGCTGT TTGAGTCTCT GCCTGACCAC GTCATTATGT ACAAACTGGC GAAAAAGCTG GGTATCGAAA AAGAATTCTG TAAGCACATC CAAGTGAATG GTGAAGAGCC ATTGATTGAA GACGTGACCC GCGAATTCAA CAAAGGCATG TGGACCGTCG GTTACACAGG CCAGAGCCCA GAACGTCTGA AAATGCACCA AGAAAACTGG GGCACTTTCG ATGTTAACAG CCTGACCGCA CCGGGCGGCC CAGCTAAAGG TGAAGTCTAC GGCTTACCTT GGCCATGTTG GGGTACTCCA GAGATGAAAC ACCCTGGTAC CCAAATCCTT TACGATCAAT CCAAAGAAGT GAAAGACGGT GGTGGTAACT TCCGCGCCCG TTATGGTGTT GAACACAATG GTGTGAATAT TCTCGCCGAC GGTTCATTCT CCAAAGGCAG TGAAATTCAA GATGGTTATC CTGAGTTTAC CGCCGACATG CTCAAGCAAT TGGGTTGGTG GGATGATTTA ACTGAAGAAG AGAAAAAATA CGCCGAAGGC AAAAATTGGA AAACAGATAT TTCTGGTGGT ATCCAACGTG TTGCCATCAA ACACGGCTGT ATTCCTTTCG GTAACGCAAA AGCGCGTTGT ATCGTGTGGA CTTTCCCAGA CGATATCCCG CTGCACCGCG AGCCACTCTA CACTCCTCGT CGTGACTTAG TCGCTAAGTA CCCAACCTAC GAAGACCGTA TGGTTGCCCG CCTACCGACC CTGTATAAGT CAATTCAGGA TAAGGACTTC ACCCAAGGCT TCCCACTGAC ACTGACCTCT GGTCGTTTGG TGGAATACGA AGGTGGTGGT GAAGAATCTC GTTCTAACCC TTGGTTGGCC GAGCTACAAC AGGAAATGTT CATCGAAATG AACCCGGCAG ACGCTGCTGA CCGTGGTATC CGTGACGGTG ACAATGTCTT TGTTCATGGT CCTGAAGGCG CCAAGATCAC AGTGAAGGCA ATGGTGACAC CACGGGTTGT TCCGGGTGAA TGTTTTATGC CATACCACTT CGCCGGTATC TTTGAAGGTG AAAACCTCGC GAAGAATTAC CCTGAAGGTA CAGTACCTTA TGTACAAGGT GAATCGGCTA ACACCATTCT AACTTACGGC TATGACGTTG TGACTCAGAT GCAAGAAACT AAGTCCAGCC TTTGCCAAGT TAGCAAAGCC TAA
|
Protein sequence | MRLTRKTDQV VEPKVPALGL NRRQFLKSAG LATGGIAAAS MLGTGMMRKA QAQEHIPHNA PTEVKRTICS HCAVGCGIYA EVQNGVWTGQ EPAFDHPFNQ GGHCAKGAAL REHGHGEKRL KYPMKLEGGK WKKISWDQAI NEVGDKMTAI RQESGPDSIY FMGSAKFSNE QAYLYRKLAA LWGTNNVDHS ARICHSTTVA GVANTWGYGA QTNSLNDIRN SKCVMFVGSN PSEAHPVAMQ HILVAKERGA KIIVVDPRFT RTAAKSDEYV HIRPGTDIPF IYGLLWHIFE NGWEDKEFIK QRVYGMERIR DEVKKYTPEE VENVAGVPKA QMYRIAKMLA ETKPGTIVWC MGGTQHHVGN ANTRSYCILQ LALGNMGVSG GGTNIFRGHD NVQGATDFGL LFDNLPGYYG LTSGAWAHWS NVWDLDPKWV AGRFDQGEYL GQTPQTSTGI PCSRWHDGVL EDKTKIAQKD NIRLAFFWGQ SVNTETRGRE VREALNKLDT VVVVDPIPTM AGVMHQRKDG VYLLPASTQF ETYGSVSATN RSIQWRSKVI EPLFESLPDH VIMYKLAKKL GIEKEFCKHI QVNGEEPLIE DVTREFNKGM WTVGYTGQSP ERLKMHQENW GTFDVNSLTA PGGPAKGEVY GLPWPCWGTP EMKHPGTQIL YDQSKEVKDG GGNFRARYGV EHNGVNILAD GSFSKGSEIQ DGYPEFTADM LKQLGWWDDL TEEEKKYAEG KNWKTDISGG IQRVAIKHGC IPFGNAKARC IVWTFPDDIP LHREPLYTPR RDLVAKYPTY EDRMVARLPT LYKSIQDKDF TQGFPLTLTS GRLVEYEGGG EESRSNPWLA ELQQEMFIEM NPADAADRGI RDGDNVFVHG PEGAKITVKA MVTPRVVPGE CFMPYHFAGI FEGENLAKNY PEGTVPYVQG ESANTILTYG YDVVTQMQET KSSLCQVSKA
|
| |