Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Shewmr7_3791 |
Symbol | |
ID | 4255375 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella sp. MR-7 |
Kingdom | Bacteria |
Replicon accession | NC_008322 |
Strand | + |
Start bp | 4503782 |
End bp | 4506634 |
Gene Length | 2853 bp |
Protein Length | 950 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 638124482 |
Product | formate dehydrogenase alpha subunit |
Protein accession | YP_739827 |
Protein GI | 114049277 |
COG category | [C] Energy production and conversion [R] General function prediction only |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing [COG3383] Uncharacterized anaerobic dehydrogenase |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGATTAA CCCGCAAAAC AGACCAAGTC GTCGAGCCTA AAGTGCCCGC CCTCGGTCTT AATCGTCGCC AATTCTTAAA ATCTGCAGGT CTTGCCACTG GTGGTATCGC CGCCGCGTCT ATGCTTGGCA CAGGTATGAT GCGTAAAGCA CAGGCGCAGG AACATATCCC CCATAATGCA CCGACTGAAG TCAAACGTAC CATCTGCTCT CACTGCGCTG TGGGTTGTGG TATCTATGCT GAAGTGCAAA ACGGTGTGTG GACAGGTCAA GAACCCGCGT TCGATCATCC ATTTAACCAA GGCGGCCACT GCGCGAAAGG GGCTGCACTG CGTGAACACG GCCACGGTGA AAAACGCCTG AAATACCCAA TGAAGTTAGA AGGCGGCAAG TGGAAGAAGA TCTCTTGGGA TCAAGCCATC AATGAAGTGG GCGATAAAAT GACTGCGATT CGTCAAGAAT CGGGTCCAGA CTCTATCTAC TTTATGGGTA GTGCTAAGTT CTCTAACGAA CAGGCTTATT TATATCGCAA ACTCGCGGCA CTGTGGGGTA CAAACAACGT CGACCACTCT GCCCGTATTT GTCACTCTAC CACGGTAGCC GGTGTTGCTA ACACTTGGGG CTACGGTGCG CAAACCAACT CGTTAAACGA TATCCGCAAC TCTAAGTGCA TCATGTTCGT GGGTTCAAAC CCAAGTGAAG CACACCCTGT CGCCATGCAA CACATTCTGG TGGCCAAAGA GCGCGGCGCT AAGATTATCG TTGTTGACCC ACGTTTCACC CGTACAGCAG CTAAGTCTGA CGAGTACGTG CATATCCGCC CAGGTACCGA TATCCCCTTC ATCTATGGCC TGTTATGGCA CATTTTTGAA AATGGCTGGG AAGATAAAGA GTTCATCAAG CAACGTGTAT ACGGCATGGA ACGTATTCGC GATGAAGTGA AAAAATATAC CCCTGAAGAA GTCGAAAACG TTGCTGGTGT GCCTAAGGCG CAAATGTATC GTATCGCAAA AATGTTAGCA GAGACCAAAC CAGGCACTAT CGTATGGTGT ATGGGCGGTA CTCAGCACCA CGTCGGTAAC GCCAACACCC GTTCATACTG TATTTTACAG TTAGCGCTGG GCAACATGGG CGTATCGGGC GGTGGTACCA ACATTTTCCG CGGTCACGAT AACGTGCAAG GTGCAACGGA CTTTGGTCTG TTATTCGACA ACTTACCCGG TTACTACGGT TTAACCTCTG GCGCTTGGGC TCACTGGTCT GGCGTTTGGG ACTTAGATCC TAAATGGGTT GCAGGCCGTT TCGACCAAGG CGAATACTTA GGTCAAACCC CACAAACCTC AACGGGTATA CCCTGCTCTC GCTGGCACGA TGGTGTACTC GAAGATAAAA CCAAGATCGC GCAGAAGGAT AACATCCGTC TGGCGTTCTT CTGGGGTCAA TCTGTCAACA CCGAAACCCG TGGCCGCGAA GTGCGTGAAG CACTGAACAA GTTAGATACT GTAGTTGTTG TCGACCCAAT CCCAACCATG GCCGGTGTTA TGCACCAGCG TAAAGATGGG GTGTATCTGC TCCCTGCGTC GACTCAATTT GAAACCTACG GCTCAGTGTC TGCCACTAAC CGTTCGATTC AATGGCGCTC TAAAGTGATC GAGCCGCTGT TTGAGTCTCT GCCTGACCAC GTCATTATGT ACAAACTGGC GAAAAAGCTG GGTATCGAAA AAGAATTCTG TAAGCACATC CAAGTGAATG GTGAAGAGCC ATTGATTGAA GACGTGACCC GCGAATTCAA CAAAGGCATG TGGACCGTCG GTTACACAGG CCAGAGCCCA GAACGTCTGA AAATGCACCA AGAAAACTGG GGCACTTTCG ATGTTAACAG CCTGACCGCA CCGGGCGGCC CAGCTAAAGG TGAAGTCTAC GGCTTACCTT GGCCATGTTG GGGTACTCCA GAGATGAAAC ACCCTGGTAC CCAAATCCTT TACGATCAAT CCAAAGAAGT GAAAGACGGT GGTGGTAACT TCCGCGCCCG TTATGGTGTT GAACACAATG GTGTGAATAT TCTCGCCGAC GGTTCATTCT CCAAAGGCAG TGAAATTCAA GATGGTTATC CTGAGTTTAC CGCCGACATG CTCAAGCAAT TGGGTTGGTG GGATGATTTA ACTGAAGAAG AGAAAAAATA CGCCGAAGGC AAAAACTGGA AAACAGATAT TTCTGGTGGT ATCCAACGTG TTGCCATCAA ACACGGCTGT ATTCCTTTCG GTAACGCAAA AGCGCGTTGT ATCGTGTGGA CTTTCCCAGA CGATATCCCG CTGCACCGCG AACCACTCTA CACTCCTCGT CGTGACTTAG TCGCTAAGTA CCCAACCTAC GAAGACCGTA TGGTTGCCCG TCTACCGACC CTGTATAAGT CAATTCAGGA TAAGGACTTC ACCCAAGGCT TCCCACTGAC ACTGACCTCT GGTCGTTTAG TGGAATACGA AGGTGGTGGT GAAGAATCTC GTTCTAACCC TTGGTTGGCC GAGCTACAAC AGGAAATGTT CATCGAAATG AACCCGGCAG ACGCTGCTGA CCGTGGTATC CGTGACGGTG ACAATGTCTT TGTTCATGGT CCTGAAGGCG CCAAGATCAC AGTGAAGGCA ATGGTGACAC CACGGGTTGT TCCGGGTGAA TGTTTTATGC CATACCACTT CGCCGGTATC TTCGAAGGTG AAAACCTCGC GAAGAATTAC CCAGAAGGTA CAGTACCTTA TGTACAAGGT GAATCGGCAA ACACCATTTT AACTTACGGC TATGACGTTG TGACTCAGAT GCAAGAAACT AAGTCCAGCC TTTGCCAAGT TAGCAAAGCC TAA
|
Protein sequence | MRLTRKTDQV VEPKVPALGL NRRQFLKSAG LATGGIAAAS MLGTGMMRKA QAQEHIPHNA PTEVKRTICS HCAVGCGIYA EVQNGVWTGQ EPAFDHPFNQ GGHCAKGAAL REHGHGEKRL KYPMKLEGGK WKKISWDQAI NEVGDKMTAI RQESGPDSIY FMGSAKFSNE QAYLYRKLAA LWGTNNVDHS ARICHSTTVA GVANTWGYGA QTNSLNDIRN SKCIMFVGSN PSEAHPVAMQ HILVAKERGA KIIVVDPRFT RTAAKSDEYV HIRPGTDIPF IYGLLWHIFE NGWEDKEFIK QRVYGMERIR DEVKKYTPEE VENVAGVPKA QMYRIAKMLA ETKPGTIVWC MGGTQHHVGN ANTRSYCILQ LALGNMGVSG GGTNIFRGHD NVQGATDFGL LFDNLPGYYG LTSGAWAHWS GVWDLDPKWV AGRFDQGEYL GQTPQTSTGI PCSRWHDGVL EDKTKIAQKD NIRLAFFWGQ SVNTETRGRE VREALNKLDT VVVVDPIPTM AGVMHQRKDG VYLLPASTQF ETYGSVSATN RSIQWRSKVI EPLFESLPDH VIMYKLAKKL GIEKEFCKHI QVNGEEPLIE DVTREFNKGM WTVGYTGQSP ERLKMHQENW GTFDVNSLTA PGGPAKGEVY GLPWPCWGTP EMKHPGTQIL YDQSKEVKDG GGNFRARYGV EHNGVNILAD GSFSKGSEIQ DGYPEFTADM LKQLGWWDDL TEEEKKYAEG KNWKTDISGG IQRVAIKHGC IPFGNAKARC IVWTFPDDIP LHREPLYTPR RDLVAKYPTY EDRMVARLPT LYKSIQDKDF TQGFPLTLTS GRLVEYEGGG EESRSNPWLA ELQQEMFIEM NPADAADRGI RDGDNVFVHG PEGAKITVKA MVTPRVVPGE CFMPYHFAGI FEGENLAKNY PEGTVPYVQG ESANTILTYG YDVVTQMQET KSSLCQVSKA
|
| |