Gene Information |
Locus tag | Shewmr4_3724 |
Symbol | |
ID | 4254287 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella sp. MR-4 |
Kingdom | Bacteria |
Replicon accession | NC_008321 |
Strand | + |
Start bp | 4450091 |
End bp | 4452943 |
Gene Length | 2853 bp |
Protein Length | 950 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 638120369 |
Product | formate dehydrogenase alpha subunit |
Protein accession | YP_735844 |
Protein GI | 113972051 |
COG category | [C] Energy production and conversion [R] General function prediction only |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing [COG3383] Uncharacterized anaerobic dehydrogenase |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
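The coordinates and lengths in the record above can be cross-checked with a short calculation. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the gene length includes one stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(4450091, 4452943)
print(length_bp)                  # 2853
print(protein_length(length_bp))  # 950
```

Both values agree with the Gene Length and Protein Length fields listed for Shewmr4_3724.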
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGTTAA CTCGCAAGTC CGATGTCGCC CAAGTGGCCG ACAAACCGAC GTTAGGCATT AGCCGTCGTC AATTTATGAA GCAAGCAGGT ATTACTACCG GTGGTATCGC AGCAGCTTCT CTGATGGGTA CTGGCATGAT GCGCCGCGCA GAAGCCAAAG ATGTGCCACA CGACGCGCCG ATCGAAGTTA AACGTACGAT TTGTAGTGCC TGTGCTGTGG GTTGTGGTCT GTATGCCGAA GTGCAAAATG GTGTGTGGAC GGGTCAAGAA CCTGCATTCG ATCACCCATT CAATGCTGGC GGTCACTGCG CTAAGGGTGC TGCACTGCGT GAGCATGGCC ACGGTGAAAA ACGCCTGAAA TACCCAATGA AATTGGTTGA TGGTAAGTGG AAAAAAATCT CTTGGGAAGA TGCGATTAAC GAAGTGGGCG ACCAAATGCT CAACATTCGT AAAGAATCAG GCCCAGACTC AGTGTACTTC ATGGGTAGCG CTAAGTTCTC TAACGAAGGC TGCTATATGT ACCGCAAACT GGCGGCCATG TGGGGCACAA ACAACGTCGA CCACTCAGCT CGTATTTGTC ACTCTACCAC GGTAGCCGGT GTTGCTAACA CTTGGGGTTA CGGTGCGCAA ACTAACTCTT TCAACGACAT TCAGAATGCC AATGCCATCT TCCTGATCGG GGCCAACCCA GCAGAAGCGC ATCCAGTATC TATGCAGCAC ATTCTGATCG CTAAAGAGAA AAACAACGCG AAAATCATCG TTGTTGACCC ACGTTTCTCT CGCACTGCGG CGCACTCAGA TCTGCACTGC GCGATTCGTC CAGGTACTGA CATTCCCTTT ATCTACGGTA TGTTATGGCA CATTTTCGAA AACGGTTGGG AAGATAAGAC CTTTATCCAA CAACGCGTAT TTGAGATGGA AACCATCCGC GAAGAAGCGA AAAAATTCCC ACCTAAAGAA GTGGCAAATA TCACTGGCGT AAGTGAAGAA GTCATTTATC AAGCCGCGAA ACTGATGGCG GAAAACCGTC CAGGTACCGT GATTTGGTGT ATGGGTGGTA CTCAACACCA CGTCGGTAAC GCTAACACCC GTGCTTACTG TATTCTGCAA TTAGCCTTAG GCAACATGGG CGTTTCTGGC GGCGGTACTA ACATTTTCCG TGGCCACGAC AACGTACAAG GCGCGACCGA CTTAGGTCTG CTGTTCGATA ACCTACCAGG TTACTACGGT TTAACCTCAG CCGCTTGGAC TCACTGGACC CATGTGTGGG ATCTAGATAT GGAGTGGGTG AAGAGCCGCT TCGATCAAAA CGCCTATTTA GGCAAAGATC CAATGACCAC CCCTGGTATT CCTTGTTCTC GCTGGCACGA TGGCGTGTTA GAAGACAAGA GCAAGCTGGC ACAGAAAGAC AATATCCGTA TGGCCTTCTT CTGGGGTCAA TCGGTCAACA CTGAAACCCG TCAACGTGAA GTGCGTGATG CTTTAGACAA GATGGACACA GTAGTGGTTG TCGACCCATT CCCAACCATG GCGGGTGTGA TGCACCGTCG TAAGAATGGC GTGTATCTGT TACCTGCTGC GACTCAGTTT GAAACTCAAG GTTCAGTGTC TAACTCAGGC CGTTCTATCC AATGGCGTGA GCAGGTTATC CAACCTTTAT TCGAGTCAAA AACCGACATC GAAATTATGT ACCGTTTAGC GCAAAAACTC GGTATTGCCG AGCAATACAC TAAACGCATC GCCAAAGAAA ACGGCTTACC GGTTATCGAA GAGATCACCC GCGAAATCAA CCGCGGCATG 
TGGACCATCG GTATGACAGG TCAAAGCCCT GAGCGTATCA AGCTGCACAC CCAAAACTGG GGCACTTTCA GCAACAAGAC GCTCGAAGCC GTCGGCGGCC CAGCGAAGGG CGAAACCTAC GGTTTACCTT GGCCATGTTG GGGCACACCA GAAGCTAAAC ACCCTGGTAC CCAAATTCTG TATAACCAAT CCAAACACGT TAAAGACGGC GGCGGTAACT TCCGCGCTCG TTACGGCGTT GAATACAATG GCAAAAACCT ACTGGCTGAA GGCTCTTTCT CTAAAGGTGC CGAGATCCAA GACGGTTACC CAGAATTTAC CGACAAGCTG CTGAAGCAAC TCGGTTGGTG GGATGACCTG ACTGCGGAAG AAAAAGCCGA AGCTGAAGGC CGCAACTGGA AGACAGACTT GTCAGGCGGC ATAGTGCGCG TGGCAATCAA GCACGGTTGT ATTCCATTTG GTAACGCTAA AGCCCGTTGT ATTGTTTGGA CTTTCCCAGA CCAAGTGCCA GTTCACCGCG AGCCGTTATA CACAGCACGC CGTGACTTAG TGGCTAAATA CCCAACCTAC GACGATATGC AAGTTCATCG TCTGCCAACA CTGTACAAGT CAATCCAAGA GAAAGACTTC AGCGGCAAGT ACCCACTGGT ACTGACCTCT GGTCGTTTAG TGGAATACGA AGGTGGTGGT GAAGAATCTC GTTCTAACCC ATGGCTGGCT GAGCTTCAAC AGGAAATGTT TGTTGAAATC AACCCAGGTG ACGCAGCCGA CCGCGGTATC CGCAACGGTG AGTTTGTGTG GTTAGAGGGC GCCGAAGGTG GCCGCATTAA AGTACAAGCC ATGGTAACAC CACGCGTTAA ACCAGGTGTG ACCTTTATGC CATACCACTT TGCGGGTGTG ATGCACGGTG AAAGCTTAGC GCCTAACTAT CCTGAGGGCA CTGTGCCTTA CGTTATCGGT GAATCCGCTA ACACGGCACT GACCTATGGT TATGACCCTG TGACTCAAAT GCAGGAAACC AAAGCGTCGC TCTGTCAGAT CGTTAAAGCG TAA
|
Protein sequence | MKLTRKSDVA QVADKPTLGI SRRQFMKQAG ITTGGIAAAS LMGTGMMRRA EAKDVPHDAP IEVKRTICSA CAVGCGLYAE VQNGVWTGQE PAFDHPFNAG GHCAKGAALR EHGHGEKRLK YPMKLVDGKW KKISWEDAIN EVGDQMLNIR KESGPDSVYF MGSAKFSNEG CYMYRKLAAM WGTNNVDHSA RICHSTTVAG VANTWGYGAQ TNSFNDIQNA NAIFLIGANP AEAHPVSMQH ILIAKEKNNA KIIVVDPRFS RTAAHSDLHC AIRPGTDIPF IYGMLWHIFE NGWEDKTFIQ QRVFEMETIR EEAKKFPPKE VANITGVSEE VIYQAAKLMA ENRPGTVIWC MGGTQHHVGN ANTRAYCILQ LALGNMGVSG GGTNIFRGHD NVQGATDLGL LFDNLPGYYG LTSAAWTHWT HVWDLDMEWV KSRFDQNAYL GKDPMTTPGI PCSRWHDGVL EDKSKLAQKD NIRMAFFWGQ SVNTETRQRE VRDALDKMDT VVVVDPFPTM AGVMHRRKNG VYLLPAATQF ETQGSVSNSG RSIQWREQVI QPLFESKTDI EIMYRLAQKL GIAEQYTKRI AKENGLPVIE EITREINRGM WTIGMTGQSP ERIKLHTQNW GTFSNKTLEA VGGPAKGETY GLPWPCWGTP EAKHPGTQIL YNQSKHVKDG GGNFRARYGV EYNGKNLLAE GSFSKGAEIQ DGYPEFTDKL LKQLGWWDDL TAEEKAEAEG RNWKTDLSGG IVRVAIKHGC IPFGNAKARC IVWTFPDQVP VHREPLYTAR RDLVAKYPTY DDMQVHRLPT LYKSIQEKDF SGKYPLVLTS GRLVEYEGGG EESRSNPWLA ELQQEMFVEI NPGDAADRGI RNGEFVWLEG AEGGRIKVQA MVTPRVKPGV TFMPYHFAGV MHGESLAPNY PEGTVPYVIG ESANTALTYG YDPVTQMQET KASLCQIVKA
|
| |
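The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any analysis. The following is a minimal sketch, assuming plain A/C/G/T input, of how the gene sequence could be translated with translation table 11 and how a GC percentage could be computed. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical. The sketch ignores table 11's alternative start codons and treats every TGA as a stop, so it does not model selenocysteine.

```python
BASES = "TCAG"
# Amino acids of NCBI translation table 11, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display, and uppercase."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Raises KeyError on any codon containing a character other than A, C, G, T.
    """
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence listed above.
prefix = "ATGAAGTTAA CTCGCAAGTC CGATGTCGCC"
print(translate(prefix))  # MKLTRKSDVA
```

The printed residues match the first ten residues of the protein sequence in the record. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the Protein Length and GC content fields.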