Gene Information |
Locus tag | Shewmr7_3795 |
Symbol | |
ID | 4255379 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella sp. MR-7 |
Kingdom | Bacteria |
Replicon accession | NC_008322 |
Strand | + |
Start bp | 4508932 |
End bp | 4511784 |
Gene Length | 2853 bp |
Protein Length | 950 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 638124486 |
Product | formate dehydrogenase alpha subunit |
Protein accession | YP_739831 |
Protein GI | 114049281 |
COG category | [C] Energy production and conversion; [R] General function prediction only |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing; [COG3383] Uncharacterized anaerobic dehydrogenase |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
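
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the gene length includes the terminal stop codon (the gene sequence in this record ends in TAA). The function and variable names are illustrative, not part of the database.

```python
# Values copied from the Gene Information table of this record.
start_bp = 4508932
end_bp = 4511784
gene_length_bp = 2853
protein_length_aa = 950


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks hold for this record under the assumptions stated above.
coords_consistent = span_length(start_bp, end_bp) == gene_length_bp
protein_consistent = expected_protein_length(gene_length_bp) == protein_length_aa
```

Under these assumptions, 4511784 - 4508932 + 1 = 2853 bp, and 2853 / 3 - 1 = 950 aa, which agrees with the Gene Length and Protein Length fields.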
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGTTAA CTCGCAAGTC CGATGTCGCC CAAGTGGCCG ACAAACCGAC GTTAGGCATT AGCCGTCGTC AATTTATGAA GCAAGCAGGT ATTACTACCG GTGGTATCGC AGCCGCTTCT CTGATGGGTA CTGGCATGAT GCGCCGCGCA GAAGCCAAAG ATGTGCCACA CGACGCGCCG ATCGAAGTTA AACGTACGAT TTGTAGTGCC TGTGCTGTGG GTTGTGGTCT GTATGCCGAA GTGCAAAATG GTGTGTGGAC GGGTCAAGAA CCTGCATTCG ATCACCCATT CAATGCCGGC GGTCACTGCG CTAAGGGTGC TGCACTGCGT GAGCATGGCC ACGGTGAAAA ACGCCTGAAA TACCCAATGA AATTGGTTGA TGGTAAGTGG AAAAAAATCT CTTGGGAAGA TGCGATTAAC GAAGTGGGCG ACCAAATGCT CAACATTCGT AAAGAATCTG GCCCAGACTC AGTGTACTTC ATGGGTAGCG CTAAGTTCTC TAACGAAGGC TGCTATATGT ACCGCAAACT GGCGGCCATG TGGGGCACAA ACAACGTCGA CCACTCTGCT CGTATTTGTC ACTCTACCAC GGTAGCCGGT GTTGCTAACA CTTGGGGTTA CGGTGCGCAA ACTAACTCTT TCAACGACAT TCAGAATGCC AATGCCATCT TCCTGATCGG GGCAAACCCT GCGGAAGCGC ATCCAGTATC GATGCAACAC ATTCTGATCG CCAAAGAGAA AAACAACGCG AAAATCATCG TCGTTGACCC ACGTTTCTCT CGCACTGCGG CGCACTCAGA TCTGCACTGC GCGATTCGTC CAGGTACTGA CATTCCCTTT ATCTACGGTA TGTTATGGCA CATTTTCGAA AACGGTTGGG AAGATAAGAC CTTTATCCAA CAACGCGTAT TCGAGATGGA AACCATCCGC GAAGAAGCGA AAAAATTCCC ACCTAAAGAA GTGGCAAATA TCACTGGCGT AAGTGAAGAA GTCATTTATC AAGCCGCGAA ACTGATGGCG GAAAACCGTC CAGGTACCGT GATTTGGTGT ATGGGTGGTA CTCAACACCA CGTCGGTAAC GCTAACACCC GTGCTTACTG TATTCTGCAA TTAGCCTTAG GCAACATGGG CGTTTCTGGT GGCGGTACTA ACATTTTCCG TGGCCACGAC AACGTACAGG GCGCGACCGA CTTAGGTCTG CTGTTCGATA ACCTACCAGG TTACTACGGT TTAACCTCAG CCGCTTGGAC TCACTGGACC CATGTGTGGG ATCTAGATAT GGAGTGGGTG AAGAGCCGCT TCGATCAAAA CGCCTATTTA GGCAAAGATC CAATGACCAC CCCTGGTATT CCTTGTTCTC GCTGGCACGA TGGCGTGTTA GAAGACAAGA GCAAGCTGGC ACAGAAAGAC AATATCCGTA TGGCCTTCTT CTGGGGTCAA TCGGTCAACA CTGAAACCCG TCAACGTGAA GTGCGTGATG CTTTAGACAA GATGGACACA GTAGTGGTTG TCGACCCATT CCCAACCATG GCGGGTGTGA TGCACCGTCG TAAGAATGGC GTATATCTGT TACCTGCTGC GACTCAGTTT GAAACTCAAG GTTCAGTGTC TAACTCAGGC CGTTCTATCC AATGGCGTGA GCAGGTTATC CAACCTTTAT TCGAGTCAAA AACCGACATC GAAATCATGT ACCGTTTAGC GCAAAAACTC GGTATCGCCG AGCAATACAC TAAACGCATC GCCAAAGAAA ACGGCTTACC TGTTATCGAA GAAATCACCC GCGAAATCAA CCGCGGCATG 
TGGACCATCG GTATGACAGG TCAAAGCCCT GAGCGTATCA AGCTGCACAC CCAAAACTGG GGCACTTTCA GTAACAAGAC GCTCGAAGCC GCTGGCGGCC CAGCGAAGGG CGAAACCTAC GGTTTACCTT GGCCATGTTG GGGCACACCA GAAGCTAAAC ACCCTGGTAC CCAAATTCTG TATAACCAAT CTAAACACGT TAAAGACGGC GGCGGTAACT TCCGCGCTCG TTACGGCGTT GAATACAATG GTAAAAACCT GCTGGCTGAA GGCTCTTTCT CTAAAGGTGC CGAGATCCAA GACGGTTACC CAGAATTTAC CGACAAGCTG CTGAAGCAAC TCGGTTGGTG GGATGACCTG ACTGCGGAAG AAAAAGCCGA AGCCGAAGGC CGCAACTGGA AGACCGACTT ATCTGGCGGT ATCGTGCGCG TGGCAATCAA GCACGGTTGT ATTCCATTTG GTAACGCGAA AGCCCGTTGT ATTGTTTGGA CTTTCCCAGA CCAAGTGCCA GTTCACCGCG AGCCGTTATA CACAGCACGC CGTGACTTAG TGGCTAAATA CCCAACCTAC GACGATATGC AAGTTCATCG TCTGCCAACA CTGTACAAGT CAATCCAAGA GAAAGACTTC AGCGGCAAGT ACCCACTGGT ACTGACCTCT GGTCGTTTAG TGGAATACGA AGGTGGTGGT GAAGAATCTC GTTCTAACCC ATGGCTGGCT GAGCTTCAAC AGGAAATGTT TGTTGAAATC AACCCAGGTG ACGCAGCCGA CCGCGGTATC CGCAACGGTG AGTTTGTGTG GTTAGAGGGC GCCGAAGGTG GCCGCATTAA AGTACAAGCC ATGGTAACGC CACGCGTTAA ACCAGGTGTG ACCTTTATGC CATACCACTT TGCGGGTGTG ATGCACGGTG AAAGCTTAGC GCCTAACTAT CCTGAGGGCA CTGTGCCTTA CGTTATCGGT GAATCCGCTA ACACGGCACT GACCTATGGT TATGACCCTG TGACTCAAAT GCAGGAAACC AAAGCGTCGC TCTGTCAGAT CGTTAAAGCG TAA
|
Protein sequence | MKLTRKSDVA QVADKPTLGI SRRQFMKQAG ITTGGIAAAS LMGTGMMRRA EAKDVPHDAP IEVKRTICSA CAVGCGLYAE VQNGVWTGQE PAFDHPFNAG GHCAKGAALR EHGHGEKRLK YPMKLVDGKW KKISWEDAIN EVGDQMLNIR KESGPDSVYF MGSAKFSNEG CYMYRKLAAM WGTNNVDHSA RICHSTTVAG VANTWGYGAQ TNSFNDIQNA NAIFLIGANP AEAHPVSMQH ILIAKEKNNA KIIVVDPRFS RTAAHSDLHC AIRPGTDIPF IYGMLWHIFE NGWEDKTFIQ QRVFEMETIR EEAKKFPPKE VANITGVSEE VIYQAAKLMA ENRPGTVIWC MGGTQHHVGN ANTRAYCILQ LALGNMGVSG GGTNIFRGHD NVQGATDLGL LFDNLPGYYG LTSAAWTHWT HVWDLDMEWV KSRFDQNAYL GKDPMTTPGI PCSRWHDGVL EDKSKLAQKD NIRMAFFWGQ SVNTETRQRE VRDALDKMDT VVVVDPFPTM AGVMHRRKNG VYLLPAATQF ETQGSVSNSG RSIQWREQVI QPLFESKTDI EIMYRLAQKL GIAEQYTKRI AKENGLPVIE EITREINRGM WTIGMTGQSP ERIKLHTQNW GTFSNKTLEA AGGPAKGETY GLPWPCWGTP EAKHPGTQIL YNQSKHVKDG GGNFRARYGV EYNGKNLLAE GSFSKGAEIQ DGYPEFTDKL LKQLGWWDDL TAEEKAEAEG RNWKTDLSGG IVRVAIKHGC IPFGNAKARC IVWTFPDQVP VHREPLYTAR RDLVAKYPTY DDMQVHRLPT LYKSIQEKDF SGKYPLVLTS GRLVEYEGGG EESRSNPWLA ELQQEMFVEI NPGDAADRGI RNGEFVWLEG AEGGRIKVQA MVTPRVKPGV TFMPYHFAGV MHGESLAPNY PEGTVPYVIG ESANTALTYG YDPVTQMQET KASLCQIVKA
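
The relationship between the gene sequence and the protein sequence can be illustrated by translating a few codons. The following is a minimal sketch, not the pipeline used by the database. It relies on the fact that translation table 11 assigns amino acids to codons the same way the standard code does (the two differ in permitted start codons). The helper names are hypothetical.

```python
# Codon table built in the conventional TCAG order. The amino acid
# assignments are those of the standard code, which translation
# table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from this record.
first_bases = "ATGAAGTTAA CTCGCAAGTC CGATGTCGCC"
first_residues = translate(first_bases)  # "MKLTRKSDVA"
```

The ten residues obtained this way match the first block of the protein sequence above. Translating the whole gene would require pasting both lines of the gene sequence into one string.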
|
| |