Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Shewmr4_0633 |
Symbol | |
ID | 4251651 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella sp. MR-4 |
Kingdom | Bacteria |
Replicon accession | NC_008321 |
Strand | - |
Start bp | 732536 |
End bp | 735643 |
Gene Length | 3108 bp |
Protein Length | 1035 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 638117196 |
Product | molydopterin dinucleotide-binding region |
Protein accession | YP_732770 |
Protein GI | 113968977 |
COG category | [C] Energy production and conversion |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.0100143 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATAAGA GTAAACGTCG TTTTATCAAA GGCGCAGCTG TGACGGGTGG CGTGGGATTA TTTGCCGCAG GTTACAGCCA TACCTTAGCG CAAATGGGCA AAGGGGTCGT GAATGGCAGT TCAGGCAAAA CCACCCAAGA TCCCATCCAT GGCAATTCGC TAGCCCCTGA ATATAAAGTG GATTTAACCT CAGGCCAGCT GATTGGCAAC GATGAACAAC GCATCGCCAA TACCATGTGT CTCGGCTGCT GGACCAAGTG TGGCGTGCGC GCCCGTATCG ATAATAAAAC AGATAAGATC CTCCGCATCA GCGGTAATCC CTATCATCCA CTCTCGGCGC GGGAACATAT CGACTTCGAT ACCCCGATCA AAACCGCCTT AGTCGGTTTA AGCGCCTATC AGGAGCAGGG CTTACAGGGG CGCTCTACCG CCTGCGCCCG TGGCAATGCC ATGTTAGAAC AACTAGATAG TCCCTATCGC GTCACCCAAT GCCTTAAACG GGTCGGGCCA CGCGGCAGTG GTCAATGGCA AAGCATTAGT TTCGAGCAAC TGCTGACCGA GGTCGTCGAT GGCGGTGACT TATTTGGCGA AGGCCATGTC GAAGGCTTAA GAGCGATTCG CGATATCAGC ACGCCACTCG ATGCCAATAA TCCTGAATAT GGCCCCAAGG CCAACCAATT ATTGGTGTCG AACGCTTCGG ATGAGGGCCG CGATGTCATC ATTAAACGCT TTACCTTCAA TAGTTTCGGT ACACGAAATT TTGCCAACCA CGGCGCCTAT TGCGGCTTTA CCTATCGCGC AGGTGCAGGT GCCCTGCTCG ACGATTTAAA AAAGTACGCT CACCTTAAGC CCGATTGGGA CAATAGCGAG TTTCTGCTGT TTATCGGCAC CTCGCCGCAA CAGTCGGGCA ATCCTTTTAA ACGCCAAGCA CGGCAGTTGG CCAACGCTCG GGTCGATGGG CGAGAATTCA GTTATGTGGT GGTGTCGCCC ATGTTGCCCA ACACCTCGAA CCAGCCCTGC GCGGGCAACA ATACTTGGCT GCCCATTCGC CCCGCAACCG ACTCCGCCCT CGCCATGGGC ATGTTGCGCT GGATTATCGA TAACCAGCGC TACGCCAGTG CGTTTTTAAC CGCGCCCAAT GCCGAGGCGG CGAAAGCAGC AGGCTACCGA GGCTTTAGTA ATGCCAGCTT CTTAATCATC AGCACGCCAA ATCACCCCAG ATACGGCCAA TATTTGTATG CCAGTGATCT CGGCTTGCCC TTCGAAGGTA AAGCTTACGG CGAGCAGGAT TCACCACTGG TTGTCGAGGC CGCTACTGGG CAGATCCTAC CTGCCGGACA ATGCGCCAAG GCGCAACTTG AGGTTGAGCA AAGCGTGAAT ACGCCAAAGG GCGAGCTGAG CGTGAAATCG AGTTTTCAGC TGCTCAGTGA AGCCGCGCGC AGCGTCGAGA TGGCCGACTA CGCCAAGGAA TGCGATATCA GCGAGGCGCA AATTGCCGCC CTCGCCCACA AATTCACAGA TCACGGCACT AAGGCGGCGG TGATTTCCCA CGGCGGTACC ATGAGTGCTA ACGGTTTTTA CAGTGCATGG GCGATCATGA TGCTCAACGC CATGATTGGT AACCTAAATC TTAAGGGCGG CGCTATGGCT AAGGGCGGCA CCTTCCCCGC CTTTGGTGCT GGCCCAAGGT ACAATTTGCA GGACTTTGAC GGCATGGTCG AACCTAAGGG CGTATTCCTT TCCCGCTCTA AGTTCCCCTA CGAAAAAACC TCTGAGTACC GCCGTAAAAT CGAAGCGGGA GAGTCGCCCT ATCCGTCGAA GGAGCCTTGG TTCCCCATAT CATCGCCCCT GCTGGGCGAA CATTTAACCG CGGCGGTGCA CGGTTATCCC TATCGACTCA AGGCGTGGAT CAACCATATG GGCAACCCCA TTTATGGCCA GGCGGGTCTC GGCGAAGCCA TTGGTGAACA ATTAAAAGAT CCCAAAGTAT TATCGCTATT TATCAGCATC GACAGCTTTA TCAATGAGAC CTCGGCGCTG TCGGACTACA TAGTGCCCGA CACCTTAACC TATGAATCTT GGGGCTGGAG CGCCGCGTGG CACGGTACGC TCACCAAAGA AGCCACCGGG CGCTGGCCGA TTGTCGAGCC GAGGGTCGCC AAAACTGCCG ACGGCGATGT GATTTGTATG GAGTCCTTCC TTAGCCAAGT GGCGATAAAA CTTAATCTGC CCGGCTTTGG CGCCAATGCT ATCAAGGCCG CAGATGGCAG TTTGCATCCT TTGAGCCGCG CGGCCGATTT CTCGCTCTAC GGCGCGGCGA ACGTGGCTTA CCTTGGTAAA CCCGTTGCCG ATATCAATGC CGAAGATATG TACTTCAGTG GTGTTGGGCG CATTCAAGCG GAACTCAATG CCAGACTCAA ACCCGAAGAG GTCAAAAAAG TCAGTTACTT ATTTGCCCGT GGCGGCCGCT TTGAGAATAT CAGCGCGGGT CGCGACGAGG CGGGCAACCC CAGCAAGGTT TGGCCAAAAC CTTTAATGCT GTGGAACCCA GAAGTCGGCA GTCGGCGCAA CAGCTTGAGC GGTGAATATA TGAGTGGCTG TCCACGGCGT TATCACCCTG CGCTCGCCAA CGGCACACCC TTGGAGCAAG CATTCGATAA GTCACAGTGG CCGCTGCTGA TCAGCAGTTA TAAGTCGCAT ACTATGAGTT CGATGAGCAT AGGTTCGGAC CGTTTGCGCC AAGTGCACCC CAACAATCCG GTGCGCATCA ATGAACAAAC CGCGGCACGT TTTGGCATCA AGACCGGAGA TACGGTCAAG ATCAGCACGC CAAATGGCAG TGTGGTGGCC TTGGTGGAAT GTGTGGCGGG CGTGCATCCC GACTGCATTG CCATCGAACA CGGTTATGGC CATAAGGAGT TAGGCGCGCG GGATATCGTC ATCGATGATA AAAAGATTGC CGCCAATCCG CTGATCCGCG TCGGAGTCAA TATGAACGAC TTGAATCTGT TCGACCCAAG CCGCCATGGG CGCTATCCGC TGGTGGACTG GGCCAGCGGT GCATCGGCGC GGCAAGGACT TCCCGCCAAG ATAGAGAAAA TCGCTTAA
|
Protein sequence | MDKSKRRFIK GAAVTGGVGL FAAGYSHTLA QMGKGVVNGS SGKTTQDPIH GNSLAPEYKV DLTSGQLIGN DEQRIANTMC LGCWTKCGVR ARIDNKTDKI LRISGNPYHP LSAREHIDFD TPIKTALVGL SAYQEQGLQG RSTACARGNA MLEQLDSPYR VTQCLKRVGP RGSGQWQSIS FEQLLTEVVD GGDLFGEGHV EGLRAIRDIS TPLDANNPEY GPKANQLLVS NASDEGRDVI IKRFTFNSFG TRNFANHGAY CGFTYRAGAG ALLDDLKKYA HLKPDWDNSE FLLFIGTSPQ QSGNPFKRQA RQLANARVDG REFSYVVVSP MLPNTSNQPC AGNNTWLPIR PATDSALAMG MLRWIIDNQR YASAFLTAPN AEAAKAAGYR GFSNASFLII STPNHPRYGQ YLYASDLGLP FEGKAYGEQD SPLVVEAATG QILPAGQCAK AQLEVEQSVN TPKGELSVKS SFQLLSEAAR SVEMADYAKE CDISEAQIAA LAHKFTDHGT KAAVISHGGT MSANGFYSAW AIMMLNAMIG NLNLKGGAMA KGGTFPAFGA GPRYNLQDFD GMVEPKGVFL SRSKFPYEKT SEYRRKIEAG ESPYPSKEPW FPISSPLLGE HLTAAVHGYP YRLKAWINHM GNPIYGQAGL GEAIGEQLKD PKVLSLFISI DSFINETSAL SDYIVPDTLT YESWGWSAAW HGTLTKEATG RWPIVEPRVA KTADGDVICM ESFLSQVAIK LNLPGFGANA IKAADGSLHP LSRAADFSLY GAANVAYLGK PVADINAEDM YFSGVGRIQA ELNARLKPEE VKKVSYLFAR GGRFENISAG RDEAGNPSKV WPKPLMLWNP EVGSRRNSLS GEYMSGCPRR YHPALANGTP LEQAFDKSQW PLLISSYKSH TMSSMSIGSD RLRQVHPNNP VRINEQTAAR FGIKTGDTVK ISTPNGSVVA LVECVAGVHP DCIAIEHGYG HKELGARDIV IDDKKIAANP LIRVGVNMND LNLFDPSRHG RYPLVDWASG ASARQGLPAK IEKIA
|
| |