Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sbal223_0229 |
Symbol | |
ID | 7085885 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella baltica OS223 |
Kingdom | Bacteria |
Replicon accession | NC_011663 |
Strand | - |
Start bp | 260416 |
End bp | 263268 |
Gene Length | 2853 bp |
Protein Length | 950 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 643459149 |
Product | molybdopterin oxidoreductase |
Protein accession | YP_002356189 |
Protein GI | 217971438 |
COG category | [C] Energy production and conversion [R] General function prediction only |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing [COG3383] Uncharacterized anaerobic dehydrogenase |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.363214 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGTTAA CTCGCAAGTC TAGTGTCGCC CCAGTGGCAG ACAAACCGGC GTTAGGCATT AGCCGTCGTC AATTTATGAA GCACGCAGGT ATTGCCACCG GTGGTATTGC GGCGGCATCA CTAATGGGCA CAGGCATGAT GCGTCGCGTC GAAGCCAAAG AAGTGCCACA TGATGCCCCC ATCGAAGTCA AACGTACTAT TTGTAGTGCC TGCGCCGTCG GTTGTGGTCT GTATGCTGAA GTACAAAATG GCGTGTGGAC GGGTCAAGAA CCTGCATTCG ATCACCCCTT CAATGCCGGT GGTCACTGTG CTAAAGGTGC GGCACTGCGT GAGCATGGCC ACGGTGAAAA ACGCCTGAAA TACCCAATGA AATTAGTTGA TGGCAAGTGG AAAAAAATCT CTTGGGAAGA TGCGATTAAC GAAGTGGGCG ACCAAATGCT CAACATTCGT AAAGAATCCG GCCCAGACTC AGTGTACTTC ATGGGTAGCG CTAAATTCTC AAACGAAGGT TGCTACATGT ACCGCAAACT GGCGGCCATG TGGGGCACAA ACAACGTCGA CCATTCAGCT CGTATTTGTC ACTCTACCAC GGTAGCCGGT GTTGCTAACA CTTGGGGCTA CGGTGCGCAA ACTAACTCCT TTAACGATAT CCAAAACGCG AATGCGATTT TCTTTATCGG TGCTAACCCT GCAGAAGCCC ACCCAGTGGC GATGCAACAT ATTCTGATCG CTAAAGAAAG AAACAACGCT AAGTTGATCG TAGTTGATCC ACGTTTCTCT CGCACAGCGG CGCATTCAGA TCTGCACTGC GCATTGCGTC CAGGTACTGA TATTCCGTTT ATCTACGGTA TGTTATGGCA CATTTTTGAA AACGGTTGGG AGGATAAAAC CTTTATCCAA CAACGTGTTT TCGAGATGGA CACCATTCGC GCCGAGGCGA AAAAATTTCC ACCTAAAGAA GTGGAAAACA TCACAGGCGT GAGCGAAGCC GAGGTTTATC AAGCCGCAAA ATTAATGGCT GATAACCGCC CAGGTACTGT GATCTGGTGT ATGGGTGGAA CTCAACACCA CGTCGGTAAT GCCAACACCC GAGCTTACTG TATTCTGCAA TTAGCGCTAG GTAACATGGG CGTGTCAGGC GGCGGAACGA ACATTTTCCG TGGTCACGAT AACGTACAAG GCGCGACTGA CTTAGGTTTG TTGTTCGATA ACTTACCGGG TTACTACGGC TTAACAACAG GTGCTTGGAC TCACTGGACC CACGTGTGGG ATCTAGACAT GGAATGGATG AAAGGCCGTT TCGATCAAGG CACTTATCTT GGCCGTGAAC CTATGACTAC ACCGGGTATC CCTTGTTCTC GCTGGCACGA TGGTGTACTA GAAGAAAAAG GCAAACTCGC GCAGCAAGAT AACATTCGCA TGGCGTTTTT CTGGGGACAA TCGGTGAACA CCGAAACCCG CCAACGTGAT GTGCGTGATG CTTTAGACAA GATGGACACT GTGGTGGTAG TCGATCCATT CCCAACTATG GCAGGTGTGA TGCATCGCCG TAAGAATGGT GTTTACTTGC TACCAGCAGC GACACAATTT GAAACTGAAG GCTCAGTTTC AAACTCTGGC CGCAGTCAGC AATGGCGCGA AAAAGTCATA GAGCCATTGT TTGAATCTAA AACCGATATC GAAATCATGT ATCGCTTAGC GCAAAAACTC GGTTTTGCTG AGCAGTACAC TAAACACATC GCCAAAGAAA ATGATTTGCC TGTGATTGAA GACATCACCC GCGAAATCAA CCGCGGCATG TGGACTATCG GTATGACGGG CCAAAGCCCT GAACGCCTGA AACTACACAC CCAAAACTGG GGCACCTTCA GCAACAAGAC GCTGGAAGCC GCAGGCGGCC CAGCGAAAGG CGAAACCTAC GGTTTACCTT GGCCATGTTG GGGCACACCA GAAGCGAAAC ACCCAGGCAC CCAAATTCTT TACGATCAAT CCAAACACGT TAAAGACGGT GGCGGTAACT TCCGTGCCCG TTACGGCGTT GAATACAATG GTAAAAACCT ATTGGCTGAA GGCTCTTTCT CTAAAGGTGC TGAAATCCAA GACGGTTATC CAGAGTTCTC CGACAAGCTG CTCAAGCAAC TCGGTTGGTG GGATGATCTC ACGGCGGCAG AAAAAGCCGA AGCCGAAGGC AGAAACTGGA AGACTGATTT GTCAGGCGGC ATAGTTCGCG TAGCGATCAA GCATGGCTGT ATTCCTTTCG GTAACGCTAA AGCCCGTTGT ATTGTTTGGA CTTTCCCAGA CCAAGCGCCA GTTCACCGCG AACCGTTATA CACGCCGCGT CGTGACTTAG TGGCCAAATA CCCCACCTAT GCCGACATGC AAGTGCACCG TCTGCCTACG CTGTATAAGT CAATCCAAGA CAAAGACATG AGCGGCAAGT ACCCACTGGT ACTGACCTCA GGCCGTCTGG TTGAATACGA GGGTGGTGGT GAAGAATCTC GTTGTAACCC TTGGCTAGCT GAACTTCAAC AGGAAATGTT TGTTGAGATA AGCCCTGCCG ATGCTGCTGA TCGCGGCATT CGTAACGGTG AAGACGTCTG GTTAGAAGGC GCCGAAGGTG GCCGTATTAA GGTACAAGCT ATGGTGACGC CACGCGTTAA ACCTGGCGTG ACCTTTATGC CATATCACTT TGCGGGTGTG ATGCACGGTG AAAGTTTAGC GCCTAATTAT CCAGAGGGCA CAGTACCTTA CGTTATCGGT GAATCCGCTA ACACTGCGCT GACCTATGGT TATGACCCTG TGACGCAAAT GCAGGAAACC AAGGCATCAC TGTGTCAGAT CGTTAAAGCG TAA
|
Protein sequence | MKLTRKSSVA PVADKPALGI SRRQFMKHAG IATGGIAAAS LMGTGMMRRV EAKEVPHDAP IEVKRTICSA CAVGCGLYAE VQNGVWTGQE PAFDHPFNAG GHCAKGAALR EHGHGEKRLK YPMKLVDGKW KKISWEDAIN EVGDQMLNIR KESGPDSVYF MGSAKFSNEG CYMYRKLAAM WGTNNVDHSA RICHSTTVAG VANTWGYGAQ TNSFNDIQNA NAIFFIGANP AEAHPVAMQH ILIAKERNNA KLIVVDPRFS RTAAHSDLHC ALRPGTDIPF IYGMLWHIFE NGWEDKTFIQ QRVFEMDTIR AEAKKFPPKE VENITGVSEA EVYQAAKLMA DNRPGTVIWC MGGTQHHVGN ANTRAYCILQ LALGNMGVSG GGTNIFRGHD NVQGATDLGL LFDNLPGYYG LTTGAWTHWT HVWDLDMEWM KGRFDQGTYL GREPMTTPGI PCSRWHDGVL EEKGKLAQQD NIRMAFFWGQ SVNTETRQRD VRDALDKMDT VVVVDPFPTM AGVMHRRKNG VYLLPAATQF ETEGSVSNSG RSQQWREKVI EPLFESKTDI EIMYRLAQKL GFAEQYTKHI AKENDLPVIE DITREINRGM WTIGMTGQSP ERLKLHTQNW GTFSNKTLEA AGGPAKGETY GLPWPCWGTP EAKHPGTQIL YDQSKHVKDG GGNFRARYGV EYNGKNLLAE GSFSKGAEIQ DGYPEFSDKL LKQLGWWDDL TAAEKAEAEG RNWKTDLSGG IVRVAIKHGC IPFGNAKARC IVWTFPDQAP VHREPLYTPR RDLVAKYPTY ADMQVHRLPT LYKSIQDKDM SGKYPLVLTS GRLVEYEGGG EESRCNPWLA ELQQEMFVEI SPADAADRGI RNGEDVWLEG AEGGRIKVQA MVTPRVKPGV TFMPYHFAGV MHGESLAPNY PEGTVPYVIG ESANTALTYG YDPVTQMQET KASLCQIVKA
|
| |