Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sbal195_4228 |
Symbol | |
ID | 5756059 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella baltica OS195 |
Kingdom | Bacteria |
Replicon accession | NC_009997 |
Strand | + |
Start bp | 5002940 |
End bp | 5005792 |
Gene Length | 2853 bp |
Protein Length | 950 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641290584 |
Product | molybdopterin oxidoreductase |
Protein accession | YP_001556646 |
Protein GI | 160877330 |
COG category | [C] Energy production and conversion [R] General function prediction only |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing [COG3383] Uncharacterized anaerobic dehydrogenase |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.00413313 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAGTTAA CTCGCAAGTC TAGTGTCGCC CCAGTGGCAG ACAAACCGGC GTTAGGCATT AGCCGTCGTC AATTTATGAA GCACGCAGGT ATTGCCACCG GAGGTATTGC GGCGGCATCA CTAATGGGCA CAGGCATGAT GCGTCGCGTC GAAGCCAAAG AAGTGCCACA TGATGCCCCT ATCGAAGTCA AACGTACTAT TTGTAGTGCC TGCGCCGTCG GTTGTGGTCT GTATGCTGAA GTACAAAATG GCGTGTGGAC GGGTCAAGAA CCTGCATTCG ATCATCCCTT CAACGCCGGT GGTCATTGCG CTAAAGGTGC GGCACTGCGT GAGCATGGCC ATGGCGAAAA ACGCCTGAAA TACCCAATGA AATTGGTTGA TGGCAAGTGG AAAAAAATCT CTTGGGAAGA TGCGATTAAC GAAGTGGGCG ACCAAATGCT CAACATTCGT AAAGAATCCG GCCCAGACTC AGTGTACTTC ATGGGTAGCG CTAAATTCTC AAACGAAGGT TGCTACATGT ACCGCAAACT GGCGGCCATG TGGGGCACAA ACAACGTCGA CCATTCAGCT CGTATTTGTC ACTCTACCAC GGTAGCCGGT GTTGCTAACA CTTGGGGCTA CGGTGCGCAA ACTAACTCCT TTAACGATAT CCAAAACGCG AATGCGATTT TCTTTATCGG TGCTAACCCA GCAGAAGCCC ACCCAGTGGC GATGCAACAT ATTCTGATCG CTAAAGAAAG AAACAACGCT AAGTTGATCG TAGTTGATCC ACGCTTCTCT CGTACCGCGG CGCATTCCGA TCTGCACTGT GCATTACGTC CAGGTACTGA TATTCCTTTT ATCTACGGTA TGTTATGGCA CATTTTTGAA AACGGTTGGG AAGATAAAAC CTTTATTCAA CAACGTGTTT TCGAGATGGA CACCATTCGC GCCGAGGCGA AAAAATTCCC ACCTAAAGAA GTAGAAAACA TCACAGGCGT GAGCGAAGCC GAGGTTTATC AAGCCGCGAA ACTCATGGCT GATAACCGCC CAGGTACTGT GATCTGGTGT ATGGGTGGTA CTCAACACCA CGTCGGTAAT GCCAATACCC GCGCTTACTG TATTCTGCAA TTAGCGCTAG GTAACATGGG CGTGTCAGGC GGCGGAACGA ACATTTTCCG TGGTCACGAT AACGTACAAG GCGCGACTGA CTTAGGTTTG TTATTCGATA ACTTACCGGG TTACTACGGC TTAACAACAG GTGCTTGGAC TCACTGGACC CACGTGTGGG ATCTAGACAT GGAATGGATG AAAGGCCGTT TCGATCAAGG CACTTATCTT GGCCGCGAAC CTATGACTAC GCCGGGTATC CCTTGTTCTC GCTGGCACGA TGGTGTACTA GAAGAAAAAG GCAAACTCGC GCAGCAAGAT AACATTCGCA TGGCGTTTTT CTGGGGACAA TCGGTGAACA CCGAAACCCG CCAACGTGAT GTGCGTGATG CCTTAGACAA GATGGACACT GTGGTGGTGG TCGATCCATT CCCAACTATG GCAGGTGTGA TGCATCGCCG TAAGAATGGT GTTTACTTGC TACCAGCAGC GACACAGTTT GAAACTGAAG GCTCAGTTTC AAACTCTGGC CGCAGTCAGC AATGGCGCGA AAAAGTCATA GAGCCATTGT TTGAATCTAA AACCGATATC GAAATCATGT ATCGCTTAGC GCAAAAACTT GGTTTTGCTG AGCAGTACAC TAAACGCATC GCTAAAGAAA ATGATTTGCC TGTGATTGAA GACATCACCC GCGAAATCAA CCGCGGCATG TGGACTATCG GTATGACGGG CCAAAGCCCT GAACGTCTGA AACTGCATAC CCAAAACTGG GGCACCTTCA GCAATAAGAC GCTCGAAGCC GCAGGCGGCC CAGCAAAAGG CGAAACCTAC GGTTTACCTT GGCCATGTTG GGGCACGCCA GAAGCGAAAC ACCCCGGCAC CCAAATTCTT TACGATCAAT CCAAACATGT TAAAGACGGT GGCGGTAACT TCCGTGCCCG TTACGGCGTT GAATACAATG GTAAAAACCT ATTGGCTGAA GGCTCTTTCT CTAAAGGTGC TGAAATCCAA GACGGTTATC CAGAGTTCTC CGACAAGCTG CTCAAGCAAC TCGGTTGGTG GGATGATCTG ACGGCGGCAG AAAAAGCCGA AGCCGAAGGC AGAAACTGGA AGACAGACTT GTCAGGCGGC ATAGTGCGCG TAGCAATCAA GCATGGCTGT ATTCCTTTCG GTAACGCTAA AGCCCGTTGT ATTGTTTGGA CTTTCCCAGA CCAAGCGCCA GTTCACCGCG AACCGTTATA CACGCCACGC CGTGACTTAG TGGCCAAATA CCCAACCTAT GCCGACATGC AAGTGCACCG TCTGCCTACG CTGTATAAGT CAATCCAAGA CAAAGACATG AGCGGCAAGT ACCCACTGGT ACTGACCTCA GGCCGTCTGG TTGAATACGA GGGTGGTGGT GAAGAATCTC GTTGTAACCC TTGGCTAGCT GAACTTCAAC AGGAAATGTT TGTTGAGATA AGCCCTGCCG ATGCAGCTGA TCGCGGCATT CGTAACGGTG AAGACGTCTG GTTAGAAGGC GCCGAAGGTG GCCGCATTAA GGTACAAGCT ATGGTGACGC CACGCGTTAA ACCTGGCGTG ACCTTTATGC CATATCACTT TGCGGGTGTG ATGCACGGTG AAAGTTTAGC GCCTAATTAT CCAGAGGGCA CAGTACCTTA CGTTATCGGT GAATCCGCTA ACACTGCGCT GACCTATGGT TATGATCCTG TGACGCAAAT GCAGGAAACC AAGGCATCAC TGTGTCAGAT CGTGAAAGCG TAA
|
Protein sequence | MKLTRKSSVA PVADKPALGI SRRQFMKHAG IATGGIAAAS LMGTGMMRRV EAKEVPHDAP IEVKRTICSA CAVGCGLYAE VQNGVWTGQE PAFDHPFNAG GHCAKGAALR EHGHGEKRLK YPMKLVDGKW KKISWEDAIN EVGDQMLNIR KESGPDSVYF MGSAKFSNEG CYMYRKLAAM WGTNNVDHSA RICHSTTVAG VANTWGYGAQ TNSFNDIQNA NAIFFIGANP AEAHPVAMQH ILIAKERNNA KLIVVDPRFS RTAAHSDLHC ALRPGTDIPF IYGMLWHIFE NGWEDKTFIQ QRVFEMDTIR AEAKKFPPKE VENITGVSEA EVYQAAKLMA DNRPGTVIWC MGGTQHHVGN ANTRAYCILQ LALGNMGVSG GGTNIFRGHD NVQGATDLGL LFDNLPGYYG LTTGAWTHWT HVWDLDMEWM KGRFDQGTYL GREPMTTPGI PCSRWHDGVL EEKGKLAQQD NIRMAFFWGQ SVNTETRQRD VRDALDKMDT VVVVDPFPTM AGVMHRRKNG VYLLPAATQF ETEGSVSNSG RSQQWREKVI EPLFESKTDI EIMYRLAQKL GFAEQYTKRI AKENDLPVIE DITREINRGM WTIGMTGQSP ERLKLHTQNW GTFSNKTLEA AGGPAKGETY GLPWPCWGTP EAKHPGTQIL YDQSKHVKDG GGNFRARYGV EYNGKNLLAE GSFSKGAEIQ DGYPEFSDKL LKQLGWWDDL TAAEKAEAEG RNWKTDLSGG IVRVAIKHGC IPFGNAKARC IVWTFPDQAP VHREPLYTPR RDLVAKYPTY ADMQVHRLPT LYKSIQDKDM SGKYPLVLTS GRLVEYEGGG EESRCNPWLA ELQQEMFVEI SPADAADRGI RNGEDVWLEG AEGGRIKVQA MVTPRVKPGV TFMPYHFAGV MHGESLAPNY PEGTVPYVIG ESANTALTYG YDPVTQMQET KASLCQIVKA
|
| |