Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sbal195_4224 |
Symbol | |
ID | 5756055 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella baltica OS195 |
Kingdom | Bacteria |
Replicon accession | NC_009997 |
Strand | + |
Start bp | 4997784 |
End bp | 5000636 |
Gene Length | 2853 bp |
Protein Length | 950 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641290580 |
Product | molybdopterin oxidoreductase |
Protein accession | YP_001556642 |
Protein GI | 160877326 |
COG category | [C] Energy production and conversion [R] General function prediction only |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing [COG3383] Uncharacterized anaerobic dehydrogenase |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.00105156 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCGATTAA CCCGCAAAAC AGACCAGGTT GTCGAATCTA AAGCGCCAGC ACTGGGTCTC AATCGTCGTC AGTTCTTAAA ATCGGCAGGT CTTGCCACAG GTGGCATAGC TGCGGCGTCC ATGCTGGGCA CAGGTATGAT GCGTAAAGCA CAAGCCCAAG AAGACATAGG CCACAATGTT CCCACCGAAG TTAAACGTAC CATCTGTTCT CACTGCGCCG TCGGCTGTGG TGTATATGCA GAAGTACAAA ACGGCGTGTG GACAGGTCAA GAACCGGCAT TTGACCATCC CTTTAACCAA GGCGGACACT GCGCGAAAGG TGCGGCACTG CGTGAGCACG GCCACGGTGA AAAACGCCTG AAATACCCAA TGAAGTTAGA AGGCGGCAAG TGGAAAAAGA TCTCTTGGGA TCAAGCGATT AATGAAGTCG GCGACAAGAT GGAAGCGATT CGTCAAGAGT CAGGTCCAGA TTCCGTTTAC TTTATGGGTA GCGCCAAGTT TTCGAACGAA CAAGCTTATT TATATCGCAA ATTAGCGGCA CTGTGGGGCA CGAACAACGT CGACCATTCA GCCCGTATTT GTCACTCTAC CACGGTAGCC GGTGTTGCTA ACACTTGGGG CTACGGTGCG CAAACCAACT CGATTAACGA TATCCGCCAC ACTAAGTGCA TCATGTTTGT GGGTTCAAAC CCAAGTGAAG CCCACCCAGT GGCAATGCAA CATATTCTGG TCGCCAAAGA ACGCGGCGCG AAAGTGATTG TTGTTGATCC ACGTTTCACC CGTACTGCAG CGAAATCTGA CGAGTACGTG CATATCCGCC CAGGTACAGA TATCCCCTTC ATTTATGGCC TGTTATGGCA CATTTTTGAA AACGGCTGGG AAGATAAAGA GTTTATCAAA CAACGCGTAT ACGGCATCGA GCGTATCCGT GAAGAAGCCA AAAAATACAC CCCAGAAGAA GTTGAACTGG TTGCAGGCGT GCCTAAGGCG CAAATGTACC GTGTGGCTAA AATGCTGGCC GAAACGAAAC CCGGCACTAT CGTTTGGTGT ATGGGCGGTA CTCAGCACCA CGTCGGTAAT GCCAACACTC GTTCATACTG TATTTTACAG TTAGCCCTTG GCAACATGGG GGTTTCTGGT GGCGGGACTA ACATCTTCCG TGGTCACGAT AACGTGCAAG GCGCGACTGA CTTTGGTTTG TTATTCGACA ACTTACCCGG TTACTACGGC TTAACGACTG GCGCGTGGGC TCACTGGTCA AATGTATGGG ATCTCGATCC TAAATGGATT GCCAGCCGTT TCGACCAAGG TGAATACCTA GGCCAAACAC CACAAACCTC AGCCGGTATT CCTTGCTCTC GCTGGCACGA TGGGGTGTTA GAAGATAAAA CCAAAATTGC GCAGAAAGAC AATATCCGTC TGGCGTTCTT CTGGGGACAA TCGGTTAACA CTGAAACCCG TGGTCGTGAA GTGCGTGAAG CATTGAACAA GATGGATACT GTGGTTGTGG TCGATCCTTT CCCAACCATG GCAGGTGTTA TGCACCAACG TAAAGACGGT GTGTATCTAT TGCCTGCGGC GACGCAATTC GAAACCTATG GTTCAGTATC GGCCTCAAAC CGTTCGCTCC AATGGCGTAC TAAAGTCATC GAGCCACTGT TTGAGTCTCT GCCCGATCAC ATCATCATGT ATAAGTTCGC TAAAAAATGG GGCGTTGAAA AAGAATTCTG TAAGCACATT CAAGTGAATG GCGAAGAGCC TTTGATTGAA GATGTAACCC GCGAATTCAA TACCGGTATG TGGACTGTCG GTTATACAGG TCAAAGCCCT GAGCGTTTGA AAATGCACCA AGAAAACTGG GGCACTTTCG ATGTCAATAA TCTAACAGCA CCCGGCGGCC CAGCTAAGGG TGAAGTCTAC GGTTTACCTT GGCCATGTTG GGGTACACCA GAGATGAAAC ATCCTGGTAC CCAGATCCTG TACGATCAAT CAAAACACGT TAAAGACGGT GGCGGTAACT TCCGTGCCCG TTACGGTGTT GAGCATGATG GCGTCAACAT TCTTGCCGAA GGTTCTTTCT CTAAAGGCAG TGAGATTCAA GATGGTTATC CTGAATTCTC CGCCGACATG CTCAAGCAAT TGGGTTGGTG GGATGAACTT ACAGATGCAG AGAAAAAACG CGCCGAAGGC AAAAACTGGA AGACTGACAT TTCTGGTGGC ATCCAACGTG TTGCGATCAA GCATGGCTGT ATTCCCTATG GAAATGCTAA AGCCCGTTGT ATCGTTTGGA CTTTCCCAGA TGATATCCCG AAACACCGCG AACCACTTTA CACTCCTCGT CGTGACTTAG TCGCCAAGTA CCCAACCTAC GAAGACCGTA TGGTTGCGCG TCTGCCGACC TTGTATAAGT CAATCCAAGA TAAGGACTTT ACCCAAGGCT ACCCACTGAC ACTGACCTCT GGTCGTCTAG TGGAATACGA AGGCGGCGGT GAAGAATCCC GTTCTAACCC TTGGCTTGCT GAGCTGCAAC AAGAAATGTT CATCGAGATG AACCCTGCGG ATGCTGCGGA TCGCGGCGTG CGTGACGGTG ACAATGTCTT CGTGCATGGC CCAGAAGGCG CCAAGATCAC GGTTAAGGCT ATGGTGACAC CGCGCGTTAT TGCCGGTGAA TGTTTTATGC CATACCACTT TGCTGGTGTG TTTGAAGGCG AAAGCCTAGC GAAAAACTAT CCTGAAGGCA CAGTGCCTTA CGTGATAGGC GAATCGGCAA ACACCATTTT AACTTATGGC TACGACGTTG TGACTCAGAT GCAAGAAACT AAGTCCAGCT TGTGCCAGAT CAGCAAAGCC TAA
|
Protein sequence | MRLTRKTDQV VESKAPALGL NRRQFLKSAG LATGGIAAAS MLGTGMMRKA QAQEDIGHNV PTEVKRTICS HCAVGCGVYA EVQNGVWTGQ EPAFDHPFNQ GGHCAKGAAL REHGHGEKRL KYPMKLEGGK WKKISWDQAI NEVGDKMEAI RQESGPDSVY FMGSAKFSNE QAYLYRKLAA LWGTNNVDHS ARICHSTTVA GVANTWGYGA QTNSINDIRH TKCIMFVGSN PSEAHPVAMQ HILVAKERGA KVIVVDPRFT RTAAKSDEYV HIRPGTDIPF IYGLLWHIFE NGWEDKEFIK QRVYGIERIR EEAKKYTPEE VELVAGVPKA QMYRVAKMLA ETKPGTIVWC MGGTQHHVGN ANTRSYCILQ LALGNMGVSG GGTNIFRGHD NVQGATDFGL LFDNLPGYYG LTTGAWAHWS NVWDLDPKWI ASRFDQGEYL GQTPQTSAGI PCSRWHDGVL EDKTKIAQKD NIRLAFFWGQ SVNTETRGRE VREALNKMDT VVVVDPFPTM AGVMHQRKDG VYLLPAATQF ETYGSVSASN RSLQWRTKVI EPLFESLPDH IIMYKFAKKW GVEKEFCKHI QVNGEEPLIE DVTREFNTGM WTVGYTGQSP ERLKMHQENW GTFDVNNLTA PGGPAKGEVY GLPWPCWGTP EMKHPGTQIL YDQSKHVKDG GGNFRARYGV EHDGVNILAE GSFSKGSEIQ DGYPEFSADM LKQLGWWDEL TDAEKKRAEG KNWKTDISGG IQRVAIKHGC IPYGNAKARC IVWTFPDDIP KHREPLYTPR RDLVAKYPTY EDRMVARLPT LYKSIQDKDF TQGYPLTLTS GRLVEYEGGG EESRSNPWLA ELQQEMFIEM NPADAADRGV RDGDNVFVHG PEGAKITVKA MVTPRVIAGE CFMPYHFAGV FEGESLAKNY PEGTVPYVIG ESANTILTYG YDVVTQMQET KSSLCQISKA
|
| |