Gene Information |
Locus tag | Sbal223_0104 |
Symbol | |
ID | 7087369 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella baltica OS223 |
Kingdom | Bacteria |
Replicon accession | NC_011663 |
Strand | + |
Start bp | 121577 |
End bp | 124591 |
Gene Length | 3015 bp |
Protein Length | 1004 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 643459028 |
Product | formate dehydrogenase, alpha subunit |
Protein accession | YP_002356068 |
Protein GI | 217971317 |
COG category | [C] Energy production and conversion |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence; [TIGR01553] formate dehydrogenase, alpha subunit, proteobacterial-type |
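The coordinate and length fields in this table can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that Gene Length counts the terminating stop codon while Protein Length does not; the record does not state either convention, but both are consistent with its numbers. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 121577
END_BP = 124591
GENE_LENGTH_BP = 3015
PROTEIN_LENGTH_AA = 1004


def span_length(start: int, end: int) -> int:
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


def residues_from_cds(length_bp: int) -> int:
    """Residue count of a complete CDS, assuming the last codon is a stop."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


print(span_length(START_BP, END_BP))      # 3015
print(residues_from_cds(GENE_LENGTH_BP))  # 1004
```

Both results agree with the Gene Length and Protein Length fields listed above.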
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 72 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACAGAC GCCAGTTTTT TAGACTGTGT GCTGCCGGAG CTGCCACCTC AGCAATTTCT GCGTTGGGAT TGATGTCCGA AAAGGCCTAT GCGGCCGTTC GGGAATTCAA ACTCATTAGC GCGAAAGAAA CCCGCAATAA CTGCTGCTAT TGCTCAGTGG GTTGCGGATT GCTGATGTAC AGCAAAGGAA GTAATGGCAA GAACGCCGAA CAGAGTATTT TCCATATCGA AGGGGATGCA GACCATCCAG TCAACCGCGG GGCCTTGTGT CCTAAAGGCG CTGGTTTAGT GGATTATGTC AATAGTCCCA ATCGTCTTAA ATACCCAGAG TACCGTGCAC CAGGCAGCAA TGAATGGACA CGTATCAGCT GGAGCGAGGC TTACCAACGC ATCGCCCGCC TGATGAAAGA TGATCGTGAT GCTAACTTGA TTGAAAAGAA TTCTGCCGGG ACGACGGTTA ACCGTTGGCT TAGCACTGGG ATGATGACCT CGTCGGCGAT GCCGAACGAA GGCGGCTATA TCACCCAGAA ATTTGCACGG GCACTGGGCC TAGTTGCAAT CGATACTATT GCGCGTAACT GACACTCTCC AACGGTAGCA AGTCTTGCTC CGACATTTGG ACGTGGTGCC ATGACCAACC ACTGGATCGA TATTAAAAAT TCTAATTTGA TCATTATCAT GGGCGGTAAT GCTGCAGAGG CTCACCCTGT CGGATTCGGC TGGGTTACAG AAGCGATGCA ACATAACAAC GCTAAACTTT TGGTGGTCGA TCCCCGCTTC ACCCGTAGCG CCGCTGTGGC CGATTATTAT GCGCCGATCC GTTCGGGCAC AGATATCGCC TTCCTGCTCG GGGTGATCCG CTATCTTATT GAAACCAAAC AAGTGAACTA CGACTATGTG AAGGCTTATA CCAATGCCAG TTACATAGTG CGCGAAGACT TCTCTTTTAG TGAAGGTCTG TTCAGTGGTT TCGATGAAGA AACAGATAGC TACGATAAAG AGTCTTGGTA CTACGAGTTA GACGAACAGG GTTACGCGAA AGTTGATCCT AGCTTCGAGC ATCCACGCTG TGTGTGGAAC TTAATGAAGC AACATGTAGA TCGCTATGAC TTTGAAACAG TAAGCAATAT TACCGGTACG CCGATTCCGG ACTATGAGAT GGTTTGCCAG CAAATTGGCA GCACTCATAC CCACGATAGA GCCGCCACTT TTATGTATGC CTTAGGTTGG ACTCACCACA CTAAAGGGGC GCAGAACATT CGCTCTATGG CTATGGTGCA GTTGCTGCTG GGTAACATTG GGGTGTTGGG CGGTGGTGTT AACGCACTAC GTGGTCACTC TAACGTGCAA GGCGCGACCG ATATGGGTTT ATTGTGTCAA AGCTTGCCTG GTTACTTAAA GTTACCGAGT GAAAAAGACA CTGATCTTAA GACCTATCTT GATCGTTACA CGCCTGTCGC CTTGCGTCCG GGTCAAACCA ACTATTGGCA GAATTACCCT AAATTCTTTA TTTCGCAAAT GAAGAGCTTC TGGGGCGATA ACGCCACGGT TGAAAATGAC TTTGGTTATG ACTGGGTGCC TAAGTGGGAT AAACAGTACG ACTTTACTAA GCACTTAGAT ATGGCTTTCC ACGGTGAAGT GAATGGCTAC ATTATCCAAG GTGTTAACGC CATTAATTCT ATGCCGAACC GCAATAAGGT ACTCAAAGCC TTGAGTAATC TTAAGTACAT GGTGGTGCTG GATGCCTTGG CAACGGAAAC CGCAACCTTC TGGCAAAACG CCGATGGCTT CAACGAGGTG 
AATCCCGCTG AAATCATGAC GGAAGTGTTC CGCTTACCGA CGACGTGTTT TGCTGAGGAA GAAGGTTCCA TCGTCAACTC TGGACGTTGG ATGCAGTGGC ATTGGAAGGG CGCTAACCCG CCGGGTGAAG CCAAGCCAGA TGCTGAGATC CTCTCCGGTG TCTTAATGGC AATGCGTGAA CTGTACAAGC AAGAGGGCGG TAAGTTGCCT GAGCCAGTGC AAGCGATAAG TTGGGATTAC CACAATCCTT ACTCGCCGCA TGCCGAAGAA GTTGCTCGCG AGCTTAACGG TAAGGATCTC GTGACTCAGC GTCAGCTCAG TAGCTTCTCT GAACTCAAAG CTGATGGTAG TACTGCTAGC GCCTGCTGGA TTTATGCCGG TTCTTGGACC GAAGAGGGCA ACCTGATGGC TCGCCGCGAC AACCATGATC CTTCGGGTAA AGGTGTCACT CCGGGCTGGG CATTTGCTTG GCCAGCCAAC CGTCGCGTGC TGTATAACCG CGCTTCTTGC GATATCAACG GCAAACCATG GGATCCCAAA CGTACTATCG TCGAGTGGGT TGATGGTAAG TGGCATGGTA TCGATGTGGG CGACTTCAAT ATGAAGTTAA CCCCACAGGA ATCCGCAGGT CCCTTCATCA TGCAACCCGA AGGTGTGGGT CGCTTCTTTG CCCTTAAGTT GTTGGCCGAA GGTCCGTTCC CTGAGCACTA CGAGCCGATG GAGTCGCCTA TTGGGGTGAA CCCGTTACAC AAGGTCACCA GTAACCCTGC GGTACGTATG CTACCGGGTG TGAAGGAAAC CTTAGGTTCC CATAAGGACT TCCCTTATGT GGCAACCACT TACTGTGTGA CTGAACACTT TAATTTTTGG TCTACTCATG CCCGTTTGGC AGCGATTTCT ATGCCGGAAA CCTTCGTCGA AATCGATGAA ATGTTGGCGG CAGAAAAAGG TATCGCCAAT GGGGATTGGG TCACTGTCAG CTCTAAACGT GGCAGCATTG AAACCAAAGC CTTAGTGACT AAACGTCTGC AGCCGTTAAA GGTGAATGGT CAGTTAGTTC ATACCGTCGG GTTACCGCGT CACGGCAGCC ATAATGCGCT CACTCGTAAG AGTTACTCCT GTAACGTGCT CACCACGGAA GTGGGGGATG CAAATACACA GGTGCCTGAG TTCAAAGCAT TTTTGGTTAA CATCACTAAA GCGAAGGGGC TCTGA
|
Protein sequence | MNRRQFFRLC AAGAATSAIS ALGLMSEKAY AAVREFKLIS AKETRNNCCY CSVGCGLLMY SKGSNGKNAE QSIFHIEGDA DHPVNRGALC PKGAGLVDYV NSPNRLKYPE YRAPGSNEWT RISWSEAYQR IARLMKDDRD ANLIEKNSAG TTVNRWLSTG MMTSSAMPNE GGYITQKFAR ALGLVAIDTI ARNUHSPTVA SLAPTFGRGA MTNHWIDIKN SNLIIIMGGN AAEAHPVGFG WVTEAMQHNN AKLLVVDPRF TRSAAVADYY APIRSGTDIA FLLGVIRYLI ETKQVNYDYV KAYTNASYIV REDFSFSEGL FSGFDEETDS YDKESWYYEL DEQGYAKVDP SFEHPRCVWN LMKQHVDRYD FETVSNITGT PIPDYEMVCQ QIGSTHTHDR AATFMYALGW THHTKGAQNI RSMAMVQLLL GNIGVLGGGV NALRGHSNVQ GATDMGLLCQ SLPGYLKLPS EKDTDLKTYL DRYTPVALRP GQTNYWQNYP KFFISQMKSF WGDNATVEND FGYDWVPKWD KQYDFTKHLD MAFHGEVNGY IIQGVNAINS MPNRNKVLKA LSNLKYMVVL DALATETATF WQNADGFNEV NPAEIMTEVF RLPTTCFAEE EGSIVNSGRW MQWHWKGANP PGEAKPDAEI LSGVLMAMRE LYKQEGGKLP EPVQAISWDY HNPYSPHAEE VARELNGKDL VTQRQLSSFS ELKADGSTAS ACWIYAGSWT EEGNLMARRD NHDPSGKGVT PGWAFAWPAN RRVLYNRASC DINGKPWDPK RTIVEWVDGK WHGIDVGDFN MKLTPQESAG PFIMQPEGVG RFFALKLLAE GPFPEHYEPM ESPIGVNPLH KVTSNPAVRM LPGVKETLGS HKDFPYVATT YCVTEHFNFW STHARLAAIS MPETFVEIDE MLAAEKGIAN GDWVTVSSKR GSIETKALVT KRLQPLKVNG QLVHTVGLPR HGSHNALTRK SYSCNVLTTE VGDANTQVPE FKAFLVNITK AKGL
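Comparing the two sequences shows that the codon at nucleotides 580-582 is TGA, while the protein sequence has U (selenocysteine) at residue 194. This fits the COG annotation "typically selenocysteine-containing". The sketch below translates the 30-nucleotide excerpt covering residues 191-200 with the amino-acid assignments of translation table 11. Marking a TGA codon as U through the `sec_codons` argument is a hypothetical convenience for illustration, not how the record's translation was produced.

```python
from itertools import product

# Amino-acid assignments of NCBI translation table 11 (same as the
# standard code), with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str, sec_codons: frozenset = frozenset()) -> str:
    """Translate in frame 0; TGA at a 0-based codon index listed in
    sec_codons is written as U (selenocysteine) instead of '*'."""
    dna = dna.replace(" ", "").upper()
    residues = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if codon == "TGA" and i // 3 in sec_codons:
            aa = "U"
        residues.append(aa)
    return "".join(residues)


# Nucleotides 571-600 of the gene sequence (residues 191-200).
excerpt = "GCGCGTAACT GACACTCTCC AACGGTAGCA"

print(translate(excerpt))                  # ARN*HSPTVA
print(translate(excerpt, frozenset({3})))  # ARNUHSPTVA
```

The second call reproduces the block ARNUHSPTVA from the protein sequence; a plain table-11 translation would stop at this position.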