Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Shewana3_2089 |
Symbol | |
ID | 4479497 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella sp. ANA-3 |
Kingdom | Bacteria |
Replicon accession | NC_008577 |
Strand | + |
Start bp | 2506781 |
End bp | 2509798 |
Gene Length | 3018 bp |
Protein Length | 1005 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 639726674 |
Product | formate dehydrogenase alpha subunit |
Protein accession | YP_869725 |
Protein GI | 117920533 |
COG category | [C] Energy production and conversion |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence [TIGR01553] formate dehydrogenase, alpha subunit, proteobacterial-type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0000081067 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAATAGAC GACAGTTTTT TAAACTCTGT GCTGTAGGAG CCGCGACTTC TACTATTTCT GCACTGGGGT TAATGTCCGA AAAGGCATTT GCATCCGTCA GAGGTTTTAA ATTGCTACGC GCAAAAGAGA CGCGTAACAA TTGTTGCTAC TGCTCTGTGG GTTGTGGTTT GTTGATGTAC AGCCAAAGCA GTAATGGGAA AAATGCGGAG CAGAGCATTT TTCATATCGA GGGCGATGCC GATAATCCGA TTAACCGCGG CGCCCTGTGT CCAAAAGGGG CAGGACTGGT TGACTATGTG AACAGTCCGC ACCGTTTAAA ATACCCCGAA GTGCGTTTAC CCGGTTCGGA TAAATGGCAG CGCATTAGCT GGGATGAGGC CTTTAAGCGT ATCGCAAGGC TTATCAAAGA CGAGCGCGAT GCCAATCTGG TTGAGAAAAA TGCTCAGGGT CAAACCGTTA ACCGCTTAGT CAGCCTAGGG ATGATGACGT CATCGGCCCA GGCAAACGAA GGCTGCTACA TTACCCATAA ATTTGGCCGT GCCATTGGTA TGTTAGGCAT AGATAACATC GCCCGTGTTT GCCACGCCCC AACACCGGCC GCGATGGCGC CCACCTTTGG CCGTGGTGCT ATGACTAACC ACTGGGCGGA TATGAAAAAT ACCGATCTGG CCATTGTGAT GGGCGGTAAC GCTGCCGAAG CGCATCCCGT CGGCTTTGGT TGGGTGACAG AAGCGATGGA GCACAACAAC GCCAAGTTGA TTGTGGTCGA TCCGCGCTTT AACCGAAGTG CTGCCGTCGC CGATTTATAT GCGCCTATTC GTTCGGGCAC CGATATTGCC TTCCTGCTCG GCATGATCCG CTACCTGCTC GAAACCCAGC AAATCAACCT TAATTATGTC AAAGCCTATA CCAACGCCAC GTTTATCGTG CGGGAAGACT TTGAATTTAA TGACGGTTTA TTCAGTGGCT ACGATGAGGC TAACCATAAA TACGACCAAT CCACTTGGTT CTACGAGCTA GATGAAGAGG GTTACGCAAA AGTTGACCCT AGCTTAAGCC ATCCTCGCTG CGTGATTAAC CTGCTGAAAA AACACGTCGA TCGCTACGAT CCCTACACGG TTTCTAGCAT CACAGGTACG CCTAAAGAAG CCTATCTTGA AGTGTGTCAG CAAATTGGGG CGACCCATGT TGACCATAAA GCTGCCACCT TCCTGTATGC CCTCGGTTGG ACACAGCACA GCGTTGGCGC GCAAAACATC CGTACCATGG CGATGATCCA ATTGCTGCTT GGCAATATGG GAATCATGGG CGGCGGCGTG AATGCGCTGC GCGGTCACTC AAACGTACAG GGCGCGACGG ACTTAGGTTT ATTGTGCCAA GGATTACCTG GCTACCTTAA ACTGCCACAG GATAGAGATG TTGATTTACA AAACTATTTA GCCCATTACA CCCCTAAAGC CTTAAGACCA AACCAAACCA ACTATTGGCA CAATTACCCA GCCTTTACCG TGTCGTTGTT AAAAGCCTTC TTCGGTGAAC ATGCAACGGC AGAGAATGAT TATGGTTATA ACTGGCTGCC AAAATGGGAC CAACAGTACG ATATCAACAA GCAAATCGAC ATGATGGTTC ACGGTGAGGT CAACGGATAC TTTATCCAGG GCATCAACGC GCTTAACTCC CAGCCCGATA AGCAAAAAGT GTCTAAGGGC TTATCGAATC TTAAGTTCTT AGTAGTGCTC GATGCGCTTG CGAACGAAAC CTCGAGTTTC TGGCGCAATG CGGGTCAATT TAACGATGTC GATACCGCCA GTATTCAAAC CGAAGTCTTC CGCTTACCAA CAACCTGTTT TGCTGAGGAA AGTGGTTCCA TTGCTAACTC GAGCCGCTGG TTACAATGGC ACTTTAAGGG CGCTAATCCT CCCGGTGAAG CCTTATCTGA TCCTGCGATC CTTTCTGGCA TCATGCTGGA ATTAAAACGT TTATACCGTG AAGAGGGCGG CCGTTTACCT GCGCCTATCG AAGCCATTAA ATGGGACTAT GCGATTGAGC ATGAACCCAG TTCAGAGGAA ATCGCGCGGG AGATGAACGG TTATGATCTC ACGACCGGTA AGCTGCTCAA TGGTTTCTCC GAATTAAAAT CCGATGGCTC AACCTCATGC GGTATTTGGG TTTACTCAGG CATGTGGACT GAAGCGGGCA ACTTGATGGC GCGCCGCGAT AATAGCGATC CATCCGGCAA AGGTATTACC CCGAATTGGT CATTTGCATG GCCTGCAAAC CGCCGCATCT TGTACAACCG CGCATCCTGC GACGTGCAAG GTAAGCCGCG CGATCCAAGC CGTGTGCTAC TCGAGTATAA GGACAACAAG TGGCAGGGTA TTGATGTGCC AGACTTTAAT GCCAAATTGA ATGCCGAAGA ATCGGCCCAT CCTTTCATCA TGCAAGCTGA TGGCGTTGGC CACTTATTTG CGCTGCGTGA CTTAAAAGAT GGCCCATTCC CAGAGCATTA CGAGCCGTTT GAATCACCAC TGGCGAGTAA CCCGCTGCAT CCTAAGGTCA CCAATAACCC TGTGGCACGG ATGTTCAAAG GCTTACGTGA AAGCTTTGGT ACCAATGAAG AATTCCCCTA TGTTGGCACC ACTTACTCAA TGACGGAACA CTTCAACAAC TGGACCACGC ATTGCCACCT TGCTGCGATT ACCCAGCCAC AGCACTTTAT CGAAATCGAT GAAACCTTGG CGGCGGAAAA GGGCATCAAT AACGGTGATT GGGTCAAGGT GAGCTCTAAG CGCTCGCATA TTGTCACTAA GGCCTATGTC ACTAAACGAC TCCAACCCAT GATGGTTCAG GGCAAAAAAG TTCACACCAT TGGTATTCCA CGCCATGGCA GTTATGAGGC CTTGACGCAG AAGAGTTATA TCGTCAACGA GCTGACTTCA TCTGTGGGCG ATGCCAATAC CCAAACCCCT GAATATAAAG CATTCCTTGT GAATATTGCC AAAGCGGAGG GCTTCTAA
|
Protein sequence | MNRRQFFKLC AVGAATSTIS ALGLMSEKAF ASVRGFKLLR AKETRNNCCY CSVGCGLLMY SQSSNGKNAE QSIFHIEGDA DNPINRGALC PKGAGLVDYV NSPHRLKYPE VRLPGSDKWQ RISWDEAFKR IARLIKDERD ANLVEKNAQG QTVNRLVSLG MMTSSAQANE GCYITHKFGR AIGMLGIDNI ARVCHAPTPA AMAPTFGRGA MTNHWADMKN TDLAIVMGGN AAEAHPVGFG WVTEAMEHNN AKLIVVDPRF NRSAAVADLY APIRSGTDIA FLLGMIRYLL ETQQINLNYV KAYTNATFIV REDFEFNDGL FSGYDEANHK YDQSTWFYEL DEEGYAKVDP SLSHPRCVIN LLKKHVDRYD PYTVSSITGT PKEAYLEVCQ QIGATHVDHK AATFLYALGW TQHSVGAQNI RTMAMIQLLL GNMGIMGGGV NALRGHSNVQ GATDLGLLCQ GLPGYLKLPQ DRDVDLQNYL AHYTPKALRP NQTNYWHNYP AFTVSLLKAF FGEHATAEND YGYNWLPKWD QQYDINKQID MMVHGEVNGY FIQGINALNS QPDKQKVSKG LSNLKFLVVL DALANETSSF WRNAGQFNDV DTASIQTEVF RLPTTCFAEE SGSIANSSRW LQWHFKGANP PGEALSDPAI LSGIMLELKR LYREEGGRLP APIEAIKWDY AIEHEPSSEE IAREMNGYDL TTGKLLNGFS ELKSDGSTSC GIWVYSGMWT EAGNLMARRD NSDPSGKGIT PNWSFAWPAN RRILYNRASC DVQGKPRDPS RVLLEYKDNK WQGIDVPDFN AKLNAEESAH PFIMQADGVG HLFALRDLKD GPFPEHYEPF ESPLASNPLH PKVTNNPVAR MFKGLRESFG TNEEFPYVGT TYSMTEHFNN WTTHCHLAAI TQPQHFIEID ETLAAEKGIN NGDWVKVSSK RSHIVTKAYV TKRLQPMMVQ GKKVHTIGIP RHGSYEALTQ KSYIVNELTS SVGDANTQTP EYKAFLVNIA KAEGF
|
| |