Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_0401 |
Symbol | |
ID | 4078795 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 408686 |
End bp | 411580 |
Gene Length | 2895 bp |
Protein Length | 964 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 638005696 |
Product | formate dehydrogenase alpha subunit |
Protein accession | YP_612396 |
Protein GI | 99080242 |
COG category | [C] Energy production and conversion |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.298134 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTTAGGA AAAAGACCAA CGGGGTTGCG CGACGCCCCC AGCGGACCAG TATCCTGTCC AAGGTCGGTG AGTCAGCCGT CGACCGCCGC GCGTTCCTGC GCGGATCTGG CCTTGCCATC GGCGGGCTTG CTGCCATCAG CGCAACCGGT GGTACAGTGA CGCAAGCCAA TGCGGCGGCG TCTGCAACCG GCGCGATGGA AACGATCAAA TCCGTCTGCA CCCACTGCTC GGTCGGCTGT ACCGTGGTGG CAGAGGTGCA AAACGGTGTC TGGGTTGGCC AGGAGCCAGG CTGGGACAGC CCCTTCAACC TCGGTGCGCA TTGCGCCAAG GGCGCTTCGG TGCGTGAACA CGCTCATGGT GAGCGCCGCC TGAAGTACCC GATGAAGAAA GAAGGCGGTG AGTGGAAGCG CATCAGCTGG GAGCAGGCCA TCGACGAGAT CGGCGACGGC ATGATGCAGA TCCGCGAAGA AAGCGGCCCT GATAGCGTCT ACTGGCTCGG TTCTGCCAAG CACAACAACG AACAGGCCTA CCTGTTCCGC AAGTTCGCCG CCTACTGGGG TACGAACAAC GTGGATCACC AGGCCCGGAT CTGTCACTCC ACCACGGTTG CGGGTGTTGC GAATACATGG GGCTACGGCG CCATGACCAA CAGCTACAAC GACATCCATA AATCCAAGGC GATCTTTATC ATCGGTGGCA ACCCCGCCGA GGCGCATCCG GTATCGCTGC TGCATGTGCT GAAGGCCAAG GAAGAGAACA ACGCGCCGCT GATCGTCTGC GATCCGCGTT TCACGCGTAC GGCGGCCCAT GCGGATGAAT ATGTCCGCTT CCGTCCCGGC ACCGACGTGG CGCTCGTTTG GGGCATCCTG TGGCATATCT TTGAAAACGG TTGGGAAGAC ACCGAGTTCA TCCGCACCCG TGTCTGGGGT ATGGATCAGA TCCGGACCGA AGTGGCCAAA TGGACGCCCG AAGAGGTCGA ACGCGTCACC GGCACCCCGG GTAGCCAGCT CAAGCGCGTT GCGCGCACCC TGGTCAACAA CCGCCCCGGC ACCGTCATCT GGTGTATGGG TGGCACCCAG CACACCAATG GCAACAACAA CACCCGCGCC TACTGCATCC TGCAGCTGGC CCTTGGCAAC ATGGGTGTGT CCGGCGGTGG CACCAACATC TTCCGCGGCC ACGACAACGT GCAGGGGGCA ACCGACCTTG GCGTTCTGAG CCACACTCTG CCGGGCTATT ATGGTCTGTC GGCTGGCGCA TGGGGCCATT GGGGCCGCGT CTGGGGCGAA GACATGGACT GGCTGAAGGG TCAGTTTGAA ACCGTCAAAG GCGCCGACGG CAAGGATAAG AACCTGATGA ACCTGACGGG CATTCCGGTG TCCCGCTGGA TCGACGGTAT CCTTGAAGAC AAGGAAAACA TGGACCAGCC CAACAATGTT CGGGCCATGG TTCTCTGGGG CCACGCGCCG AACTCTCAGA CCCGGATGAC GGAGATGAAG ACGGCGATGG AGAAACTCGA CATGCTTGTC GTGGTTGACC CCTATCCGAC CGTCTCTGCC GTGCTGCATG ATCGCACCGA TGGTGTCTAT CTGCTGCCCG CCTGCACCCA GTTTGAGACC CGCGGCTCCG TGACGGCCTC GAACCGTTCG CTGCAGTGGC GCGATCAGGT GGTGGAGCCT CTCTTTGAGA GTCTGCCGGA TCACGTCATC ATGGCCAAAT TCGCCAATAA GTTCGGCTGG GCAGATCGTC TCTTCCGCAA TATCGAAATG GAAGACGCCG AGACCCCCAA CATCGAAAGC ATCACCCGTG AGTTCAATGC GGGCATGTGG ACAGTGGGCT ACACGGGTCA GAGCCCGGAG CGCATCAAGC TCCATATGGC CAATCAGCAC ACCTTTGATC GCACGACGCT TCAGGCCGTT GGTGGCCCGG CGGATGGCGA TTACTACGGG ATGCCATGGC CCTGCTGGGG CACGCCGGAA ATGAAGCATC CGGGCACGCC GAACCTCTAT GACATGTCCA AACCTGTCGC CGAAGGTGGT CTGTGTTTCC GCGCCCGTTT CGGGGTGGAG CGTGATGGCG AAAATCTCCT GGCAGAAGGG GTCTCCAACC CCGGTGCGGA GATTCAGGAT GGCTATCCTG AGTTCACCAT GCAGATGCTG ATGGATCTGG GCTGGGATGG CGACCTGACG GCCGAGGAAC GCGCGGCGAT CGACGCCGTT GCCGGGCCAA AGACCAACTG GAAAACCGAC CTCTCCGGTG GGATTCAGCG GGTTGCGATC AAGCACGGCT GCGCGCCCTT CGGGAACGCC AAGGCTCGTG CGGTTGTGTG GACCTTCCCG GATCCGGTGC CGCTGCACCG CGAGCCGCTC TACACCAACC GGCGTGACCT GGTGGCGGAT TATCCGACCT ATGAGGATCG GAAATTCTAT CGTCTGCCCA CCATGTATGC CTCGATCCAG AAGAACGATG TCTCCAAGGA GTATCCGATC ATCCTCACCT CCGGCCGTCT GGTCGAATAT GAGGGCGGCG GTGACGAGAC CCGTTCGAAC CCGTGGCTTG CAGAACTGCA GCAGGACATG TTCGTCGAGA TCAATCCGCG CGATGCCAAT GACATCGGAA TCCGCGATGG GTCTCAGGTC TGGGTCGAAG GCCCGGAAGG CGGCAAGGTC AAGGTGATGG CAATGGTGAC AGAACGCGTC GGGGCCGGTG TGGCCTTCAT GCCGTTCCAC TTTGGCGGGC ACTTCCAAGG TAAGGATCTG AGGGATAAAT ATCCCGACGG GGCCGACCCT TACGTGCTGG GTGAAAGTAC CAACACCGCG CAGACCTACG GCTATGACTC TGTCACGCAG ATGCAAGAGA CCAAAGCCAC CCTCTGCAAA ATCTCAGCAG CCTAA
|
Protein sequence | MLRKKTNGVA RRPQRTSILS KVGESAVDRR AFLRGSGLAI GGLAAISATG GTVTQANAAA SATGAMETIK SVCTHCSVGC TVVAEVQNGV WVGQEPGWDS PFNLGAHCAK GASVREHAHG ERRLKYPMKK EGGEWKRISW EQAIDEIGDG MMQIREESGP DSVYWLGSAK HNNEQAYLFR KFAAYWGTNN VDHQARICHS TTVAGVANTW GYGAMTNSYN DIHKSKAIFI IGGNPAEAHP VSLLHVLKAK EENNAPLIVC DPRFTRTAAH ADEYVRFRPG TDVALVWGIL WHIFENGWED TEFIRTRVWG MDQIRTEVAK WTPEEVERVT GTPGSQLKRV ARTLVNNRPG TVIWCMGGTQ HTNGNNNTRA YCILQLALGN MGVSGGGTNI FRGHDNVQGA TDLGVLSHTL PGYYGLSAGA WGHWGRVWGE DMDWLKGQFE TVKGADGKDK NLMNLTGIPV SRWIDGILED KENMDQPNNV RAMVLWGHAP NSQTRMTEMK TAMEKLDMLV VVDPYPTVSA VLHDRTDGVY LLPACTQFET RGSVTASNRS LQWRDQVVEP LFESLPDHVI MAKFANKFGW ADRLFRNIEM EDAETPNIES ITREFNAGMW TVGYTGQSPE RIKLHMANQH TFDRTTLQAV GGPADGDYYG MPWPCWGTPE MKHPGTPNLY DMSKPVAEGG LCFRARFGVE RDGENLLAEG VSNPGAEIQD GYPEFTMQML MDLGWDGDLT AEERAAIDAV AGPKTNWKTD LSGGIQRVAI KHGCAPFGNA KARAVVWTFP DPVPLHREPL YTNRRDLVAD YPTYEDRKFY RLPTMYASIQ KNDVSKEYPI ILTSGRLVEY EGGGDETRSN PWLAELQQDM FVEINPRDAN DIGIRDGSQV WVEGPEGGKV KVMAMVTERV GAGVAFMPFH FGGHFQGKDL RDKYPDGADP YVLGESTNTA QTYGYDSVTQ MQETKATLCK ISAA
|
| |