Gene Information |
Locus tag | Shel_05600 |
Symbol | |
ID | 8394452 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Slackia heliotrinireducens DSM 20476 |
Kingdom | Bacteria |
Replicon accession | NC_013165 |
Strand | + |
Start bp | 648724 |
End bp | 651663 |
Gene Length | 2940 bp |
Protein Length | 979 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644985322 |
Product | anaerobic dehydrogenase, typically selenocysteine-containing |
Protein accession | YP_003142968 |
Protein GI | 257063296 |
COG category | [C] Energy production and conversion |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
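
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, and a minimal sketch can check this. It assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. The function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(648724, 651663)
print(length)                  # 2940
print(protein_length(length))  # 979
```

Both values match the Gene Length (2940 bp) and Protein Length (979 aa) fields of this record.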
| |
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.533202 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAAAAT CTAATCTGAC TCGTCGAAGC TTCATCAAAG CGGCAGGCTT TACGGCGGCT GCTTCGGCCA TCGGCGCGTC GCTGGCCGGC TGTATGCAGT CCGATGGCGG TTCCGCCGAA GGCGAAGGCT CTGCCGACGG CAACGTAAAG TCCTCTGAGG GCGTTACGTT CTCGACGCAC ATCGACTACG ACACGAAGCT GCGCACCTCG TGCCATGGTT GCATCCAGAT GTGCCCGGCC ATCGCCTACC TGAAGGACGG CGTGGTCGTC AAGCTCGAGG GTGACCCCGA GGCGCCCGTG AGCCGCGGCT CCCTGTGCAT CAAGGGCCTC AACCAGGTCC ACACTATGTA CAGCCCGCGC CGCGTCCTGC ATCCGCTGCG CCACATCGAG CGCGGCACCA ACAAATGGGA GCAGATCAGC TGGGACGAGG CCATCGACGA GGCCGCCCAG CACATCGCTG ATGCCATCAA CGAGTACGGC CCGTATTCGT TCTTCGCAAG CGTCGGCGGC GGCGGCTCCT ATTCGTTCAT GGAGGCCATG ACCCTGCCTA TGGCTCTGGG TTCGCCCACC GTGTTCGAGC CGGGCTGCGC GCAGTGCTAC CTGCCCCGTT GGGCTCTCTC CAAGCTGTTC TACGGCGGCG ACGACCAGTC CATCGCCGAC AACGCAGTTC AGGAGATCTT CCGTCCTGGC GACGACAACA AGGCCGAAGT GGTCGTGATC TGGGGCGCCC AGCCGTCCGT TTCCGAAACC GCTGAATCCG GCCGCGGCAT GGCCGAGCTC CGCGCTGCGG GCGTCAAGAC CATCGTCGTC GACCCGAACT TCAGCCCTGA CGCGGTTCAC GCCGACATCT GGCTGCCCAT CCGTCCGGCA ACCGACACTA AACTCATCCT GTGCTGGTTC AACTACATCT TCGAGAACAA GCTGTACAAC GAGCAGTTCA CGAAGTACTG GACCAACCTG CCGTTCCTGA TCGACCCGGA CACCAAGCTG CCCGTCAAGG CTCAGGAGCT GTTCCCGGAC TTCGAGCAGA CCACGCCCGA GCACACGCCG GCTTATGTCT GCTACGACCT GAAGACCAAC GCTGTTGCTC CCTTCGAGTA CAGCGCTCCT GCCGACAGCG CCGTCGACCC CGAGATCTTC TGGGAAGGCG AGTTCAACGG CAAGACCTAC AAGACCGCCG GCCAGATTTA CGCCGAAGAG GCTGCTCCTT GGACGCTTGA GAAGACCGCC GAGGATTGCT GGCTTGAGGC CGACAAGATC GAAGCTGCCA TCCGCATGTA CACCCATGCC GACGACGGCG GCCCGATTGA CCATGTTGCC GGCATCGCCA ACGGCGTTGC TTCCGACATG ACCGAGTCCG CCAGCCAGGT GCCTATCGGC CTGATGGGCC TCGACTCCAT CATGGGCTAC ATCAACAGGC CCGGCGCCAC CATGACCCAG AAGGGCGGCG GCTACTTCAC CGACGCGACG GGCACCCAGT CGCTCAAGCG TCAGTACACC TTCAACAACG GCTTCGGCGG CATGTTCAGC GCCATGTACG GCATCGGCGC CGTCATCGGC CTGTCGGATG AGCAGAACGA AGCCTGGGCA CGCGGCGAAG AGACCCCGGC CGGCGTCGCC GACGTCAGCC AGCAGGAGCT CGCCAACCAG CTGCTCCTCG ACCGTATCGG CATGAAGGAC CACAAGGGCC TGTATGCTTG GTGCCACAGC CACATCCCCA GCGTCCGCGA AGCCATCGCA ACCGGCGAGC CCTTCAAGCC TCGCGTCTGG TTCGACATGT CCGGCAACAA GCTGGCCATG 
CTCGGCAACG CCAAGAGCTG GTATGAGGTC TTCCCCGAGG TCGACTACAT CATCACCCAG TACCCGAACA TCACGTCGTT CCAGTTCGAA GCTGCCGACC TCATGTTCCC GCTGCGTGAG TGGCTGGAAG AGCCCATGGT CAACATGACC CAGCTCGACA CCCAGTGGCT GCAGAACGAA TGCACCCACA TCGGCGAGAC CGTATCCCAC AGCATCCCCG CCGCTCAGGT TCTGGCGAAG GTCGCCGAGA AGCTCGGCGG AGACCTGCCT GGCTTGAAGC CCGGCTATCT GGGCAACCCC ACCGAGCAGG CGAACAAGGA TTCCGTCGCC GCCACGCTGG GCGCTCCCAG CTGGGATGAA CTGATCGCCA ACACCGACCA GTACGTGCCG CACATCACCG AGGGCTACTT CAACTACAAC CAGCACGAAG TCCTCGCCGA GGATGACGAC AACCTGCCCA TCGGCTTCGC CACTGAGTCC CGTAAGATCG AGACCTACTG CCAGATTCTG CTGCGCACGG CCCGCACCGG CTATCCGTAC TGCTACCCCA AGCCGCAGGA AGCCTGCGAC GACTACAGCC CGATCTGCGT GCCCATCGAA CCCTGCGAGA GCCCCTATTC CGAGGCCGAG CAGCAAATCG CCGACCGCGA AGAGTACCCG TTCGTGCTCA CCAGCGGCCG TGTTCCGTAC TTCCATCACG GCACCATGCG CCATGCCGCG TACAGCCGCG AGCTGTTCCC CTGCGCCGAG ATCCGCATCA ACCCGGCCAG CGCTGCCGAG CTGGGCATCG AACATATGGA CTGGGTCAAG GTCACGAGCC GCCGCGGCGA GATCCATGCC CGCGCATGGC TCACCGAGGC CATCAATCCG CATACCGTTT GGATGGAGCG CTTCTGGAAC CCCGAGGCAT TCGACGAGTC CCAGGCCAAT CCGGACGGTG GCTGGCGCCA GATGAACGTC AACGTGCTCA CCAAGAACAC GGCGCCGTTC AACGAGGTGT TCGGTTCTTA CACCAACCGC GGCTTCACCG TTAAGATCGA GAAGTCTGAG AAGCCCGAGA ACGTGTGGGT CGAACCCGAG GAGTTCCAGC CGTTCATGCC CACCCTGCAA GGCGAAGCTA GGACGGAGGA TGTATTCTAA
|
Protein sequence | MTKSNLTRRS FIKAAGFTAA ASAIGASLAG CMQSDGGSAE GEGSADGNVK SSEGVTFSTH IDYDTKLRTS CHGCIQMCPA IAYLKDGVVV KLEGDPEAPV SRGSLCIKGL NQVHTMYSPR RVLHPLRHIE RGTNKWEQIS WDEAIDEAAQ HIADAINEYG PYSFFASVGG GGSYSFMEAM TLPMALGSPT VFEPGCAQCY LPRWALSKLF YGGDDQSIAD NAVQEIFRPG DDNKAEVVVI WGAQPSVSET AESGRGMAEL RAAGVKTIVV DPNFSPDAVH ADIWLPIRPA TDTKLILCWF NYIFENKLYN EQFTKYWTNL PFLIDPDTKL PVKAQELFPD FEQTTPEHTP AYVCYDLKTN AVAPFEYSAP ADSAVDPEIF WEGEFNGKTY KTAGQIYAEE AAPWTLEKTA EDCWLEADKI EAAIRMYTHA DDGGPIDHVA GIANGVASDM TESASQVPIG LMGLDSIMGY INRPGATMTQ KGGGYFTDAT GTQSLKRQYT FNNGFGGMFS AMYGIGAVIG LSDEQNEAWA RGEETPAGVA DVSQQELANQ LLLDRIGMKD HKGLYAWCHS HIPSVREAIA TGEPFKPRVW FDMSGNKLAM LGNAKSWYEV FPEVDYIITQ YPNITSFQFE AADLMFPLRE WLEEPMVNMT QLDTQWLQNE CTHIGETVSH SIPAAQVLAK VAEKLGGDLP GLKPGYLGNP TEQANKDSVA ATLGAPSWDE LIANTDQYVP HITEGYFNYN QHEVLAEDDD NLPIGFATES RKIETYCQIL LRTARTGYPY CYPKPQEACD DYSPICVPIE PCESPYSEAE QQIADREEYP FVLTSGRVPY FHHGTMRHAA YSRELFPCAE IRINPASAAE LGIEHMDWVK VTSRRGEIHA RAWLTEAINP HTVWMERFWN PEAFDESQAN PDGGWRQMNV NVLTKNTAPF NEVFGSYTNR GFTVKIEKSE KPENVWVEPE EFQPFMPTLQ GEARTEDVF
|
| |
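
The relationship between the Gene sequence and Protein sequence fields can be illustrated with a short sketch that translates the first 30 bases of the gene. It uses the amino-acid assignments of translation table 11, which are the same as those of the standard code. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical. The sketch deliberately ignores alternative start codons and any selenocysteine recoding of TGA, and simply stops at the first stop codon.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G at each position) paired with the
# amino-acid string for translation table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces/newlines used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon (simplified)."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the Gene sequence field.
fragment = "ATGACAAAAT CTAATCTGAC TCGTCGAAGC"
print(translate(fragment))  # MTKSNLTRRS
```

The output matches the first ten residues of the Protein sequence field. Applying `gc_fraction` to the full gene sequence, rather than to this short fragment, is how the GC content field could be compared against the record.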