Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dtpsy_1423 |
Symbol | |
ID | 7382676 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidovorax ebreus TPSY |
Kingdom | Bacteria |
Replicon accession | NC_011992 |
Strand | - |
Start bp | 1491425 |
End bp | 1494619 |
Gene Length | 3195 bp |
Protein Length | 1064 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643654739 |
Product | molydopterin dinucleotide-binding region |
Protein accession | YP_002552885 |
Protein GI | 222110621 |
COG category | [C] Energy production and conversion |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.658184 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTGACA AGAAACCGCA TTCCTCTCCC GAGACCGAGG CCCACCCCGA CGCCAACCGC CGCCGCCTGC TGCTGCGCGG CGGCGCGGTG GCCGGTGGCC TGGCGGCCTT TGCCGCCGGC TACGGCGAAA CCGTGGCCAA GGGCGCCAAG GGCCTGGTGA CCGGCACGTC CGGCACCGCC ACCAAGAGCG CCACGCGCGG CAATTCGCTC ACGCCCGAGT TCCGCATCGA CCCCGTGACC GGCCAGCTCA CCACCCAGCC GGGCCAGGTG GTCAGCCCCA GCAGCTGCCT GGGCTGCTGG ACGCAGTGCG GCGTGCGTGT GCGCGTGGAC ACCGAGCACA ACAAGATCAT CCGCATCGCC GGCAACCCCT ACCACCCGCT GGCCACCACG CACCATGCGC CCATGGAGAC GCCCGTGCGC GAGGTGTACG CCCTGCTGGG TGGCGACAAC GGCCTGGAAG GCCGCGCCAC CAGCTGCGCG CGCGGCTCCG CCATGCTGGA GCACCAGACT GCGGCCCACC GCGTGCTCAC CCCGCTCAAG CGCGTGGGCC CGCGCGGCTC CGGCCAGTGG AAAAGCATCT CGCTGGAACA GCTGGTCAAG GAAATCTGCG AGGGCGGCGA CCTGTTCGGC GAAGGCCACG TGGACGGACT GGCTGCCATC CGCGACGTGC AGACGCTGAT CGACCCCGAG AACCCCGAGT ACGGCCCCAA GGCCAACCAG CTGCTGGTGA CCGATGCCTC CAACGAAGGC CGCACCCCGC TCATCAACCG CTTTGCGCGC CAGTCCTTCG GCACGGTCAA CGTGTCCAAC CACGGCGCCT ACTGCGGCCA GACCTACCGC GTGGGCACGG CGGCGGCCCT GGGCAACATC CCCGGCATGC CGCACGGCAA GCCGGACTGG AAGAACTCGC GCTTCGGCCT GTTCCTGGGC ACGGCGCCGG CGCAGTCGGG CAACCCGTTC CAGCGCATGG GACGCGAGCT GGCCGAGGCC CGCTCGCGCG ACGACAACAC CTACCGCTAC GTGGTGGTGT CGCCCGTGCT GCCCATGTCG TCCAGCCACG CGGCGGGCGA CAACAACCGC TGGCTGCCCA TCAAGCCCGC CACCGACCTG GCGCTGGCCA TGGCGCTGAT CCGCTGGATC ATCGACAACG AGCGCTACGA CGCCAAGTAC CTCACCCAGC CCGGCCCGGC TGCCATGGCC GCCGCGGGCG AGGCCAGCTG GAGCAACGCC ACCCACCTGC TCATCAACGA CCCCAAGCAC CCGCGCTACG GCCAGTTCCT GCGCGGGGCC GACCTGGGCC TGCCCATGCC CGAGCCGGTG GACGAAAAGA CCCCGGCCGA AGACGTCTAT GTGGTGCAGG TGGCAGATGG CAATGGCGGC TTCAAGCTGG TGCCCCACAC CGTGGCCCAG CCGGCGGAGC TGGTGGTGGA GCGCGAATTC ACGCCCCTGA AGGCCGCGGG CGCCACCGAA GAGCCCGCGC CCATCGCCGT GTGCACCGCC TTCGTCAAGC TGCGCGAGGA AGCGCGCCGC AAGACGCTGC AGGAGTATTC CGACCTGTGC GGCGTACCGG TGAAAGACAT CGAAGACCTG GCGCGCGAAT TCACCAGCCA CGGCAAGCAG GCCGTGGCCA ACTCGCACGG CGGCACCATG AGCGGCGCAG GCTTCTACAC CGCCTACGCC ATCGCCATGC TGAACAACCT GATCGGCAAC CTGAACGTCA AGGGCGGCTG GGTGCTGGAC GCTGGCCCGT TCGGCCCCTT TGGCCCCGGC CCGCGCTACA ACTTCGCGCA GTTCGAGGGC GCGGTCAAAG CCACGGGCGT GGCGCTGTCG CGCACGCGCT TTCCGTACGA GAAGACCAGC GAGTTCAAGC GCAAGAAGGA AGCCGGCCAG AACCCCTACC CCGCCAAGGC ACCGTGGTAC CCGGCGCCCG GGGGCCTCTC CAGCGAAATG CTGGCCGCCG GCCTGCTGGG CTACCCCTAC CCGGTGAAGG CGTGGATCAA CCACATGAGC AACCCGGTGT ACGCCATCTG TGGCTTTGAG AACACGCTGG TCAGCGCCCT CAAGGACCCG AAGAAGCTGC CGCTGTTCGT CTCGGTCGAT CCGTTCATCA ACGAGACCTC GGCCCTGGCC GACTACATCG TGCCCGACAC CGTCACGTAC GAAAGCTGGG GCATCGGCGC CCCCTGGGCC GACGTGATCG CCAAGAGCAG CACCGTGCGC TGGCCCACCG TGGAGCCCGC CACCGCCAAG ACGGCCGATG GCAAGCCGGT GAGCTTTGAG AGCTTCGTCT TCGCCGTGGC CAAGCAGCTG CAGCTGCCCG GCTTTGGCAA GGGCGCGATG AGCACCAAGG ACGGCGAGCC GCTCGACCTG GAAAGCGCCG AAGACTTCTA CCTGCGCGGC ATGTGCAACA TCGCCTACCA GGCCGGCAAG CCCGTGCCTG AAGCCAGCGA CGACGACATC GCCCTGACAG GCGTCACCCG CTGGATGCCC GAAGTGGAAA AGCGCCTCAA GCCCGAGGAA GTGCGCCGCG TGGCCATGGT GATGAGCCGC GGCGGGCGCT TCGACAAGAT CGAAGACGCC TGGAAGGGCG AGCAGATCAA GGCCGCCTAC AAGTTCCCCG TGCAGCTGTG GCACGAGGGC CTGGCCAAGA TGCGCCACTC CATGACCGGC GAGCGCTACG TGGGCTGCCC CACCTGGTTC CCCACCCGCT TTGCCGACGG CAGCAGCATG CGCGAACGCT TCACCGAGCA GGACTGGCCG CTGACCATGA GCAGCTACAA GTCCAACCTG ATGAGCAGCA TGTCGATCGC CGCCAGCCGT CTGCGCCAGG TGCACCCGCA CAACCCCATC AGCCTGAACA AGGACGATGC GGCCAAGCTG GGTATCGCCA ACGGCGACCG CATCGAGGTC AGCACCCCCG GCGCCAAGCT GCAGGGCGTG GCCCTGGTGC GCAGCGGCAT CGCCCAGGGC GCCCTGGCCA TCGAGTACGG CTACGGCCAC AAGCAGCTGG GCGCCGCCGT TCACACCGTG GACGGCAAGC CCATGGCCCA CAACCCGCAG CACGGCAACG GCGTAAACCT GAACGCGCTG GGCTTTGCCG ATCCCACCCG TCCCGCCAAG GACAACGTGT GGATCGACTG GGTGTCGGGG GCCGTGGTGC GGCAGGGGTT GCCGGTGAAG GTGCGGAAGG TGTGA
|
Protein sequence | MTDKKPHSSP ETEAHPDANR RRLLLRGGAV AGGLAAFAAG YGETVAKGAK GLVTGTSGTA TKSATRGNSL TPEFRIDPVT GQLTTQPGQV VSPSSCLGCW TQCGVRVRVD TEHNKIIRIA GNPYHPLATT HHAPMETPVR EVYALLGGDN GLEGRATSCA RGSAMLEHQT AAHRVLTPLK RVGPRGSGQW KSISLEQLVK EICEGGDLFG EGHVDGLAAI RDVQTLIDPE NPEYGPKANQ LLVTDASNEG RTPLINRFAR QSFGTVNVSN HGAYCGQTYR VGTAAALGNI PGMPHGKPDW KNSRFGLFLG TAPAQSGNPF QRMGRELAEA RSRDDNTYRY VVVSPVLPMS SSHAAGDNNR WLPIKPATDL ALAMALIRWI IDNERYDAKY LTQPGPAAMA AAGEASWSNA THLLINDPKH PRYGQFLRGA DLGLPMPEPV DEKTPAEDVY VVQVADGNGG FKLVPHTVAQ PAELVVEREF TPLKAAGATE EPAPIAVCTA FVKLREEARR KTLQEYSDLC GVPVKDIEDL AREFTSHGKQ AVANSHGGTM SGAGFYTAYA IAMLNNLIGN LNVKGGWVLD AGPFGPFGPG PRYNFAQFEG AVKATGVALS RTRFPYEKTS EFKRKKEAGQ NPYPAKAPWY PAPGGLSSEM LAAGLLGYPY PVKAWINHMS NPVYAICGFE NTLVSALKDP KKLPLFVSVD PFINETSALA DYIVPDTVTY ESWGIGAPWA DVIAKSSTVR WPTVEPATAK TADGKPVSFE SFVFAVAKQL QLPGFGKGAM STKDGEPLDL ESAEDFYLRG MCNIAYQAGK PVPEASDDDI ALTGVTRWMP EVEKRLKPEE VRRVAMVMSR GGRFDKIEDA WKGEQIKAAY KFPVQLWHEG LAKMRHSMTG ERYVGCPTWF PTRFADGSSM RERFTEQDWP LTMSSYKSNL MSSMSIAASR LRQVHPHNPI SLNKDDAAKL GIANGDRIEV STPGAKLQGV ALVRSGIAQG ALAIEYGYGH KQLGAAVHTV DGKPMAHNPQ HGNGVNLNAL GFADPTRPAK DNVWIDWVSG AVVRQGLPVK VRKV
|
| |