Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_4229 |
Symbol | |
ID | 9158417 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | - |
Start bp | 4364240 |
End bp | 4367674 |
Gene Length | 3435 bp |
Protein Length | 1144 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | Aldehyde Dehydrogenase |
Protein accession | YP_003649136 |
Protein GI | 296141893 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTTGACC TCTCTCCTCA TTCACTCCCC CCTGCCATGG TCGCCGGTGA CGATCCCGCG GTGGTCGATG CCGCGGTTAC CCGCGCCGAT CGGTGGTTGC GCACCAGCCG CGGCCCGCGA CCCACCGAGG TAGGCCGCCG CGAGGCCGCC GCCACGTCCA GTCTCGCTGC GCTGCTCCAT GATCCGAACG GCGTCGAGTT CACCATGGGA TTCGTCGATC AGGTCGCGCG GCCCGAGGAC GACCGCGTGG CCGCCAAGGC ACTGCGCCGG CTCGTCTCCC CCACCGACGG CTCCACCGCC GCGTTCATGG GCGGCGTCGA CTCCGCGCTA CTGCGGGTGG GCACCGTCGC CGCCGGTATC GCACCGTCGA TCGCCATGCC GGTGGCCCGT GCCCGGCTGC GGCAGCTGGT GGGGCACCTG GTGTTCGACG CCGACGGCGA CAAGCTGCGC CGCCGCCTGG ACCGTGCCCG CGAGGTGGGC GTGCAACTGA ACCTCAATCT GCTGGGCGAG GCCGTCCTCG GCCAGGGCGA GGCCGACAGC CGGCTGCGCC GCACGCACGA GCTGCTCGCG GACCCCGCGG TCGACTACGT CTCCATCAAG GTGTCCTCCG TGGTGGCCCA GCTGATCCCC TGGGACCTGG AGGGCAACCG CGACCGCATC GTCGAGCGGC TGCGTCCGCT GTACCGCACC GCCCGGGACG GAGGGAAGTT CGTCAACCTC GATATGGAGG AGTACAAAGA CCTGCACCTC ACCCTGGAGG TGTTCACCGC GCTGCTCGAT GAGCCCGAAT TCCGTGGCCT GACAGCGGGA ATCGTGCTCC AGGCCTACCT CCCCGACGCC CCCGGAGCCC TCGATCGACT GCTGGAGTTC GCTCGCCGGC GCGTGGCCGA GGGAGGCGCC CCGGTCAAGG TACGTCTGGT GAAGGGCGCC AACCTGGCGA TGGAGCGGGT CGACGCCGAA CTGCACGACT GGCCACTGGC CACCTACGGC ACCAAGGCCG ATGTGGACGC CGACTACCTG CGCCTGCTCG ATACCGCGCT GCGCCCCGAG AACGCCGACG CCCTGCGGAT CGGCGTGGCC TCACAGAACC TCTTCTCCGT GGCCTATGCC GTCGAGCTCG CCGAACGCCG CGGCGTGCAG CGTCAACTCG ACGTGGAGAT GCTGCAGGGT ATGGCCCCGA TGGAGGCCGC CGCCGTGCGC GCCGACGTCG GCTCGCTGAT CCTCTACACC CCGGTGGTGC ACTCCGGCGA CTTCGACGTG GCCGTGAGCT ACCTGGTGCG CAGGCTGGAG GAGAACTCAT CGAGCGACAA CTTCCTGTAC TCGATGTTCA GCCCCGATCC CGCGGCGATC CCGCTGGAGG AGCAGCGATT CCGCACCGCG ATCGCCCGCC GCTCCGAGGT GCAGGACACC CCGAACCGAG TGCAGGACCG CGCACACGAC CCGATCGAAC CCCGGCGTGA CCGCTTCGTC GGCGAACCCG ATACCGACCC GTCGACCCCG GGCAACCGGG CGTGGGCCCG CGCCGCGCTT GCCGCCCCGG TCACCGTGAC CCCGCCCCCG CAGGTCACCG ATACGGCTGC CGTGGACACC GCGGTCGATA CCGCACTGCG GGCCCGCGAG GCCTGGGCGG CGCTCTCCCC CGCGGACCGC GCCGAGCACC TCCAGCGCGC CGCCGACGAA CTCGCCCGCC GCCGCGGCGA GCTCCTGGGC GTGATGACGC ACGAGGCCGG AAAGACCGTG GCCGAAGCGG ATCCGGAGAT CTCCGAGGCC ATCGACTTCG CCCGCTACTA CGCCCACAGC GCGCTGGATC TGGCCGATCA CGCCGACGGC GAGGCAGTGT TCACCCCGCA CCGGCTCGTG GTGGTCACCC CGCCGTGGAA CTTCCCGGTG GCGATCCCGC TCGGCGGTGT GCTCGCCGCG CTCGCCGCAG GATCCGCGGT GATCATCAAG CCCGCACCGC AGGTGCTGCG CTGCGGAACC GCTGCGATCG CGGCCCTGCA CGCTGCAGGC ATCCCGCGCG AGCTGGTGCA ACTGGTCAAC GCCGACGAGG CCGCTGCGGG CCGCCGTCTG GTGACGCACC CCGAGGTCGA TGCCGTCGTC CTCACCGGCG CCAGCGAGAC CGCGGCCCTG TTCCGCGGCT GGCGTCCCGA GCTCGACCTG CTGGCCGAGA CCTCCGGTAA GAACGCCATG ATCGTCACGC CCGCAGCCGA TCCCGACCTC GCGGTCAACG ATCTGGTGCG CTCGGCGTTC GGCCATGCCG GGCAGAAGTG CTCCGCGGCC TCGCTCGTGA TCGCCGTCGG CAGCGTCGGC ACGTCGAAGC GGTTCCTGGG CCAACTGGAG GACGCCGTGC GCACTCTCAC TGTGGGACCC GGCACCGACC TGGGCACCAG CGTGGGTCCC CTCATCGAAC CGGCCGCCGG AAAGCTACTG CGTGGGCTCA CCGAGCCCGG ACCGGGCGAG CATTGGCTGG TGCAGCCGCG CCGCCTCGAC GAGGCGGGCC GGCTGTGGAG CCCCGGCGTG CTCGACGGGG TGGCCGAGGG TAGCTGGTTC CACACCACCG AATTATTCGG CCCGGTGCTC GGCATCATGC GAGCCGCCAC TCTCGATGAT GCACTGCGCC TGCAGAATTC GACCGGCTAC GGCTTGACCG CGGGCTTGCA CAGCCTCGAC CCCGAGGAGA TCGCGCACTG GCGGGAGAAA GTGGAGGCCG GAAACCTCTA CATCAACCGG CACATGACCG GCGCGATCGT GCAGCGACAG TCCTTCGGCG GCTGGAAGCG CTCCTCCATC GGCCCCGGCG CCAAGGCCGG CGGACCCAAC TACGTGGCCC AATTCGGCCG CTGGTCCGAT ACCGAGTATC CCGACGTGCC CGCGAGCGCA CGCACGCTGT TCAGTGAGCG GATCATCGCT GCCGCGCAGC ATCTCTCCGC CGCCGACGTG CGCTGGTTGC ACGCCGCCGC CGCCTCCGAC CAGCGGGCCT GGGACGCCGA ATTCGGGCTC GAGCACGATC CCACCGGTCT GGCGTGCGAG GGCAACGACT TCCGCTACCG GCCGCTGCCC AAGCTGGAGG TGCGGGTCGG GCCCGGAGCT GCCCCGCGTG ATCTGGTGCG CCTGCAACTC GCGGCCGCGC AGACCGGTAC CCGACTCGAT GTGACCGTTG ATCCCGACGC GGTCGAGCGG GCCCCCGGCC AACCCGTGCA CACCGCCGAT GAGTACGCCG CCTCCCTCGC CGAGCGCGGC GAGGCGATCC GGATCCGCGT ACTCGGACAG CCGGAGCCCT CGGTCCTGGC GGCGGCCGCC GCACACGGTC ACAGCGTGTT GCGGGCCCCG GTGCTCTGGT CGGGCCGCCG GGAACTGCTC ACCATGCTGC GCGAGCAGGC CGTGAGTACC ACCCGGCACC GCTACGGGCA CGTCTCCGCC GAAAACGGCG CCTAG
|
Protein sequence | MVDLSPHSLP PAMVAGDDPA VVDAAVTRAD RWLRTSRGPR PTEVGRREAA ATSSLAALLH DPNGVEFTMG FVDQVARPED DRVAAKALRR LVSPTDGSTA AFMGGVDSAL LRVGTVAAGI APSIAMPVAR ARLRQLVGHL VFDADGDKLR RRLDRAREVG VQLNLNLLGE AVLGQGEADS RLRRTHELLA DPAVDYVSIK VSSVVAQLIP WDLEGNRDRI VERLRPLYRT ARDGGKFVNL DMEEYKDLHL TLEVFTALLD EPEFRGLTAG IVLQAYLPDA PGALDRLLEF ARRRVAEGGA PVKVRLVKGA NLAMERVDAE LHDWPLATYG TKADVDADYL RLLDTALRPE NADALRIGVA SQNLFSVAYA VELAERRGVQ RQLDVEMLQG MAPMEAAAVR ADVGSLILYT PVVHSGDFDV AVSYLVRRLE ENSSSDNFLY SMFSPDPAAI PLEEQRFRTA IARRSEVQDT PNRVQDRAHD PIEPRRDRFV GEPDTDPSTP GNRAWARAAL AAPVTVTPPP QVTDTAAVDT AVDTALRARE AWAALSPADR AEHLQRAADE LARRRGELLG VMTHEAGKTV AEADPEISEA IDFARYYAHS ALDLADHADG EAVFTPHRLV VVTPPWNFPV AIPLGGVLAA LAAGSAVIIK PAPQVLRCGT AAIAALHAAG IPRELVQLVN ADEAAAGRRL VTHPEVDAVV LTGASETAAL FRGWRPELDL LAETSGKNAM IVTPAADPDL AVNDLVRSAF GHAGQKCSAA SLVIAVGSVG TSKRFLGQLE DAVRTLTVGP GTDLGTSVGP LIEPAAGKLL RGLTEPGPGE HWLVQPRRLD EAGRLWSPGV LDGVAEGSWF HTTELFGPVL GIMRAATLDD ALRLQNSTGY GLTAGLHSLD PEEIAHWREK VEAGNLYINR HMTGAIVQRQ SFGGWKRSSI GPGAKAGGPN YVAQFGRWSD TEYPDVPASA RTLFSERIIA AAQHLSAADV RWLHAAAASD QRAWDAEFGL EHDPTGLACE GNDFRYRPLP KLEVRVGPGA APRDLVRLQL AAAQTGTRLD VTVDPDAVER APGQPVHTAD EYAASLAERG EAIRIRVLGQ PEPSVLAAAA AHGHSVLRAP VLWSGRRELL TMLREQAVST TRHRYGHVSA ENGA
|
| |