Gene Tpau_4229 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpau_4229 
Symbol 
ID9158417 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTsukamurella paurometabola DSM 20162 
KingdomBacteria 
Replicon accessionNC_014158 
Strand
Start bp4364240 
End bp4367674 
Gene Length3435 bp 
Protein Length1144 aa 
Translation table11 
GC content71% 
IMG OID 
ProductAldehyde Dehydrogenase 
Protein accessionYP_003649136 
Protein GI296141893 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTTGACC TCTCTCCTCA TTCACTCCCC CCTGCCATGG TCGCCGGTGA CGATCCCGCG 
GTGGTCGATG CCGCGGTTAC CCGCGCCGAT CGGTGGTTGC GCACCAGCCG CGGCCCGCGA
CCCACCGAGG TAGGCCGCCG CGAGGCCGCC GCCACGTCCA GTCTCGCTGC GCTGCTCCAT
GATCCGAACG GCGTCGAGTT CACCATGGGA TTCGTCGATC AGGTCGCGCG GCCCGAGGAC
GACCGCGTGG CCGCCAAGGC ACTGCGCCGG CTCGTCTCCC CCACCGACGG CTCCACCGCC
GCGTTCATGG GCGGCGTCGA CTCCGCGCTA CTGCGGGTGG GCACCGTCGC CGCCGGTATC
GCACCGTCGA TCGCCATGCC GGTGGCCCGT GCCCGGCTGC GGCAGCTGGT GGGGCACCTG
GTGTTCGACG CCGACGGCGA CAAGCTGCGC CGCCGCCTGG ACCGTGCCCG CGAGGTGGGC
GTGCAACTGA ACCTCAATCT GCTGGGCGAG GCCGTCCTCG GCCAGGGCGA GGCCGACAGC
CGGCTGCGCC GCACGCACGA GCTGCTCGCG GACCCCGCGG TCGACTACGT CTCCATCAAG
GTGTCCTCCG TGGTGGCCCA GCTGATCCCC TGGGACCTGG AGGGCAACCG CGACCGCATC
GTCGAGCGGC TGCGTCCGCT GTACCGCACC GCCCGGGACG GAGGGAAGTT CGTCAACCTC
GATATGGAGG AGTACAAAGA CCTGCACCTC ACCCTGGAGG TGTTCACCGC GCTGCTCGAT
GAGCCCGAAT TCCGTGGCCT GACAGCGGGA ATCGTGCTCC AGGCCTACCT CCCCGACGCC
CCCGGAGCCC TCGATCGACT GCTGGAGTTC GCTCGCCGGC GCGTGGCCGA GGGAGGCGCC
CCGGTCAAGG TACGTCTGGT GAAGGGCGCC AACCTGGCGA TGGAGCGGGT CGACGCCGAA
CTGCACGACT GGCCACTGGC CACCTACGGC ACCAAGGCCG ATGTGGACGC CGACTACCTG
CGCCTGCTCG ATACCGCGCT GCGCCCCGAG AACGCCGACG CCCTGCGGAT CGGCGTGGCC
TCACAGAACC TCTTCTCCGT GGCCTATGCC GTCGAGCTCG CCGAACGCCG CGGCGTGCAG
CGTCAACTCG ACGTGGAGAT GCTGCAGGGT ATGGCCCCGA TGGAGGCCGC CGCCGTGCGC
GCCGACGTCG GCTCGCTGAT CCTCTACACC CCGGTGGTGC ACTCCGGCGA CTTCGACGTG
GCCGTGAGCT ACCTGGTGCG CAGGCTGGAG GAGAACTCAT CGAGCGACAA CTTCCTGTAC
TCGATGTTCA GCCCCGATCC CGCGGCGATC CCGCTGGAGG AGCAGCGATT CCGCACCGCG
ATCGCCCGCC GCTCCGAGGT GCAGGACACC CCGAACCGAG TGCAGGACCG CGCACACGAC
CCGATCGAAC CCCGGCGTGA CCGCTTCGTC GGCGAACCCG ATACCGACCC GTCGACCCCG
GGCAACCGGG CGTGGGCCCG CGCCGCGCTT GCCGCCCCGG TCACCGTGAC CCCGCCCCCG
CAGGTCACCG ATACGGCTGC CGTGGACACC GCGGTCGATA CCGCACTGCG GGCCCGCGAG
GCCTGGGCGG CGCTCTCCCC CGCGGACCGC GCCGAGCACC TCCAGCGCGC CGCCGACGAA
CTCGCCCGCC GCCGCGGCGA GCTCCTGGGC GTGATGACGC ACGAGGCCGG AAAGACCGTG
GCCGAAGCGG ATCCGGAGAT CTCCGAGGCC ATCGACTTCG CCCGCTACTA CGCCCACAGC
GCGCTGGATC TGGCCGATCA CGCCGACGGC GAGGCAGTGT TCACCCCGCA CCGGCTCGTG
GTGGTCACCC CGCCGTGGAA CTTCCCGGTG GCGATCCCGC TCGGCGGTGT GCTCGCCGCG
CTCGCCGCAG GATCCGCGGT GATCATCAAG CCCGCACCGC AGGTGCTGCG CTGCGGAACC
GCTGCGATCG CGGCCCTGCA CGCTGCAGGC ATCCCGCGCG AGCTGGTGCA ACTGGTCAAC
GCCGACGAGG CCGCTGCGGG CCGCCGTCTG GTGACGCACC CCGAGGTCGA TGCCGTCGTC
CTCACCGGCG CCAGCGAGAC CGCGGCCCTG TTCCGCGGCT GGCGTCCCGA GCTCGACCTG
CTGGCCGAGA CCTCCGGTAA GAACGCCATG ATCGTCACGC CCGCAGCCGA TCCCGACCTC
GCGGTCAACG ATCTGGTGCG CTCGGCGTTC GGCCATGCCG GGCAGAAGTG CTCCGCGGCC
TCGCTCGTGA TCGCCGTCGG CAGCGTCGGC ACGTCGAAGC GGTTCCTGGG CCAACTGGAG
GACGCCGTGC GCACTCTCAC TGTGGGACCC GGCACCGACC TGGGCACCAG CGTGGGTCCC
CTCATCGAAC CGGCCGCCGG AAAGCTACTG CGTGGGCTCA CCGAGCCCGG ACCGGGCGAG
CATTGGCTGG TGCAGCCGCG CCGCCTCGAC GAGGCGGGCC GGCTGTGGAG CCCCGGCGTG
CTCGACGGGG TGGCCGAGGG TAGCTGGTTC CACACCACCG AATTATTCGG CCCGGTGCTC
GGCATCATGC GAGCCGCCAC TCTCGATGAT GCACTGCGCC TGCAGAATTC GACCGGCTAC
GGCTTGACCG CGGGCTTGCA CAGCCTCGAC CCCGAGGAGA TCGCGCACTG GCGGGAGAAA
GTGGAGGCCG GAAACCTCTA CATCAACCGG CACATGACCG GCGCGATCGT GCAGCGACAG
TCCTTCGGCG GCTGGAAGCG CTCCTCCATC GGCCCCGGCG CCAAGGCCGG CGGACCCAAC
TACGTGGCCC AATTCGGCCG CTGGTCCGAT ACCGAGTATC CCGACGTGCC CGCGAGCGCA
CGCACGCTGT TCAGTGAGCG GATCATCGCT GCCGCGCAGC ATCTCTCCGC CGCCGACGTG
CGCTGGTTGC ACGCCGCCGC CGCCTCCGAC CAGCGGGCCT GGGACGCCGA ATTCGGGCTC
GAGCACGATC CCACCGGTCT GGCGTGCGAG GGCAACGACT TCCGCTACCG GCCGCTGCCC
AAGCTGGAGG TGCGGGTCGG GCCCGGAGCT GCCCCGCGTG ATCTGGTGCG CCTGCAACTC
GCGGCCGCGC AGACCGGTAC CCGACTCGAT GTGACCGTTG ATCCCGACGC GGTCGAGCGG
GCCCCCGGCC AACCCGTGCA CACCGCCGAT GAGTACGCCG CCTCCCTCGC CGAGCGCGGC
GAGGCGATCC GGATCCGCGT ACTCGGACAG CCGGAGCCCT CGGTCCTGGC GGCGGCCGCC
GCACACGGTC ACAGCGTGTT GCGGGCCCCG GTGCTCTGGT CGGGCCGCCG GGAACTGCTC
ACCATGCTGC GCGAGCAGGC CGTGAGTACC ACCCGGCACC GCTACGGGCA CGTCTCCGCC
GAAAACGGCG CCTAG
 
Protein sequence
MVDLSPHSLP PAMVAGDDPA VVDAAVTRAD RWLRTSRGPR PTEVGRREAA ATSSLAALLH 
DPNGVEFTMG FVDQVARPED DRVAAKALRR LVSPTDGSTA AFMGGVDSAL LRVGTVAAGI
APSIAMPVAR ARLRQLVGHL VFDADGDKLR RRLDRAREVG VQLNLNLLGE AVLGQGEADS
RLRRTHELLA DPAVDYVSIK VSSVVAQLIP WDLEGNRDRI VERLRPLYRT ARDGGKFVNL
DMEEYKDLHL TLEVFTALLD EPEFRGLTAG IVLQAYLPDA PGALDRLLEF ARRRVAEGGA
PVKVRLVKGA NLAMERVDAE LHDWPLATYG TKADVDADYL RLLDTALRPE NADALRIGVA
SQNLFSVAYA VELAERRGVQ RQLDVEMLQG MAPMEAAAVR ADVGSLILYT PVVHSGDFDV
AVSYLVRRLE ENSSSDNFLY SMFSPDPAAI PLEEQRFRTA IARRSEVQDT PNRVQDRAHD
PIEPRRDRFV GEPDTDPSTP GNRAWARAAL AAPVTVTPPP QVTDTAAVDT AVDTALRARE
AWAALSPADR AEHLQRAADE LARRRGELLG VMTHEAGKTV AEADPEISEA IDFARYYAHS
ALDLADHADG EAVFTPHRLV VVTPPWNFPV AIPLGGVLAA LAAGSAVIIK PAPQVLRCGT
AAIAALHAAG IPRELVQLVN ADEAAAGRRL VTHPEVDAVV LTGASETAAL FRGWRPELDL
LAETSGKNAM IVTPAADPDL AVNDLVRSAF GHAGQKCSAA SLVIAVGSVG TSKRFLGQLE
DAVRTLTVGP GTDLGTSVGP LIEPAAGKLL RGLTEPGPGE HWLVQPRRLD EAGRLWSPGV
LDGVAEGSWF HTTELFGPVL GIMRAATLDD ALRLQNSTGY GLTAGLHSLD PEEIAHWREK
VEAGNLYINR HMTGAIVQRQ SFGGWKRSSI GPGAKAGGPN YVAQFGRWSD TEYPDVPASA
RTLFSERIIA AAQHLSAADV RWLHAAAASD QRAWDAEFGL EHDPTGLACE GNDFRYRPLP
KLEVRVGPGA APRDLVRLQL AAAQTGTRLD VTVDPDAVER APGQPVHTAD EYAASLAERG
EAIRIRVLGQ PEPSVLAAAA AHGHSVLRAP VLWSGRRELL TMLREQAVST TRHRYGHVSA
ENGA