Gene Tpau_4187 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpau_4187 
Symbol 
ID9158375 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTsukamurella paurometabola DSM 20162 
KingdomBacteria 
Replicon accessionNC_014158 
Strand
Start bp4312008 
End bp4313498 
Gene Length1491 bp 
Protein Length496 aa 
Translation table11 
GC content70% 
IMG OID 
ProductAldehyde Dehydrogenase 
Protein accessionYP_003649095 
Protein GI296141852 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCGCAG CTCCCGAGAC CTCGACCGGA GAGCTCATCT CCACCAACCC CCGTACCGGC 
GCCGAGGTGG CCCGCTTCGC CATCGCCGAT GCCGCAGCGG TGGACGCCGC CGTCGCCACC
GCGCACACCG CTGCGCAGTG GTGGGGCGGC CTGGAGCCGA AAGCCCGCCG CAGTTGGCTG
CTCCGCTTCC GCGCCGAGCT CTCGCGTCGC GCTGAGGACC TCGCCGCGGT GGTCGCCGCC
GAGACCGGCA AGCCCGTCGA CGACGCGCTC CTCGAGGTGA TGCTCGCGGT GGTGCACCTC
GATTGGGCCG CCAAGAACGC CGAGAAGGTG CTGGGCCGGC GCAGCGTCGG CACCGGCATG
CTGGGCGCGA ACCTCGCCGC CACCGTCGAG TACCGCCCGT TCGGTGTGGT CGGGGTGATC
GGCCCGTGGA ATTATCCCGT CTACACCCCG ATGGGGTCGA TCTCGTATGC CCTCGCCGCC
GGTAACGCCA TCGTCTTCAA GCCCAGCGAA CTGACCCCCG CCGTGGGCCA GTTCCTGGCC
GATACCTGGG CCGCCGCCTG CCCCGGCCAG CCCGTGCTGC AGGCCATCCA CGGCGCCGGC
GAGACCGGCG CCGCGCTGTG CCGCTCCGCC GTGGACAAGC TGGCGTTCAC CGGTTCTGCC
GCGACCGCCC GCCGGGTCAT GGCCACCTGC GCGGAGAACC TCACCCCCGT CGCCATCGAG
GGCGGCGGCA AGGACGCCTT CATCGTGGAC TCCGATGCGA ACATCGATAG CGCGGTCGAT
GCCGCGGTCT TCGGCGCCTT CGGCAACGCC GGCCAGACCT GCGCCGGCGT CGAGCGCGTC
TACGTGGTCG GCGACAAGTA CGACGAATTC GTCGACAAAC TCGCCGCGAA ATCTCGTGAG
ATCCACGGAG GGTCCGAGGA TTCCGCCGAC TACGGCCCGG CCACCATGCA CAAGCAACTC
ACGGTGATCG CCAGCCACAT CGATGACGCC CTCAACCGCG GCGGCCGCGC CATCGTCGGC
GGCCGGGAAT CCGTGGGCGA GAACACCGTC CAGCCGGTCG TCCTGGTCGA CGTGCCGGAG
GACTCCACGG CCGTCACCGA GGAGACCTTC GGGCCCACCG TGGTGGTGAA CCGCGTCAAG
GACATCGACG AGGCGATCGA CCGCGCCAAC AACAGCACCT ACGGACTGTC CGCCGCGATC
ATGACCAAGG ACCTGAACAG GGGCCGAGAG CTGGCGCGCA AGCTGCGCAC CGGTGCGGTG
GCCGTCAATT CCTTCCTCTC CTTCGCCTCG GTACCCGCAC TGCCTTTCGG CGGCATCGGC
GACTCCGGCT TCGGCCGCAT CCACGGCGCC GACGGCCTGC GCGAGTTCAG TCGCCCGCAG
TCCGTTGCGG CGCAGAAGTT CGCGCTGCCG ATGAACCTGC TCACCTTCAA TCGCAAGGCG
CGCGATATGA AGACCGTGCG GATGATGCTC AGCAAGGTCT ACTCGCGGTG A
 
Protein sequence
MTAAPETSTG ELISTNPRTG AEVARFAIAD AAAVDAAVAT AHTAAQWWGG LEPKARRSWL 
LRFRAELSRR AEDLAAVVAA ETGKPVDDAL LEVMLAVVHL DWAAKNAEKV LGRRSVGTGM
LGANLAATVE YRPFGVVGVI GPWNYPVYTP MGSISYALAA GNAIVFKPSE LTPAVGQFLA
DTWAAACPGQ PVLQAIHGAG ETGAALCRSA VDKLAFTGSA ATARRVMATC AENLTPVAIE
GGGKDAFIVD SDANIDSAVD AAVFGAFGNA GQTCAGVERV YVVGDKYDEF VDKLAAKSRE
IHGGSEDSAD YGPATMHKQL TVIASHIDDA LNRGGRAIVG GRESVGENTV QPVVLVDVPE
DSTAVTEETF GPTVVVNRVK DIDEAIDRAN NSTYGLSAAI MTKDLNRGRE LARKLRTGAV
AVNSFLSFAS VPALPFGGIG DSGFGRIHGA DGLREFSRPQ SVAAQKFALP MNLLTFNRKA
RDMKTVRMML SKVYSR