Gene Tpau_3952 details


Gene Information

Locus tag: Tpau_3952
Symbol:
ID: 9158133
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Tsukamurella paurometabola DSM 20162
Kingdom: Bacteria
Replicon accession: NC_014158
Strand:
Start bp: 4071702
End bp: 4073168
Gene Length: 1467 bp
Protein Length: 488 aa
Translation table: 11
GC content: 69%
IMG OID:
Product: Aldehyde Dehydrogenase
Protein accession: YP_003648863
Protein GI: 296141620
COG category:
COG ID:
TIGRFAM ID:
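
The start, end, gene length, and protein length fields in this record agree with each other, which a little arithmetic can confirm. The sketch below assumes 1-based inclusive coordinates and a gene length that includes one terminal stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4071702, 4073168)
print(length, protein_length(length))  # prints: 1467 488
```

Both values match the Gene Length and Protein Length fields of the record.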


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.73153
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACAACAC AGGCGGAACC GCAGGTGACG TACCGCATGT CGATCGACGG CCAGGCGCGC 
GAGAGCTCCT CGGGCGAGGT CATCGAGGTC CGTGACAAAT TCGACGGTGC GCTGCTCGGC
ACGATTCCCG CCGGCACCGC CGAGGACGCA CAGGAGGCGC TCGACGCCGC GCCCGACGGT
GCGCGGGCCT GGGCCGCCAC ACCCGCGCAC CGTCGGGCGG CCGTCCTCAA GGCGGCAGCG
GCGCAGGTCC GCGTCGACCG CGATGTGCTG AGTGCGATGC TGTCGGCGGA GAACGGTAAG
ACCATCGCGA ACGCGCGGCT GGAGATCGAC ACCACCGCCC GGATCTTGGA GGGCTTCGCC
GAGGAGGGCC TGCGCCTGTT CGGGCAGACC GTTCCGCTCG ACATCCAGGA GGGCCTGGAG
TCCGATCTCA TGCTGACGGT TCGTGAGCCG CTCGGCGTGA TGGTCGGCAT CGTGCCGTTC
AACTTCCCGG CCGAGCTCTA CGCGCACAAG GTGGGCGCGG CGCTGGCCGC CGGTAACGCG
ATGATCGTCA AACCGCCAGA AGATGATCCG ATCGTCACCA TGATGCTCAC CGAGATCCTG
CACCGTGCCG GCGTACCGCA CGCCGCCCTG CAGGTGGTCA CGGGATACGG GCACATCGTC
GGGCAGCACC TCTCCAGCAG CCCGGACATC GCGGCGGTCA CGTTCACCGG GAGCACCGAG
GTCGGAGCGA TCATCGCCGC GAACGCCGGC CGCAACATCG TCCGAGCGTT CCTCGAGCTG
AGCGGCAACG ACGCCTTCAT CGTGTGCGAC GATGCCGATC TCGACGCCGC CGTCGAGCAG
GCGATCGCCG GGCGCGTCTA TGCCAACGGG CAGGTGTGCG TGGCGACCAA GCGGATCATG
GTGGTGCGCA GCCGATACGA CGAGTTCCTC GAACGCCTGC GCGTGCGGGT GGCCGCGCTG
CGCACCGGCG ACCAGCGCGA CGAGGCCACC GACGTGGGCC CGCTGATCAG CGTCGCCGCC
GCGCGCACCG TCGAGGCGCA GATCGCGGCC TCGGTCGAGC AGGGCGCGCG ACTGCTCGTC
GGGGGCACCC GTGACGGGGC CTTCATCGCG CCCGCGCTGC TGGAAATCGA CACGCGGGTG
GGCATCGCCA CCGACGAGGA GATCTTCGGC CCCGTCTTCT CCGTGCTGTC CGTCGGCGAT
CTCGACGAGG CGATCGACCT CGCCAACGCA TCCCGATTCG GCCTCAACGC CGCCGTCTTC
ACCAGCGACG TCACGCGCGC GATCCAGGCC GGCCGGCGCA TCCAGGCCGG GATCGTCTCG
ATCAACGGCG GCAACGCCTA TCGGCCCGAT GTGGCCGCAT TCGGTGGCTA CAAGAAGAGC
GGCATCGGAC GCGAGGGACT CGGTTACACG CTCGACGAGT TCAGTCAGGT CAAGAGCATC
GTGCTGCGCG GCGTCCTCAA CTCCTGA
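
The GC content field can be recomputed from a sequence laid out in blocks of ten bases, as the gene sequence is here. This is a minimal sketch, assuming the percentage is simply the share of G and C among all bases; `gc_percent` is a hypothetical helper, and the fragment used is only the first line of the gene sequence, so its value is not expected to equal the whole-gene figure.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First line of the gene sequence only (60 of the 1467 bases).
fragment = "ATGACAACAC AGGCGGAACC GCAGGTGACG TACCGCATGT CGATCGACGG CCAGGCGCGC"
print(round(gc_percent(fragment), 1))
```

Passing the full gene sequence instead of the fragment gives the value to compare against the GC content field.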
 
Protein sequence
MTTQAEPQVT YRMSIDGQAR ESSSGEVIEV RDKFDGALLG TIPAGTAEDA QEALDAAPDG 
ARAWAATPAH RRAAVLKAAA AQVRVDRDVL SAMLSAENGK TIANARLEID TTARILEGFA
EEGLRLFGQT VPLDIQEGLE SDLMLTVREP LGVMVGIVPF NFPAELYAHK VGAALAAGNA
MIVKPPEDDP IVTMMLTEIL HRAGVPHAAL QVVTGYGHIV GQHLSSSPDI AAVTFTGSTE
VGAIIAANAG RNIVRAFLEL SGNDAFIVCD DADLDAAVEQ AIAGRVYANG QVCVATKRIM
VVRSRYDEFL ERLRVRVAAL RTGDQRDEAT DVGPLISVAA ARTVEAQIAA SVEQGARLLV
GGTRDGAFIA PALLEIDTRV GIATDEEIFG PVFSVLSVGD LDEAIDLANA SRFGLNAAVF
TSDVTRAIQA GRRIQAGIVS INGGNAYRPD VAAFGGYKKS GIGREGLGYT LDEFSQVKSI
VLRGVLNS
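
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table by hand rather than using a library; translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons, which does not matter here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G at each position) with their amino acids.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    dna = "".join(dna.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First 30 and last 27 bases of the gene sequence.
print(translate("ATGACAACAC AGGCGGAACC GCAGGTGACG"))  # MTTQAEPQVT
print(translate("GTGCTGCGCG GCGTCCTCAA CTCCTGA"))     # VLRGVLNS*
```

The two outputs match the first and last blocks of the protein sequence, with the trailing `*` corresponding to the TGA stop codon that is not written in the protein record.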