Gene Tpau_2558 details

Gene Information

Locus tag: Tpau_2558
Symbol:
ID: 9156719
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Tsukamurella paurometabola DSM 20162
Kingdom: Bacteria
Replicon accession: NC_014158
Strand:
Start bp: 2654323
End bp: 2655813
Gene Length: 1491 bp
Protein Length: 496 aa
Translation table: 11
GC content: 68%
IMG OID:
Product: Aldehyde Dehydrogenase
Protein accession: YP_003647500
Protein GI: 296140257
COG category:
COG ID:
TIGRFAM ID:
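
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; the function names `gene_length` and `protein_length` are illustrative, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(2654323, 2655813)
print(length, protein_length(length))  # prints: 1491 496
```

Both results agree with the Gene Length and Protein Length values listed in the record.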


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.141176
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCGCAG CAGAGACCAA GACGTTCGAT TCACTGAACC CGCGCACGGG CGATGTCGTC 
GCCAGCTACC CGATCCACTC GGCTGACCAC GTCCACGCCG TGGTCGCCCG AGCCCGCGAA
CAGGCCGACT GGTGGCAGGA ACTCGGCTTC GAGGGGCGCA AGCACCAACT GAACAAATGG
AAGGGCGTCA TCACCCGCCG GATCAACCAG TTGGCGCAGA TCGTCCACGA CGAGACCGGC
AAACCCCACG GCGATGCCCT GCTCGAAGCC GCGCTCGGCA TCGACCACCT CGGCTACGCC
GCTTCGCACG CGAAGAAGGT ACTCGGTCCC AAGCGGGTCT CGTCGGGCCT GGTCATGGCG
AATCAGGCGG CGACGGTGCG CTATCACCCG CTCGGCGTGG TCGGCGTGAT CGGACCGTGG
AACTACCCCG TCTTCACCCC CATGGGCTCG ATCGCCTACG CCTTGGCCGC GGGTAATGCC
GTCGTCTTCA AGCCGTCCGA GTACACGCCC GGTGTGGGGG TCTGGCTCGC CCGCACCTTC
GAGGAGGCCG TGGGCCGCCC GGTTTTGCAG ACGGTGACGG GCTTCGGCGA GACCGGCAAC
GCGCTGTGCA CCTCCGGCGT GGGCAAGCTC GCCTTCACCG GGTCGACCAA TACCGGCAAG
AAGGTCATGG CCGCGTGCGC CGAGACATTG ACGCCGGTGG TGATCGAGGC CGGCGGCAAG
GACGCCTTCC TGGTGGACCG GGACGCCGAT CTCGAGGCCG CTGCCGACGC CGCCGCGTGG
GGCGCCTTCG CCAACGCCGG TCAGACCTGC GTCGGCGTCG AGCGGGTCTA CGTGCACAAG
GACGTCTACG ACCCGTTCCT GGACAAGCTC GTCGCGAAGG CCCGCGAGGT CACCGCGAAC
GCTTCGGACG ATTCCAAGAT CGGCCCGATC ACCATGCCCA GCCAGCTACC GATCATCAAG
TCGCACATCG ACGACGCCCT CGCCCGCGGC GGGCGAGCGC TGGTCGGCGG TGCCGATGCG
GTCGGCGAGC GGTTCGTCCA GCCGACGGTG CTCGTCGACG TCCCGGAGGA TTCGATCGCG
GTCACCGAGG AGACCTTCGG CCCCACCGTG ACGGTCGCGA AGGTGGAGTC GATGGACGAG
GCGGTGGAGA AGGCGAACGC CACCCGCTAC GGCCTGGCGG CGACGGTCTT CTCGAAGGCC
CGCGGAATGG AGCTCGCCGA CAAGATCCGG TCGGGTATGG CCTCGGTGAA CGGCATCATC
ACCTTCGCGG GTGTGCCGAA CCTGCCGTTC GGCGGCGTGG GCGACTCCGG CTTCGGCCGC
ATCCACGGCG CGGACGGGCT CCGCGAATTC AGCTACGCCA AGGGCATCGC GCGCAAGCGG
TTCACTCCGC TGCTCAACCT CACCAGCTTC GCGCGCACCA AGGCGCAGGA GGGACAGCTC
GCGCAGATCG TCACGCTGCT GCACGGTCGG CAGGGCACGA TCGAGAAGTA G
 
Protein sequence
MTAAETKTFD SLNPRTGDVV ASYPIHSADH VHAVVARARE QADWWQELGF EGRKHQLNKW 
KGVITRRINQ LAQIVHDETG KPHGDALLEA ALGIDHLGYA ASHAKKVLGP KRVSSGLVMA
NQAATVRYHP LGVVGVIGPW NYPVFTPMGS IAYALAAGNA VVFKPSEYTP GVGVWLARTF
EEAVGRPVLQ TVTGFGETGN ALCTSGVGKL AFTGSTNTGK KVMAACAETL TPVVIEAGGK
DAFLVDRDAD LEAAADAAAW GAFANAGQTC VGVERVYVHK DVYDPFLDKL VAKAREVTAN
ASDDSKIGPI TMPSQLPIIK SHIDDALARG GRALVGGADA VGERFVQPTV LVDVPEDSIA
VTEETFGPTV TVAKVESMDE AVEKANATRY GLAATVFSKA RGMELADKIR SGMASVNGII
TFAGVPNLPF GGVGDSGFGR IHGADGLREF SYAKGIARKR FTPLLNLTSF ARTKAQEGQL
AQIVTLLHGR QGTIEK
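
The sequence blocks above are laid out in groups of ten characters, so the whitespace has to be stripped before any analysis. The following is a minimal sketch, not the database's own tooling, of how the GC content could be recomputed and how the gene sequence could be translated for comparison with the protein sequence. It assumes the codon-to-amino-acid assignments of translation table 11 (which match the standard code) and stops at the first stop codon; the helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Codon assignments for translation table 11, in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from this record.
print(translate("ATGACCGCAG CAGAGACCAA GACGTTCGAT"))  # prints: MTAAETKTFD
```

Passing the full gene sequence to `translate` and comparing the result with `clean` applied to the protein sequence gives a simple consistency check; `gc_content` on the full gene sequence can be compared with the GC content field of the record.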