Gene Tpau_2054 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpau_2054 
Symbol 
ID9156209 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTsukamurella paurometabola DSM 20162 
KingdomBacteria 
Replicon accessionNC_014158 
Strand
Start bp2138555 
End bp2139886 
Gene Length1332 bp 
Protein Length443 aa 
Translation table11 
GC content67% 
IMG OID 
Producttryptophan synthase, beta subunit 
Protein accessionYP_003647005 
Protein GI296139762 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.204444 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGGGATA CTGGGCTGGT GAGTCTCCCA GAAGCCAGCA CCGGACTGTC CCAGCGATCG 
GCGTACGAGC CGGATGGGAC CGGGCACTGG GGAGTCTGGG GCGGTAGACA CGTACCTGAA
GCGCTGATGG CTGTGATCGA GGAGGTCACG GCCGAATTCG AGAAGGTCCG CAGCGATCGC
GAGTTCCTGG CGGAGCTCGA CCGGCTCCAG CGGCAGTACA CCGGTCGCCC GTCTCCCTTG
TACCCGTGCA CCCGGCTCGG CGAGCATGCC GGCGGTGCGT CGATCCTCCT CAAGCGCGAG
GACCTCAACC ACACCGGCAG TCACAAGATA AACAACGTGC TCGGACAAGT GCTGCTCGCC
AAGCGGATGG GCAAGACGCG GGTGATCGCG GAGACCGGCG CCGGTCAGCA CGGCGTGGCA
ACGGCGACGG CATGCGCGCT GCTCGGCATC GAGTGCGTGA TCTTCATGGG AAAGGTCGAT
ACCGACCGGC AGGCGCTCAA CGTAGCCAGG ATGCGGTTGC TCGGCGCCGA GGTGATCGCG
GTGGCGTCCG GATCGGCGAC TCTCAAAGAC GCGATCAACG AGGCGCTTCG GGATTGGGTC
AGTCACGCCG ACGACACCTA CTACTGTTTC GGCACCGCCG CCGGGCCGCA TCCGTTCCCG
CTGCTGGTGC GAGACCTGCA GCGCATTGTC GGAATGGAGG CACGGGAGCA GGTGCTCGAC
CTCGCCGGGC GGCTCCCGGA CGCCGTAGTC GCCTGTGTCG GCGGCGGGTC CAATGCCATC
GGCATCTTCC ACCCCTTCAT TCCGGACGAG GGCGTACGCC TGGTCGGGTG TGAAGCCGCC
GGCGACGGAG TCGAAACCGG CAGGCACGCG GCCACCTTCA CCGGCGGGAC GCCCGGTGCC
TTCCAGGGGG CGTACTCGTA TCTGTTGCAG GACGACGACG GACAGACGAT CGAATCGCAC
TCGATCTCCG CCGGGCTCGA TTATCCGGGC GTCGGCCCGG AGCACGCCGA GCTCAAGGAG
TCCGGTCGCG CCGAATACGT CCCGATCACC GACGCCGAGG CGATGGACGC CTTCGAGTTG
TTGTGCCGGA CCGAGGGGAT CATCCCGGCG ATCGAGTCCG CGCACGCGGT GGCCGGCGCG
CTCAAACTGG GGGCCGAACT CGGCCCGGGG GCGGTGATCG TGGTGAACCT CTCCGGTCGC
GGAGACAAGG ACGTCGACAC CGCCGGGCGG TGGTTCGGGC TGCTCGACGA CGAAGGGAAC
GTAGTGGGGC AGAAGGTGAT CGGTGAGGAT CTCAACACCG AATTCCGCGA CGCCGAGGAG
GCCGCGAAAT GA
 
Protein sequence
MRDTGLVSLP EASTGLSQRS AYEPDGTGHW GVWGGRHVPE ALMAVIEEVT AEFEKVRSDR 
EFLAELDRLQ RQYTGRPSPL YPCTRLGEHA GGASILLKRE DLNHTGSHKI NNVLGQVLLA
KRMGKTRVIA ETGAGQHGVA TATACALLGI ECVIFMGKVD TDRQALNVAR MRLLGAEVIA
VASGSATLKD AINEALRDWV SHADDTYYCF GTAAGPHPFP LLVRDLQRIV GMEAREQVLD
LAGRLPDAVV ACVGGGSNAI GIFHPFIPDE GVRLVGCEAA GDGVETGRHA ATFTGGTPGA
FQGAYSYLLQ DDDGQTIESH SISAGLDYPG VGPEHAELKE SGRAEYVPIT DAEAMDAFEL
LCRTEGIIPA IESAHAVAGA LKLGAELGPG AVIVVNLSGR GDKDVDTAGR WFGLLDDEGN
VVGQKVIGED LNTEFRDAEE AAK