Gene Tpau_4321 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpau_4321 
Symbol 
ID9158503 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTsukamurella paurometabola DSM 20162 
KingdomBacteria 
Replicon accessionNC_014159 
Strand
Start bp80430 
End bp83558 
Gene Length3129 bp 
Protein Length1042 aa 
Translation table11 
GC content77% 
IMG OID 
ProductAcyl transferase 
Protein accessionYP_003649221 
Protein GI296141979 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.113206 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTTCCC CGACCCTCTC CCGGTCCGGT TCGACGCCGG GGCTCCGTCC CCGGATCGAC 
GGTGCACCCC TGCTCCCGGA CGGCCTCCCG GCCTTCGTGC TCAGCGCCGA CGGTGCCGAT
TCCCTCGCCG CCGCCGCCGC GCAGCTCCGC CGCTACCTCA CCGAGCACCC GCAGGTACCG
CTGCCCGCCG TCGCCGCGGC GCTGGTCGCG ACGCGCGACC TGCGGCGGCA CCGCGCGATC
GTGCACACCG ACGACCGCGG CGAGCTCGTC ACCGCCCTCG ACGCGCTGGC AGCGGACCCG
GTGCGCCCGG TCGGTACCGC CGTTCCCGGC CTGCACACGG GCGTCGCCGC CGCGGGCGCG
CCGGCGTTCG TCTTCCCGGG CCAGGGCAGC CAGCGCCGGG GCATGGGCGC GCTGTTCCAC
CGCGAGTCCG CGGTGTACCG CGACACGGTC GAGAAGATCC ACGCGATCGC GCTGGAGGTC
TTCGGCTCGT CGGCCCGCGA CTACCTGCTC GGCACCGGCG AGTGGGAGCA GGGCGGCCGG
CCCGTCCCCG TCGAGGTGGT CCAGCCCGCG ATCTTCATGC AGGCGCTCGG CCTGGCCGCG
ATGTGGCGGG CAGCCGGCAT CGAGCCGCGG ATCCACGTCG GGCACAGCCA GGGTGAGATC
GCCGCGGCGG TCGCCGCGGG CACCGTCGGC CTCGCCGACG GGCTGCGGCT GGTCACGCGG
CGCGCCCTCG CGGTCCGCGA CAACGCGCCC ACCGGGCATT CGATGGCGGT GCTCGGCACC
GACCGCGAGC GGTGCGCGGC CATGCTGGCC CGGACCGTCG GCTTCGCGGA ACTGTCCGTG
GTGAACTCGG CGCACGTGCT GTGCGTCAGC GGTGAGCGCG ACGTGGTGGC GGGGCTGGTC
GCGCAGGCCA CCGAGCGGGG GATCTTCGCC CGCGAGATCC GGGTCGAGTA CCCCGCGCAC
ACCTCGCTCG TGGGCTCCAT GTGGCACCTG CGGGAGAAGT GGATGGGCGT GATGGACCAT
CCCGCGTTCC TGCCCACCGA GCACCTGCTC ATCGGCGGCA CGCTCGGCGA GGCCGTCCCC
GCCGACACCG ATTTCCTCGA CTACTGGTTC TGGAACCTCA AGAACCCGGT CCGGTTCGAC
CTCGCGACGC GGGCCGCACT CGACGCGGGC GCCGACCGGC TGATCGAGCT CGCCGAGCAC
CCCACGCTGC AGCTCGCACT GCACGAGAAC ATCGCCGACG CCGGTGCCCG CGCCACCGTG
ACCGGCAGCT CGCGGCGCGA CGCGACCGAC CTGTCGGAAT TCTCCTCCGC GGTCGCCGAC
GTGCTCGTGA CCCACGCCCG CGACCTCGAC GTCCAGCGCA CACTGGTCGC GGAGGAACTG
CCGCCCGGCT TCCCCGCGGC CCCGCTGTCC CGGCAACGCC TGTGGGCCGC CCTCCCGGGC
CGGCCCGCGA CCGCGGGGCC CGTCCGGCCC CGCACGCGTG TCCTCGACAC CGTCTGGACC
GATCTCGACG CACCGGTCAG CGCGCCACCG CGACCGCTCG CGATCATCGA CCCCACCGGG
GCGCACGCCG ACCTGGCCGC CGCCCTCCTC GACGCCGCGG CCCGGTACGG GACACCCGCC
CGCCGTAGCG ACCGCGCCGG TGACGACGAG ATCGCGGTGG TCCTCGTCCC GGGATCCGCG
GAGACCGACA CGACGATCGC CGCCGTCGGC GAGTTGCTCG CCGATCGCCG CTGGTGGGCC
GGAGTACAGC CCGCGGCCGG GATCGCCGCC GTCACGGCGG GGGCGGTCGT CGCGGACCCG
GCCGACCCGG GGCCCGACGG TGCCGCCGCG GCGATCGCCG TCGGTTTCCG CGCGTTCGGC
GCGGACCTGC CCGGCGTCGA GGTCCGGCAC CTCGATCTCG ATCCGCGCGC CGACGCGGCG
GCGCAGGCCG GCACCGCGAT CCACGCGCTG CACGTGGCGG GCGAGCCGCG TCTGGCCCTG
CGGTCCGGCG CCGTGCGCGC CGAACGCTGG GTCGACGCCG AGCCCACCGA AACGGGCACG
GAGCCTCCTG CGGGACTGCT GGAGACTGCG CGGAACGTAG TGATCAGCGG GGGTACCGGA
CACCTGGGGC TCGCGTTCGC GGCGCACGCG GCGGCCCACG GCGCCGCCTC GGTCACTCTG
CTCTCCCGCT CGGGCGGGGG AACGACCGTC CGGCACGCGC TGGCCCGGAT CGCCCGCCGC
CATCCCGCCT GCGCGGTCAC GGTGGTGCCG TGCGATGTCA CGGACCCGGC GGCCGTCGCC
ACCGCCCTCG CCGGCGCCGG GCGCGCGATC GACCTCGTCG TGCACGCCGC CGTCGGATAT
CGACGTTGTG CCGCAACGGA TCTCGATGCA TCGGACTTCA CCGCCGCGGC CGCGGCCAAG
GTGGGCGGAC TACGCACGCT CGCGCAGGCG GTGCCGGATG CGACGCTGCT GACCTGCAGC
TCCGCCGCCG CCGCGCTTCC CGGCGCGGGC CAGGCCTGGT ACGCCGCGTC GAACACGCTG
GCCGAGGCCG AGGCCGCCGC GCTCCGGCGG GCCGGACGAC GGGCCGCCGC CGTGCGGTGG
GGCCTGTGGG AGCAGGCCGG TCCGCTCGAC GAGGCGGGCT TCGCGGCCGT CACGGCGGCC
GGGGTGATCC CGCTGGCCGC CCCCGACGCG CTGGCCGCGC TCGCCCGCTC GACCGGTCCC
GAGCCAGTGA TCACCGCCGT CGACCTGCCG CGGCTGCGCG ATGTCGCCGC GGCGTTCGGC
GCGGCGGCGC TGCTCACCGA TCTCACCGAG GACACCACCC CCGCCGTTCC GGCACCGTCG
GACGCGGTCC CCGGGACAGC GCCCGACGCC CCCGCGACCG TGACCGTGAC CACCGACGCC
GCCCCCGGAC CCGCGCCGGG CGGGGGCGCC GATGTCGCGG CGGTGCTCCG GCACCACCTC
GCGCGGGCGC TCGCGGTGCC CGCGGACACG CTGGACCCCG ATGTCGCGCT CGTCGCCCTG
GGACTCGACT CCCTGCAGGC CCTGGAGCTG CGCACCGCGG TCCGCGACGA ACTGGACGCC
GAACTCCCGC TCGAGGCGAT CCTCGGCGGC GCGACGCTCG CCGAGGTCAG CGCGACCCTG
GCCGGCTGA
 
Protein sequence
MSSPTLSRSG STPGLRPRID GAPLLPDGLP AFVLSADGAD SLAAAAAQLR RYLTEHPQVP 
LPAVAAALVA TRDLRRHRAI VHTDDRGELV TALDALAADP VRPVGTAVPG LHTGVAAAGA
PAFVFPGQGS QRRGMGALFH RESAVYRDTV EKIHAIALEV FGSSARDYLL GTGEWEQGGR
PVPVEVVQPA IFMQALGLAA MWRAAGIEPR IHVGHSQGEI AAAVAAGTVG LADGLRLVTR
RALAVRDNAP TGHSMAVLGT DRERCAAMLA RTVGFAELSV VNSAHVLCVS GERDVVAGLV
AQATERGIFA REIRVEYPAH TSLVGSMWHL REKWMGVMDH PAFLPTEHLL IGGTLGEAVP
ADTDFLDYWF WNLKNPVRFD LATRAALDAG ADRLIELAEH PTLQLALHEN IADAGARATV
TGSSRRDATD LSEFSSAVAD VLVTHARDLD VQRTLVAEEL PPGFPAAPLS RQRLWAALPG
RPATAGPVRP RTRVLDTVWT DLDAPVSAPP RPLAIIDPTG AHADLAAALL DAAARYGTPA
RRSDRAGDDE IAVVLVPGSA ETDTTIAAVG ELLADRRWWA GVQPAAGIAA VTAGAVVADP
ADPGPDGAAA AIAVGFRAFG ADLPGVEVRH LDLDPRADAA AQAGTAIHAL HVAGEPRLAL
RSGAVRAERW VDAEPTETGT EPPAGLLETA RNVVISGGTG HLGLAFAAHA AAHGAASVTL
LSRSGGGTTV RHALARIARR HPACAVTVVP CDVTDPAAVA TALAGAGRAI DLVVHAAVGY
RRCAATDLDA SDFTAAAAAK VGGLRTLAQA VPDATLLTCS SAAAALPGAG QAWYAASNTL
AEAEAAALRR AGRRAAAVRW GLWEQAGPLD EAGFAAVTAA GVIPLAAPDA LAALARSTGP
EPVITAVDLP RLRDVAAAFG AAALLTDLTE DTTPAVPAPS DAVPGTAPDA PATVTVTTDA
APGPAPGGGA DVAAVLRHHL ARALAVPADT LDPDVALVAL GLDSLQALEL RTAVRDELDA
ELPLEAILGG ATLAEVSATL AG