Gene Tpau_3706 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpau_3706 
Symbol 
ID9157886 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTsukamurella paurometabola DSM 20162 
KingdomBacteria 
Replicon accessionNC_014158 
Strand
Start bp3825245 
End bp3826612 
Gene Length1368 bp 
Protein Length455 aa 
Translation table11 
GC content73% 
IMG OID 
Productpeptidase M20 
Protein accessionYP_003648623 
Protein GI296141380 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.114919 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCGAGA CCAGCGTTCG AGACATCGTC CGCGGACTTC TCCCGCAGGC CCGCGCCGAT 
CTGGCGGAAC TGGTCGCGCT GCCCTCCGTG CACGCCTCTC CCGAATTCGG CGAGGAACCG
AACCGGGCCG CCGCGCACTG GGTCGCCGAC GCCTTCGGCG GCGCCGGGAT CGAGAACATC
GAGCAGATCT CGACATCGGA CGGATCGATC GCGGTGGTCG GCCACACCCC GGCGCCCGCC
GGAGCCAAGA CCGTGCTCCT GTACAGCCAC TTCGATGTGC AGCCGCCCGG CCCCCGCGAG
CAGTGGGAAT CCGATCCGTT CACCCTCACC TCGCGCCCGG GCCCGGGTGC CGGGGCCGAG
CGCTGGTACG GGCGCGGCGC CGCCGACTGC AAGGGCAACC TGGTCGCGCA CCTCACCGCC
CTGCGCGCCG TGCGGGAGGC CACGGGCGGC CTGCCGGTGG GGGTACGGGT CATCGTCGAG
GGGTCCGAAG AGGGCGGCGG CGAGGGCCTC GACGATCTGA TCGCGCAGCG TCCCGACCTC
GCCCACGCCG ATCTCATCCT CATCGCCGAC ACCGGCAACG TCGCTGTGGG ACGTCCGACG
CTGACCACCT CGCTGCGGGG GGTCGCGAGC GTGCGGGTCG AACTCACCAC CGGCCGATCC
GACCTGCACT CGGGCCAATT CGGCGGCGCC GCCCCCGATG CGCTCGCGGC ACTGATCGCG
CTGCTGGCGA CGCTGCGCGA TGAACGCGGG AACACCACGA TCGACGGCCT CGACACCTCC
GCCCGCTGGG CCGGCGAGCC CTACGACGAG GCTGCCTTCC GCGCTGACGC CGCGCTTGTC
GACGGCACCG AGATACTCGG CTCCGGCCTG ATCGGCGATC AACTCTGGGC GCGACCGGCC
GTCACCGTGA TCGGCCTCGA CGCCCCCGCC ACCGCCACTG CGGCAGCAGC GATCGCGCCC
CGCGCCGCCG CGCTGCTGAA CCTGCGGGTA CCGCCGGGCA CCGATCCCCG CGCCGCGGGC
GACCTGCTGG TCGCGCACCT GAAGGCGCAC ACGCCGTGGG GTGCGCACGT CGACGCCGAG
GTGGAGTCCA CCGGCGAGCC CTTCGCCGCC GACACCACCG GCCCCGGCTA CGACGCCCTG
CGCGCCGCCC TCACCGAGGC CTACGACGGT GCCGAGGTGG TCACCAGTGG CCAGGGCGGT
TCGATCCCGC TGTGTACGCG GTTGCGCAAG GCCGCGCCGT CCGCCGAGAT CGCGCTGCTC
GGCGTCGAGG AGCCGCTGTG CCGGATCCAC GCACCCAACG AATCGGTCGA CCCCCGCGAA
CTGGAGCGGA CCGCGCTCGC CGAGGCGATC CTGCTGACCT CGCTGTGA
 
Protein sequence
MTETSVRDIV RGLLPQARAD LAELVALPSV HASPEFGEEP NRAAAHWVAD AFGGAGIENI 
EQISTSDGSI AVVGHTPAPA GAKTVLLYSH FDVQPPGPRE QWESDPFTLT SRPGPGAGAE
RWYGRGAADC KGNLVAHLTA LRAVREATGG LPVGVRVIVE GSEEGGGEGL DDLIAQRPDL
AHADLILIAD TGNVAVGRPT LTTSLRGVAS VRVELTTGRS DLHSGQFGGA APDALAALIA
LLATLRDERG NTTIDGLDTS ARWAGEPYDE AAFRADAALV DGTEILGSGL IGDQLWARPA
VTVIGLDAPA TATAAAAIAP RAAALLNLRV PPGTDPRAAG DLLVAHLKAH TPWGAHVDAE
VESTGEPFAA DTTGPGYDAL RAALTEAYDG AEVVTSGQGG SIPLCTRLRK AAPSAEIALL
GVEEPLCRIH APNESVDPRE LERTALAEAI LLTSL