Gene Tpau_3412 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpau_3412 
Symbol 
ID9157587 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTsukamurella paurometabola DSM 20162 
KingdomBacteria 
Replicon accessionNC_014158 
Strand
Start bp3500636 
End bp3502936 
Gene Length2301 bp 
Protein Length766 aa 
Translation table11 
GC content70% 
IMG OID 
ProductTAP domain protein 
Protein accessionYP_003648335 
Protein GI296141092 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTCGCTC GGAAATCTCC CTTCGCCCGC GGATCCGGCC GACCGGCCGG CCTCGCGCTC 
ACCGTCGCCG TGGTCGTCAC CCTCGCCCCC GGCGTCACCT CGTTCACCGC GTCGGCCACC
ACCACCAGCA CGGGGACGAG TTCCGTCGCC TGGGGCGACT GCCCCAGCGA CGTCAAGGTG
GTCAACACGG ACGGTGCCGT CGCGAAGTGC ACCACCATCC GCGTGCCGGT GGACTACCAG
GATCCCGCGG CGGGATCCTT CGACATCCTC GCGACCGGGT TCATCCCCGC CGGCACCCCC
AAGGGCGTGA TCTTCGGCAA CCCGGGCGGG CCGGGAGGTT CGGCACTGGA CTTCTGGGGC
GACACGTTCG CCACCCCGCG CGAGCTGTTC CGCGACTACA TCCTGATCGG GCTGCAGCCG
CGCGGCCTGC TGCACGGCAC CAGCCTGCAG TGCAGCACGG CGCAGACCTG CACCGACCTC
GCCGACGACG CGTACAAGAA GACGTTGACC ACCGAGAACA TCGCCCGCGA TATGGACGAA
CTCCGCAAAG CGCTGGGCGT CGAGAAGATC GACTACTACG GCGCCTCCTG GGGCACCGAA
CTCGGTGCCG CGTACGCCAC CCTGTTCCCG CGCAACGTCG ACAAGATGAT CCTGGACTCG
AACATCGACC CCGTGACGCG CGGCGTCGGG GCCGGCGAGC TCGCGAACGC CCGCGCCGAA
GAGGTCTTCG AGCGACGCCT GTACGACTTC TTCGACTGGG CGGCGGCGAA CAACTCCGTC
TTCGGACTCG GCGACACTCC GTACAAGGTG TACTCGGCCT GGAACAGCGC CCGGCTCGTC
GAGGACCCGA AGTCGTGGGT CTTCAACTAC CTTCCCCCGG AACTGCGCGC GAACGATCTG
CCGCCGGCGA TGCGCGCCGA TGCGAACGCG ATCCTTCCCG ATCTCAACGC CACGATCCGG
GACACGTCCA AGCGTGACTT CGCCGCGCGC CTCGCGGCCG CCGCGAGCCG GGGCGAAGTC
ACGAGCACAG CCGTGGCCGG TGGCTACCCG GAGAGCCCAC TCCTGGTGAA CGCCACGACC
TTCATGTACA CCCGCCGGAA CTGGTCGCTG TACGCCGATC AGATCGATCT CGTGCTGCGG
CCCATCGATA CCAAGGCGCT GGCGCGGCTC GCGGACCCCA CAACGATCAA TCCGGCGGCA
ACGGCGCTCT TGCGCGGCGT CGGCGCCCTC TCCCCGTACG TCGGGCACGC CGGGACGACT
GCCGCGGAGG GCCTGTCGGC GACACAGGGA GTGGCGGTCA ACGCCAGCTC GGCCGAGCTC
GGGCGCACGG GGAGCGTCCG GGACACCCCT GATCCGAAGT TCGTGACATT GCGCGACGAA
GCATTGCAGG ATCTCGAGCG GGCGCGCGCC GCCGGTGAGT CGACCCCCAC CCTCCGGGCC
AAGGCGGCGA TCGCGCGCGC CTTCACCTAT ATGAGCTGGG GCTACGTGAA TCCGGCACCC
TGGTCGGGCG ACGGTCTCAC CACCGCGCCC CTGCTCCTGC AGAGCCTGAC CGACCCCGCC
ACCGGTGGCG GTGAGAGTCT CGCCGAGGCA CTCGGCGGCC ACCTGATCAC CGTCGAGGGC
GGGGATCACG GAGTGTTCCG CATCGGCAAC ACCACCGTGG ACCGCGCGGT GCTCGACTAC
CTGGCCAGCG GCACCACTGA CATCACCCGT GCACCCGAGA CGGCCGTCAC CGCGGTGGAC
TACCTCAAGC CCGTGCGCGA CTTCGTCACC AGGATGCGGA TCGCCTCCCC GCAGGTCACG
GGTCCGACCA TCCCCGATAC CGACCTCAAT CGCCTGCTGT CCACTCCCGG CCCCAACGTC
GACGCCGTCT CGGTGGTGCC CCGGCCGGTG TTCGCGCCTC GGGAGACACC CGCGACTGAC
CCTGAGCCCG CACCGGCCGC ACTCGCCGCC GGTATCACGG CGGACGTGGA TCCGGACGGT
GCCGATGCAC GCCCGGCATC CACCGCGACC AGCAGCATTC CGGAGTCCGC GGTGGCGACC
CAAAGCTCCA GCCCTGCAAC GCCCTCCGAC CACGCGGCCG GTCGCGCACC GGAGACGACT
GCCGCGGATC CGTCCACGGC GAAGCCGGAA CCGAGCACGA CCGAGGATTC CAATCCCGCC
GTCGGCGCGA CCGAGCAGAC CACGCCGTCG CCGAGTTCCG CGACCGCACC TACCGAACCG
GCAACGACCG CCGAAACACG TGCCGCTCCC GGCCCGGAGT CGAGCGGAGC GGGTGCGGCG
AATGCGGCTC CCGCCGCGTA G
 
Protein sequence
MFARKSPFAR GSGRPAGLAL TVAVVVTLAP GVTSFTASAT TTSTGTSSVA WGDCPSDVKV 
VNTDGAVAKC TTIRVPVDYQ DPAAGSFDIL ATGFIPAGTP KGVIFGNPGG PGGSALDFWG
DTFATPRELF RDYILIGLQP RGLLHGTSLQ CSTAQTCTDL ADDAYKKTLT TENIARDMDE
LRKALGVEKI DYYGASWGTE LGAAYATLFP RNVDKMILDS NIDPVTRGVG AGELANARAE
EVFERRLYDF FDWAAANNSV FGLGDTPYKV YSAWNSARLV EDPKSWVFNY LPPELRANDL
PPAMRADANA ILPDLNATIR DTSKRDFAAR LAAAASRGEV TSTAVAGGYP ESPLLVNATT
FMYTRRNWSL YADQIDLVLR PIDTKALARL ADPTTINPAA TALLRGVGAL SPYVGHAGTT
AAEGLSATQG VAVNASSAEL GRTGSVRDTP DPKFVTLRDE ALQDLERARA AGESTPTLRA
KAAIARAFTY MSWGYVNPAP WSGDGLTTAP LLLQSLTDPA TGGGESLAEA LGGHLITVEG
GDHGVFRIGN TTVDRAVLDY LASGTTDITR APETAVTAVD YLKPVRDFVT RMRIASPQVT
GPTIPDTDLN RLLSTPGPNV DAVSVVPRPV FAPRETPATD PEPAPAALAA GITADVDPDG
ADARPASTAT SSIPESAVAT QSSSPATPSD HAAGRAPETT AADPSTAKPE PSTTEDSNPA
VGATEQTTPS PSSATAPTEP ATTAETRAAP GPESSGAGAA NAAPAA