Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_3412 |
Symbol | |
ID | 9157587 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | + |
Start bp | 3500636 |
End bp | 3502936 |
Gene Length | 2301 bp |
Protein Length | 766 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | TAP domain protein |
Protein accession | YP_003648335 |
Protein GI | 296141092 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTCGCTC GGAAATCTCC CTTCGCCCGC GGATCCGGCC GACCGGCCGG CCTCGCGCTC ACCGTCGCCG TGGTCGTCAC CCTCGCCCCC GGCGTCACCT CGTTCACCGC GTCGGCCACC ACCACCAGCA CGGGGACGAG TTCCGTCGCC TGGGGCGACT GCCCCAGCGA CGTCAAGGTG GTCAACACGG ACGGTGCCGT CGCGAAGTGC ACCACCATCC GCGTGCCGGT GGACTACCAG GATCCCGCGG CGGGATCCTT CGACATCCTC GCGACCGGGT TCATCCCCGC CGGCACCCCC AAGGGCGTGA TCTTCGGCAA CCCGGGCGGG CCGGGAGGTT CGGCACTGGA CTTCTGGGGC GACACGTTCG CCACCCCGCG CGAGCTGTTC CGCGACTACA TCCTGATCGG GCTGCAGCCG CGCGGCCTGC TGCACGGCAC CAGCCTGCAG TGCAGCACGG CGCAGACCTG CACCGACCTC GCCGACGACG CGTACAAGAA GACGTTGACC ACCGAGAACA TCGCCCGCGA TATGGACGAA CTCCGCAAAG CGCTGGGCGT CGAGAAGATC GACTACTACG GCGCCTCCTG GGGCACCGAA CTCGGTGCCG CGTACGCCAC CCTGTTCCCG CGCAACGTCG ACAAGATGAT CCTGGACTCG AACATCGACC CCGTGACGCG CGGCGTCGGG GCCGGCGAGC TCGCGAACGC CCGCGCCGAA GAGGTCTTCG AGCGACGCCT GTACGACTTC TTCGACTGGG CGGCGGCGAA CAACTCCGTC TTCGGACTCG GCGACACTCC GTACAAGGTG TACTCGGCCT GGAACAGCGC CCGGCTCGTC GAGGACCCGA AGTCGTGGGT CTTCAACTAC CTTCCCCCGG AACTGCGCGC GAACGATCTG CCGCCGGCGA TGCGCGCCGA TGCGAACGCG ATCCTTCCCG ATCTCAACGC CACGATCCGG GACACGTCCA AGCGTGACTT CGCCGCGCGC CTCGCGGCCG CCGCGAGCCG GGGCGAAGTC ACGAGCACAG CCGTGGCCGG TGGCTACCCG GAGAGCCCAC TCCTGGTGAA CGCCACGACC TTCATGTACA CCCGCCGGAA CTGGTCGCTG TACGCCGATC AGATCGATCT CGTGCTGCGG CCCATCGATA CCAAGGCGCT GGCGCGGCTC GCGGACCCCA CAACGATCAA TCCGGCGGCA ACGGCGCTCT TGCGCGGCGT CGGCGCCCTC TCCCCGTACG TCGGGCACGC CGGGACGACT GCCGCGGAGG GCCTGTCGGC GACACAGGGA GTGGCGGTCA ACGCCAGCTC GGCCGAGCTC GGGCGCACGG GGAGCGTCCG GGACACCCCT GATCCGAAGT TCGTGACATT GCGCGACGAA GCATTGCAGG ATCTCGAGCG GGCGCGCGCC GCCGGTGAGT CGACCCCCAC CCTCCGGGCC AAGGCGGCGA TCGCGCGCGC CTTCACCTAT ATGAGCTGGG GCTACGTGAA TCCGGCACCC TGGTCGGGCG ACGGTCTCAC CACCGCGCCC CTGCTCCTGC AGAGCCTGAC CGACCCCGCC ACCGGTGGCG GTGAGAGTCT CGCCGAGGCA CTCGGCGGCC ACCTGATCAC CGTCGAGGGC GGGGATCACG GAGTGTTCCG CATCGGCAAC ACCACCGTGG ACCGCGCGGT GCTCGACTAC CTGGCCAGCG GCACCACTGA CATCACCCGT GCACCCGAGA CGGCCGTCAC CGCGGTGGAC TACCTCAAGC CCGTGCGCGA CTTCGTCACC AGGATGCGGA TCGCCTCCCC GCAGGTCACG GGTCCGACCA TCCCCGATAC CGACCTCAAT CGCCTGCTGT CCACTCCCGG CCCCAACGTC GACGCCGTCT CGGTGGTGCC CCGGCCGGTG TTCGCGCCTC GGGAGACACC CGCGACTGAC CCTGAGCCCG CACCGGCCGC ACTCGCCGCC GGTATCACGG CGGACGTGGA TCCGGACGGT GCCGATGCAC GCCCGGCATC CACCGCGACC AGCAGCATTC CGGAGTCCGC GGTGGCGACC CAAAGCTCCA GCCCTGCAAC GCCCTCCGAC CACGCGGCCG GTCGCGCACC GGAGACGACT GCCGCGGATC CGTCCACGGC GAAGCCGGAA CCGAGCACGA CCGAGGATTC CAATCCCGCC GTCGGCGCGA CCGAGCAGAC CACGCCGTCG CCGAGTTCCG CGACCGCACC TACCGAACCG GCAACGACCG CCGAAACACG TGCCGCTCCC GGCCCGGAGT CGAGCGGAGC GGGTGCGGCG AATGCGGCTC CCGCCGCGTA G
|
Protein sequence | MFARKSPFAR GSGRPAGLAL TVAVVVTLAP GVTSFTASAT TTSTGTSSVA WGDCPSDVKV VNTDGAVAKC TTIRVPVDYQ DPAAGSFDIL ATGFIPAGTP KGVIFGNPGG PGGSALDFWG DTFATPRELF RDYILIGLQP RGLLHGTSLQ CSTAQTCTDL ADDAYKKTLT TENIARDMDE LRKALGVEKI DYYGASWGTE LGAAYATLFP RNVDKMILDS NIDPVTRGVG AGELANARAE EVFERRLYDF FDWAAANNSV FGLGDTPYKV YSAWNSARLV EDPKSWVFNY LPPELRANDL PPAMRADANA ILPDLNATIR DTSKRDFAAR LAAAASRGEV TSTAVAGGYP ESPLLVNATT FMYTRRNWSL YADQIDLVLR PIDTKALARL ADPTTINPAA TALLRGVGAL SPYVGHAGTT AAEGLSATQG VAVNASSAEL GRTGSVRDTP DPKFVTLRDE ALQDLERARA AGESTPTLRA KAAIARAFTY MSWGYVNPAP WSGDGLTTAP LLLQSLTDPA TGGGESLAEA LGGHLITVEG GDHGVFRIGN TTVDRAVLDY LASGTTDITR APETAVTAVD YLKPVRDFVT RMRIASPQVT GPTIPDTDLN RLLSTPGPNV DAVSVVPRPV FAPRETPATD PEPAPAALAA GITADVDPDG ADARPASTAT SSIPESAVAT QSSSPATPSD HAAGRAPETT AADPSTAKPE PSTTEDSNPA VGATEQTTPS PSSATAPTEP ATTAETRAAP GPESSGAGAA NAAPAA
|
| |