Gene Information |
Locus tag | Tpau_1145 |
Symbol | |
ID | 9155285 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | + |
Start bp | 1174606 |
End bp | 1176474 |
Gene Length | 1869 bp |
Protein Length | 622 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003646115 |
Protein GI | 296138872 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
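The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the stop codon is counted in Gene Length but not in Protein Length; the function names `gene_length` and `protein_length` are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length_bp // 3 - 1


# Values copied from the record for locus tag Tpau_1145.
start_bp, end_bp = 1174606, 1176474
length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1869 622
```

Under these assumptions the computed values agree with the Gene Length (1869 bp) and Protein Length (622 aa) rows.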
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0845067 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCGGCGAC GGTCCATGGG GCTGACGGTG ATGTCGGCTC TGCTCGCGGC GACGCTCACC GGGTGCGTGG CGGATCCGCC CCCCGCCGTC CCCGGATCGG AGACCTCGGC CCCGCCGCAG GCTCCCGTGG TCAGCGGCGC CACCATCCTG GTCGCGATCG ACGGCCTCGG CGCCGGCTTC AACCCGCATC TCGCTTCCAA CAGCTCGCCC GCCGCGCGAG CGCTGGCCTC GATGACATTG CCCAGCGTGT TCCGCGAGGT CAAGCGGCCG GACGGTCGCG TCGAACTGGA ACAGAACACC GACCTGGTGC CGACCATCGA GCATCCTGCG CCGAAGCAGA TCGTGTACAC GATCCGGCAG GATGCGCAGT GGTCCGACGG AGCTCCCATC GCCGCCGAGG ACTTCGTCTA CCTGTGGGAG TCGATGACCA CCGCCCCTGA CACCGTGGGT TCGGCCCCGT ACCGGCAGAT CGAATCGGTG CAGTCCGGTG CCGGCGGCAA GCGGGTGATC GTCACCCTCA AGGACGATCT ACCCGGCTGG CGCACCCTCT TTCAGAATCT GCTCCCCAGT CACCTGGTGA AGGGATCACC CGGCGGGTTC AACGGCGCCA TGCGCGACCG GATCCCCGCC TCGGGCGCGG GCTACCTGGT CAAGAAGGTC GACGTGGCCC GCGATGAGGT GACCCTCGAA CGTAACGATC GCTATTGGAT GAACCCGGCT CGCGTCGAGC GCATCGTGCT CCGCAAGGGC GGCAGCGATG CCCAACTCGC CAATTCCGTA CGCACCGGCG ACGTCCAGGT GGTCGCGGTG CACGGCGGGC CGTCGTTGCT GGCCCAGCTG ACGTCGATCT ACCAGGTGCG CGCCGCCGAA CAGCCCGCGC CGCGCACGCT TTCGCTCACC CTCAACACCC GCTCGCCGTT CCTGTCCGAC GTGGCCGTGC GGCGGGCCCT GCTCGAGGCC GTCAACGTCG ATCTGCTGAC CACCGTCGGG TCCAGCGGCG ACACCGTCCG GCCCGCCCGC GCGCAGGTAC TCGCACCGTC GGATCCCGGC TACTCGCCCA CCATGCCCGT CCCGCCCGCG CCGGCCGAGG CGAAGCGGCG ACTGGAACAG GCCCTCGGCG CCGCCGGATA CACCTTGCAG GACGCGCCGA TCCCGCCCAC CCAGAGCGCA CCGCCCACCC CGGCGCCGAC ACCCGTCACC CCGGCCACAC CCGCGCCCGC GACCAGTTCC GACCCGTCGG CGCTGCCCGT CCCGCCGTCG CCCTCGCCCG GCCAGAACGT GCAGCAGTAC GTGCGCGACG GTGTGCCCAT GTTCTTGCGG ATCGGCGTGC CGCAGGGCAA CCAGGAGGCC TTCGCGGTCG CGAGTAATGC CACCGATCAA CTGCGTTCGA TGGGAGTGTT CGCCTCCGTC GTGATCCGGG AACCGCAGGT GCTCACCGGC ACTGCCCTGC TCGACGGCTC GGTCGATGCG GTGGTCGGCT GGTCCGGCCT GGCGCCCACC CCCGTCGAGC GGCTCGCTAG CCGCGTCGAA TGCCCGCCGC CGCCGAGCTC CTCCAGCGAT GCCACGCCGA CCGCCACCCC CAGCGCGCGG CCCAGCATCA CCACCGACCC GCAGGCGGTC ACCGTCGGCG GCAACCTGGC CGGACTGTGT GATCCCGCGC TCCAGGGACT GGCCCTGGAC GGCCTGCGCG GCGGCCCCAT CGATCTCGGA GCGATCGACG CCGACCTGTG GTCCAAGGCC ACCGTCCTAC CGATCCTGCA GGACGTGACA CTGTCCGCGA CCGGCCCCAC CGTGCAGGGC 
GTCACCCTCG ACGGCCCGCC GGTGGACGGA GTGTTCACCG GCGTCGACCG GTGGGAGCGC ACCAAGTGA
|
Protein sequence | MRRRSMGLTV MSALLAATLT GCVADPPPAV PGSETSAPPQ APVVSGATIL VAIDGLGAGF NPHLASNSSP AARALASMTL PSVFREVKRP DGRVELEQNT DLVPTIEHPA PKQIVYTIRQ DAQWSDGAPI AAEDFVYLWE SMTTAPDTVG SAPYRQIESV QSGAGGKRVI VTLKDDLPGW RTLFQNLLPS HLVKGSPGGF NGAMRDRIPA SGAGYLVKKV DVARDEVTLE RNDRYWMNPA RVERIVLRKG GSDAQLANSV RTGDVQVVAV HGGPSLLAQL TSIYQVRAAE QPAPRTLSLT LNTRSPFLSD VAVRRALLEA VNVDLLTTVG SSGDTVRPAR AQVLAPSDPG YSPTMPVPPA PAEAKRRLEQ ALGAAGYTLQ DAPIPPTQSA PPTPAPTPVT PATPAPATSS DPSALPVPPS PSPGQNVQQY VRDGVPMFLR IGVPQGNQEA FAVASNATDQ LRSMGVFASV VIREPQVLTG TALLDGSVDA VVGWSGLAPT PVERLASRVE CPPPPSSSSD ATPTATPSAR PSITTDPQAV TVGGNLAGLC DPALQGLALD GLRGGPIDLG AIDADLWSKA TVLPILQDVT LSATGPTVQG VTLDGPPVDG VFTGVDRWER TK
|
| |
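The GC content and protein sequence fields can be derived from the gene sequence. The sketch below is an illustration only, not the method the database itself uses. It assumes the standard amino acid assignments shared by translation tables 1 and 11, and it assumes that the initial GTG codon is read as methionine because it acts as a start codon under table 11. The helper names `clean`, `gc_fraction`, and `translate_cds` are hypothetical, and only the first 30 bases of the gene sequence are used as input.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (first, second, third base each cycling T, C, A, G).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spacing between 10-base groups and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate_cds(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon.

    Assumption: the first codon is a start codon and is read as M,
    whatever amino acid it would encode inside the reading frame.
    """
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append("M" if i == 0 else aa)
    return "".join(residues)


# First 30 bases of the gene sequence in the record.
prefix = "GTGCGGCGAC GGTCCATGGG GCTGACGGTG"
print(translate_cds(prefix))  # MRRRSMGLTV
print(round(gc_fraction(prefix), 3))  # 0.733
```

The translated prefix matches the first ten residues of the protein sequence in the record. Passing the full gene sequence to the same functions would allow the reported GC content and the complete protein sequence to be compared in the same way.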