Gene Information |
Locus tag | Tpau_1199 |
Symbol | |
ID | 9155339 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | + |
Start bp | 1224809 |
End bp | 1226260 |
Gene Length | 1452 bp |
Protein Length | 483 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003646169 |
Protein GI | 296138926 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.420226 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGCACCG GACCGAGATC AGGGTTCGCC CCTGGGCGTA CCGCGAAGCT GGCTGCGGCC GGAATCGCCG CCACCATGGG GGTGACGATC CTGGCGGGCT GCGGGGCCAA GGAGGACGGC GTCACCCTCA ACGTGTACGC GCCCGCCGAC GGCGCCACCC TCGTCAAAGA GGTGGCCGGA GACTGTTCGA CCAGCGGGTA CACCGTGGTC GGCCACGCCC TTCCCAAGAG CGCCGACGAT CAGCGCCTGC AACTGGCCCG TCGGGTGACC GGTAACGACC GCACCATGGA CCTGATGGGG CTGGACGTGA ACTGGACCGC GGAGTTCGCG GAGGCGGGCT GGATTCTCCC GCTGCCCGAG AATCTGACCC GGACGGCGGA GAAGACCGTG CTCGCGGGCC CGCTCAAGAC CGCCATGTGG CAGGACAAGC AGTACGCGGC GCCGGCCTGG ACCAACACCC AACTGCTCTG GTACCGCAAG GACGCGCTGG AAAAGGTGCT CGGCCGCAAG ATCGGCCCCG GTGTCCCCAA GCTCACCTGG GACCAGGTGG TGCAATACGC GGAGAAATCC GGTCAACTCG GTGGCCCGAC ACAGATCGAG GCCCAGGCCG CGCAGTACGA GGGCGTGGTG GTGTGGTTCA ACTCGCTGCT CGAAAGCGCC GGTGGCCGGA TGGTGGCCGA CGACGGAAAG ACGGTGACGC TCACCGACAC CCCGGAGCAC CGCGCCGCCA CGGTCAAGGC GCTCTCGATC ATGAAGGCCG TGGCCACTGC GCCCGGCCGC GATCCATCGT TCACCCAGCT CAAAGAGGGC GAGTCGCGCC TGGCGATGGA GTCGGGCAAG GCGATCTTCC AAGTCAACTG GCCCTTCGTT TTCGCGGGCG TCAAGCAGAA CGCCGCGGCG GGCTCGGTGC CGTTCCTGCC CGAGCTCACC AAGTACGACG CGTTGCTCAA TCCTCCCAAG GACGAGAAGA ATCCGCCCGA GCCGACGGTC GCGCAGCTGG GCGAGATCAA CAACCTGACC CGGCAGAAAT TCGACTTCGC CCCCTTCCCA TCGGTGATCC CCGGTAAGCC CGCGAAGACC ACCGTGGGCG GCATCAATTT CGCCGTCTCG AAGACCACCC GGTACGAGAA GCAGGCCTTC GAGGCGCTCG CCTGCCTCAC GAACGAGGCC GCCGAGCGGA AGTACGCCGT CAAGGGCGGT ACCCCACCGG TCCTGCCGAA GCTCTATGAC GATCCCGAGT TCCGCAAGGC CTACCCGATG GCCACCCTGA TCCGGGACCA ATTGCAGGAC AACACCGCCG CGGTGCGGCC GATCACACCC CAGTATCAGG CGATGTCCAC GCTGCTCCAG GCCACGCTCG CCCCGGTGGG GGCGTGGGAT CCCGAACAGC TCGCGGACCG GCTCGCTGAT GCGGCTGAGA AGGCCATGAA TGGAAAGGGC CTGGTGCCAT GA
Protein sequence | MSTGPRSGFA PGRTAKLAAA GIAATMGVTI LAGCGAKEDG VTLNVYAPAD GATLVKEVAG DCSTSGYTVV GHALPKSADD QRLQLARRVT GNDRTMDLMG LDVNWTAEFA EAGWILPLPE NLTRTAEKTV LAGPLKTAMW QDKQYAAPAW TNTQLLWYRK DALEKVLGRK IGPGVPKLTW DQVVQYAEKS GQLGGPTQIE AQAAQYEGVV VWFNSLLESA GGRMVADDGK TVTLTDTPEH RAATVKALSI MKAVATAPGR DPSFTQLKEG ESRLAMESGK AIFQVNWPFV FAGVKQNAAA GSVPFLPELT KYDALLNPPK DEKNPPEPTV AQLGEINNLT RQKFDFAPFP SVIPGKPAKT TVGGINFAVS KTTRYEKQAF EALACLTNEA AERKYAVKGG TPPVLPKLYD DPEFRKAYPM ATLIRDQLQD NTAAVRPITP QYQAMSTLLQ ATLAPVGAWD PEQLADRLAD AAEKAMNGKG LVP
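The length fields in this record can be cross-checked against its coordinates. The sketch below is illustrative only and rests on two assumptions that the record does not state but that are consistent with its values: the start and end positions are 1-based and inclusive, and the gene length counts the terminal stop codon (the gene sequence ends in TGA). The function names `gene_length`, `protein_length`, and `gc_fraction` are hypothetical helpers, not part of any database API.

```python
# Coordinates copied from the record (Tpau_1199 on NC_014158).
START_BP = 1224809
END_BP = 1226260


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the length
    includes one terminal stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases; whitespace between blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


length = gene_length(START_BP, END_BP)
print(length)                  # 1452, matching "Gene Length"
print(protein_length(length))  # 483, matching "Protein Length"
```

Passing the full "Gene sequence" field to `gc_fraction` should give a value comparable to the listed GC content, since the function strips the spaces between the 10-base blocks before counting.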