Gene Information |
Locus tag | Tpau_3915 |
Symbol | |
ID | 9158096 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | - |
Start bp | 4030907 |
End bp | 4032538 |
Gene Length | 1632 bp |
Protein Length | 543 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003648826 |
Protein GI | 296141583 |
COG category | |
COG ID | |
TIGRFAM ID | |
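The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (which matches the reported gene length) and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


# Values copied from the record for Tpau_3915.
start_bp, end_bp = 4030907, 4032538

length = gene_length_bp(start_bp, end_bp)
print(length)                     # 1632, matching "Gene Length"
print(protein_length_aa(length))  # 543, matching "Protein Length"
```

Because the gene lies on the minus strand, the start and end values are positions on the replicon rather than the direction of transcription. The length calculation is the same for either strand.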
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTGAGAA GACGAAGTAG GTCGACGGCG ACGCGGGCGA TCCTCGCGAT CGCGGTGGCC GCGAGCCTGG CGGTATCGGG CTGCAGCATG CAGGGGGCGC GTTCGCAGAA CACGGTGGGC CCCGACGCGA TCATCCGGGT GAACGGCGGC GAACCGCAGA ACGGGTTGAT CCCGTCGGAC ACCAGCGAGA ACATGGGCGG CCGCGTGGTC GACTCCCTAT TCACCGGTCT GTACAGCTAC AAGGCCGACG GCACGCCCGA GTTGGCGAAC GCCGAGTCGG TCGAGACCAC CGACAACAAG CGCTTCCTGG TCAAGCTCAA GCGGGACTGG AAGTTCACCG ACGGCACGCC GGTCAAGGCG GAGAACTACG TGCGTGCCTG GAACTTCGGT GCCGCAGTGG GCAATCTGCA GAAGCAGCAG AGCTTCTACG CTCCGATCGC GGGTTTCGAC GAGGTGTCCG AGAAGGGTGC CACGAAGACC GAGATGCGCG GCCTGCAGGT GATCGACGAC TACACCTTCT CGATCGAACT GGCGGCGCCG AACATCGACT TCAAGCTTGC GTTGGGGTTC ACACCGTTCG TTCCGCTCCC CGATGTCTTC TTCACCGAGG GTAAGGAGAA GTTCGGCCAG AACCCGGTGG GCAACGGGCC GTACAAACTC AAGCAGTGGC GGCACAACGT GCAGCTCGAG GTGGTGCGGT ACGAGGACTA CAAGGGTCCG AAGCCGAAGA ACGGCGGGCT CACCTTCATC ATGTACGAGT CCTACGATCC CGCGTACATC GACCTCACGT CCGGGAATCT CGACGCGCTC GACAACATCC CGAACAGCGC GCTGCGCTCG TTCCAGAAGA CGCTGGGCAA GAAGGCGATC ATCAAGTCGA CCGCGCAGAC CCAGAACTTC GTGATCCCGC AGTTCCTGGA GCACTTCGGC AGCGACGAGG AGGGCCGCCT GCGCCGGCAG GCGATCTCCA TGTCGTTCGA CCGGCAGCAG ATCATCGATG TGGTGTTCCA GGGCTTCCGG AATCCCGCGC TCGAGTTCAC CGCTCGCTCG ATCCCCGGGT GGGACGGAAA CATCCCCGGC AATGGCAACG TCAAGTACAA CCCGGAGCTG GCCAAGCAGC GCTGGGCGCA GGCGAACGCG ATCAAGCCGT GGACCGGATC GTTCACCATC GCCTACAACT CCGACGGCGA TCACAAGCAG TGGATCGATG CGGTGACCAA CCAGATCAAG AACACGTTGG GCATCGATGC GGCGGGCAAG CCCTACGCGA CGTTCAAGCA GATCCGCGAT GAGCTCACCA AGAAGACGAT CAAGTCCGCG GGGCGCAGCG GTTGGCAGGG CGACTACCCG ACGCAGCTCA ACTTCTTGGA GTCCAACTAC CTCACTGGCG CTGGGTCGAA TGACGGCGAC TACAGCAATC CCGCGTTCGA CGCCAAGATC GCCGAGGCGC AGCAGGCGCT CGACCCGGTG CAGTCGACCA GGCTCGTCAA CGAGGCACAG GCGATCCTTC TCAACGACTT GCCCGTGGTC CCGCTGTGGG ACTACAAGGC CGCGGCCGGC GTCGGCGACG GTGTCAAGGG TGATCTCACC TGGAACGGCC GCTTCGACTT CACCAACATC ACGAAGGAGT AG
|
Protein sequence | MVRRRSRSTA TRAILAIAVA ASLAVSGCSM QGARSQNTVG PDAIIRVNGG EPQNGLIPSD TSENMGGRVV DSLFTGLYSY KADGTPELAN AESVETTDNK RFLVKLKRDW KFTDGTPVKA ENYVRAWNFG AAVGNLQKQQ SFYAPIAGFD EVSEKGATKT EMRGLQVIDD YTFSIELAAP NIDFKLALGF TPFVPLPDVF FTEGKEKFGQ NPVGNGPYKL KQWRHNVQLE VVRYEDYKGP KPKNGGLTFI MYESYDPAYI DLTSGNLDAL DNIPNSALRS FQKTLGKKAI IKSTAQTQNF VIPQFLEHFG SDEEGRLRRQ AISMSFDRQQ IIDVVFQGFR NPALEFTARS IPGWDGNIPG NGNVKYNPEL AKQRWAQANA IKPWTGSFTI AYNSDGDHKQ WIDAVTNQIK NTLGIDAAGK PYATFKQIRD ELTKKTIKSA GRSGWQGDYP TQLNFLESNY LTGAGSNDGD YSNPAFDAKI AEAQQALDPV QSTRLVNEAQ AILLNDLPVV PLWDYKAAAG VGDGVKGDLT WNGRFDFTNI TKE
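The GC content reported in the gene table can in principle be recomputed from the gene sequence. This is a minimal sketch, assuming the sequence is pasted as shown on this page, in whitespace-separated blocks of ten bases. `gc_fraction` is a hypothetical helper, and the short string is only the first two blocks of the gene, used to show the call.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases, ignoring whitespace between blocks."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First two 10-base blocks of the gene sequence, for illustration only.
first_blocks = "ATGGTGAGAA GACGAAGTAG"
print(f"{gc_fraction(first_blocks):.0%}")
```

Running the same function on the full gene sequence string gives the value to compare with the reported percentage.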