Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_2050 |
Symbol | |
ID | 9156205 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | - |
Start bp | 2135065 |
End bp | 2136060 |
Gene Length | 996 bp |
Protein Length | 331 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | |
Product | extracellular solute-binding protein family 3 |
Protein accession | YP_003647001 |
Protein GI | 296139758 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCACCC GACATCGTCT GGCCGCTCTC GGCATCGCAT TCGCCCTGAC GGGCGGGACC CTCGCGGGCT GCGGCGGTCA GAGCACCGAG TCGACCACGG CGAGCGTCAC CGGCGCCGCC GGACAGCAGC TCAATCTCAC CGCGAATCAG GACCGCGTCA TCCCGGTGAA GGACGAGGCC GCGGCGGCCC TGCTCCCCCA GCCAGTCCGC GACCGCGGCA CCCTGCGAGT GGGGCACGGC TCAGGAGCGG GCAACGCTCC CTTCTACTTC CCGGCCACCG ACGACGAGCG CGTCTTCATC GGCGACGAGA GCGACATAGC CCACCTCATC GCAGGTGCAC TCGGGTTGAA ACTGGAGGAG TCGAACACTT CATGGGAGGG CCTGTTCCTC GGCATCGATA GCGGCCAATT CGACCTGGGG GTCTCGAACA TCACCGTGAC CGAGGCGCGC AAGGAGAAGT ACGACTTCGC CACCTACCGC AAGGACGACC TGGCAGTCGA GGTCCAGGAA TCCTCCTCGC TCACCTTCAC CGGGCCGGAT TCGCTGTCGG GTAAGACCGT CGCGGTGGGC TCGGGCACGA ATCAGGAGCG CATCCTGCAA CGCTACGACG CCGATCTCCG CGCGGCCGGC AAGCCTGCGA TCACGATCAA GAACTACCAG GACAACACCG GCACGTACGC CGCACTGGCG TCGGGCCAGA TCGACGCCTA CTTCGGGCCG AACCCGATCG CTGCTTTCCA TGTCGCACAG ACCAAGGGAA CGCCGCAGGC CACGAAGATC GTCGGTCGAG TCTCGGGTGC CGGATCCTCA CTGCAGGGAC TGATCGGTGC CACAACGAAG AAGGACAGCG GCATCATCCG CGCCGTGCAA GCGGCGCTCC AGTCCACGAT CCGCAGTGGC GACTACACGA AGGTCTTGCA CCGGTGGGGG CTCGATGCCG AGGCGGTACC CTCCTCCGAG ATCAATCCAC CGGGACTGCC GAAGAACGAC AAATAG
|
Protein sequence | MTTRHRLAAL GIAFALTGGT LAGCGGQSTE STTASVTGAA GQQLNLTANQ DRVIPVKDEA AAALLPQPVR DRGTLRVGHG SGAGNAPFYF PATDDERVFI GDESDIAHLI AGALGLKLEE SNTSWEGLFL GIDSGQFDLG VSNITVTEAR KEKYDFATYR KDDLAVEVQE SSSLTFTGPD SLSGKTVAVG SGTNQERILQ RYDADLRAAG KPAITIKNYQ DNTGTYAALA SGQIDAYFGP NPIAAFHVAQ TKGTPQATKI VGRVSGAGSS LQGLIGATTK KDSGIIRAVQ AALQSTIRSG DYTKVLHRWG LDAEAVPSSE INPPGLPKND K
|
| |