Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_1639 |
Symbol | |
ID | 9155789 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | + |
Start bp | 1712084 |
End bp | 1713475 |
Gene Length | 1392 bp |
Protein Length | 463 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003646599 |
Protein GI | 296139356 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.102178 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTAGGA CGATGAACAA GACGTTTTCT CGATTCGTGA CAGCGGTCGC CGCGGTATCC GTGGTGACCT CCACTGCAGG GTGCGGTTCC GGTGGCACGG CGCCCTCGAA CGTGATCAAT TACTGGCTGT GGGATAACAA TCAACAACCC GTCTATCAAA AGTGCGCTGA CGCTTTCGAA GCGGCGCACC CGGGCCGGAA AGTCGCGATC ACCCAGTACG GCTGGAGCAG CTACTTCACG AAGCTCGCAT CGGGCTTCAT CGCCGACACC GGACCGGACG TCTTCACCGA TCACGTGTCC AAGCTCGGAC AGCACCTCGA CCTCGAGGTG CTCCAGCCCC TGGACGAACT CGCCGCCACC AAGGGGATCA AGGACGAGGA CTACCTTCCT CAGCTCGCCG CGCTGTGGAA GGGGCCCGAC GGTCGGCGGT ACGGCGTGCC CAAGGACTGG GACACCGTCG CGTACTTCTA CAACAAGGAC GCCACCGCCG CCGCAGGCGT GACCGACGCC GAGCTCCAGA GCATGACCTG GAACCCCGAC GACGGAGGCA CACTCGAGAA GGTCCTCGCC CGGCTCACCG TCGATGAGAA GGGTGTACGG GGTGACCAGC CCGGCTTCGA CAAGACGCGA GTGAAATCGT ACGGTCTCGC GGGGACGGAT TCGGGCTACG GCGGATTCGG GCAATCGCAG TGGTCGCCGT ACACCGGTTC GATCGGGTGG AACTTCACCG ACAAGAATCC CTGGGGCGCG AGGTTCAACT TCGATGACCC CAAGGTCCAG AAGACGATCG ACTGGTACTT CGGGTTGGCG AAGAAGGGGT TCATGGCCCC CTTCGCGGTG GCCGGTAACA ACACGTCCGG GATCGGTGCC GACAAGCAGA TGAGCGCGGG CAACGCCGCC ATGGCGCTCG CCGGATCCTT CATGATCTCG TCCTACTTCA AGCTCGTCGA TCCGCAGGGC AAGCCCCTCC CGATCGGTTT GGCGCCCACG CCCGTCGGGC CGTCCGGGAA ACGGGCGTCG ATGTTCAACG GACTCGCGGA CGTCGTCTCG AAGCAGTCGA AGAATCCGGA GCTTGCGGGG GAGTGGGTGG CGTTCTTGGG AAGTGACGCG TGCCAGGACA TCGTCGGGGA CTCCGGTGCG GTTTTCCCCG CCCGGCCCAA CGGTATGACC ATCGCGAAGC AGCGCCAGGC GGCGGCCGGT GTGGACATCA CCCCGTTCAC GATGCATGTG GACGACGGCA CGACATTCAC GATTCCGGTG ACCACCGATG CCGCCGACAT CGTCCCGCTC ATGCAGTCTG CGTTCGACCC GATCTATCTG GGATCGGCGT CGGGATCGTC GCTGTCCACG CTGAATCGGC AGATGAACAG GCTGCTCGAG AGCAACAGCT GA
|
Protein sequence | MSRTMNKTFS RFVTAVAAVS VVTSTAGCGS GGTAPSNVIN YWLWDNNQQP VYQKCADAFE AAHPGRKVAI TQYGWSSYFT KLASGFIADT GPDVFTDHVS KLGQHLDLEV LQPLDELAAT KGIKDEDYLP QLAALWKGPD GRRYGVPKDW DTVAYFYNKD ATAAAGVTDA ELQSMTWNPD DGGTLEKVLA RLTVDEKGVR GDQPGFDKTR VKSYGLAGTD SGYGGFGQSQ WSPYTGSIGW NFTDKNPWGA RFNFDDPKVQ KTIDWYFGLA KKGFMAPFAV AGNNTSGIGA DKQMSAGNAA MALAGSFMIS SYFKLVDPQG KPLPIGLAPT PVGPSGKRAS MFNGLADVVS KQSKNPELAG EWVAFLGSDA CQDIVGDSGA VFPARPNGMT IAKQRQAAAG VDITPFTMHV DDGTTFTIPV TTDAADIVPL MQSAFDPIYL GSASGSSLST LNRQMNRLLE SNS
|
| |