Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tcur_0446 |
Symbol | |
ID | 8601743 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermomonospora curvata DSM 43183 |
Kingdom | Bacteria |
Replicon accession | NC_013510 |
Strand | + |
Start bp | 509387 |
End bp | 510445 |
Gene Length | 1059 bp |
Protein Length | 352 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003298082 |
Protein GI | 269124712 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 42 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGACCA AGTTGCTGGC CATCGCCTCG GTGACGGGTC TGGCCGTCGC ACTCAGCGGC TGCGGCGGCG GTGACTCGGG CGGGTCCGAC TCGACGCTCA CGGTGACCAT GTGGGGCGGC GCTGCACAGA AGGCCCACGT CGACTCCTAC TTCACCCCGT GGGCCGAGGC CAACGGCGTC ACCATCAAGC AGGACTCGCC GACCGACTAC GCCAAGATCA AGGCGCAGGT GGAGGCGGGC AAGGTCACCT GGGGCCTCAC CGAGGTGGAG CCGAACTTCG CCAACACCGC CTGCGAGTCG GGCCTGCTGG AGAAGCTCCC GCAGGACATC ATCGACAAGG CCAAGGCCTC GGGCGTCGCC GAAGAGCAGA TCGGCGAGTG CGCCATCCCG ATCCTGCAGT ACTCCTTCAC GATCGCCTAC AACACCAAGA CCTTCTCCGG CGATCACCCC AAGACCTGGG CCGAGTTCTT CGACACCCGG CGCTTCCCCG GCAAGCGGGG CTTTTGGAAG TACGCCACCG GCGGCATGTT CGAGGCCGCG CTGCTGGCCG ACGGCGTCAA GCCCGACGAG CTCTACCCGC TCGACATCGA CCGGGCCTTC AAGAAGCTGG AGACGATCAA AAAGGACATC GTCTTCTACG ACACCGGCGA CCAGATGACC CAGATGCTGG CCAGCGGCGA AGCCCCGCTG GTGCAGGCCT GGAGCGGCCG GATCTACCAG GCCCGGCAGG AAGGGGAGAA GGTCGCCAAC GAGTGGAACG AGAACCTGGT CTCCTACGAC CAGATCGCCG TTCCCAAGGG CTACCCGAAC AAGGAACTGG CCTTCCAGTG GATGCGCTGG TTCCTGGACA ACCCCAAGGC CCAGGCCGCC GATGCCGACG CCTCGATCTA CGGCCCGGCC AGTGAGAACG CCCTCCAGTA CGTCTCCGCC GACGTCCGCA AGGAACTGCC CGGCCACCCG GCCAACGCCG AGAAGGCCAT CGGCCTGGTG AACTACGACT ACTGGGCCGA GCACTACGAC AGCGTCACCG AGCGTCTGAA CGCCTGGATC GCCCAATGA
|
Protein sequence | MKTKLLAIAS VTGLAVALSG CGGGDSGGSD STLTVTMWGG AAQKAHVDSY FTPWAEANGV TIKQDSPTDY AKIKAQVEAG KVTWGLTEVE PNFANTACES GLLEKLPQDI IDKAKASGVA EEQIGECAIP ILQYSFTIAY NTKTFSGDHP KTWAEFFDTR RFPGKRGFWK YATGGMFEAA LLADGVKPDE LYPLDIDRAF KKLETIKKDI VFYDTGDQMT QMLASGEAPL VQAWSGRIYQ ARQEGEKVAN EWNENLVSYD QIAVPKGYPN KELAFQWMRW FLDNPKAQAA DADASIYGPA SENALQYVSA DVRKELPGHP ANAEKAIGLV NYDYWAEHYD SVTERLNAWI AQ
|
| |