Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tcur_3377 |
Symbol | |
ID | 8604723 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermomonospora curvata DSM 43183 |
Kingdom | Bacteria |
Replicon accession | NC_013510 |
Strand | - |
Start bp | 3894483 |
End bp | 3895727 |
Gene Length | 1245 bp |
Protein Length | 414 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003300953 |
Protein GI | 269127583 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0107966 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCAGT CTGGTCCCGT GGACCGAGTG TCGCGGGCGC TGCGCGGGCT GAGCGGCCGG CACCGGCTCG CCGCGGCCCT GGCGGTCCTC CTGGTGGTGG CGCTGATCGT GGTCGCGGCC GGACGAGACG GCCGGAACGC GCCCCCCGGC ACGGCGCGGG CCGCGCGCAC GCTCGGACCC GGTGAGGGCG CGCTCAACCT GGTGGTCTGG CCCGGTCTGG CGGAACGCGG CGGCAGCGCC GAGCACGTCG ACTGGGTCAC CCCCTTCGAA GAGCGCACCA GCTGCAAGGT CTCTCTCAAA CAGGTGTCCA CCGTGCAGGA GATGGTGGAC CTGATGTCCG ACCCCGACCG GCGCTATGAC GGCGTCTCGG CGCCCCCCGA GGTCGCCGGG CTGCTGATCG ACGGCGGCCA TGCGGCACCG GTGAACCCGA ACCTGGTCGA GGGCTACAAG CGCCTGGAGT CCCGGCTGCG CTCGCTGCTC AGGCGGGATA AGACGGTCTA CGGGGTGCCC TATGTGTGGG GCGCGAACCT GCTCATGTAC GACCAGCGGG CCGTGCAGCC GCCTCCCAGC GGCTGGGCGG ACCTGTTCGA CCCGCAGGAG GCCGGGCGGT ACAGCGGCAG GCTGATCATG CGGGACACGC CGCTGGCGCT GGCCGAGGCC GCGCTGTACC TGCGTTCGAA GAAGAAGTCG CTGGACATCA CCGACCCCTA TGCGCTGACC CCCGAGCAGC TCGACGCGGC GGCCCAGGTG GTGCGCCGCC AGCGTCCCCA CGTCAGGGCC TACTGGAGTG AACCGGCCGA TGCGGTCAGC GCGTTCGCCG CGGGCGAGGC GGTCATCGGC CAGGTCTCCT CCTACCAGCT GGACGTGCTG AGCAGGGCGG GGCGCCCGGT GGCCGGCATC GAGCCCCGGG AGGGCGTGAC CGGCTGGGTC AACTCCTGGC TGATCGGCGC CCGGGCCGAG CACCCCAACT GCATGTACCA GTGGCTGAAG TGGACGGTCT CCCCGGACGT GCAGCGGCAG GTCGCCCAGT GGGCCGGGGT GGCGCCGGCC AACCCGCAGG CGTGCGAGGG CGACCGGCTC AGCGCATCCT TCTGCGCCAC CTACCGGGTC GGCGACGGGG ACTTCCTGAA GAAGGTGATC TTCGCCCGGA CGCCCACCGA GGACTGCGGC GGCGAGGGGC ACGGGTGCAC CGACTACGCC GAGTGGGTCC GGGCCTGGCA GGAGTCCCGC GGCACGGCCC ACTGA
|
Protein sequence | MSQSGPVDRV SRALRGLSGR HRLAAALAVL LVVALIVVAA GRDGRNAPPG TARAARTLGP GEGALNLVVW PGLAERGGSA EHVDWVTPFE ERTSCKVSLK QVSTVQEMVD LMSDPDRRYD GVSAPPEVAG LLIDGGHAAP VNPNLVEGYK RLESRLRSLL RRDKTVYGVP YVWGANLLMY DQRAVQPPPS GWADLFDPQE AGRYSGRLIM RDTPLALAEA ALYLRSKKKS LDITDPYALT PEQLDAAAQV VRRQRPHVRA YWSEPADAVS AFAAGEAVIG QVSSYQLDVL SRAGRPVAGI EPREGVTGWV NSWLIGARAE HPNCMYQWLK WTVSPDVQRQ VAQWAGVAPA NPQACEGDRL SASFCATYRV GDGDFLKKVI FARTPTEDCG GEGHGCTDYA EWVRAWQESR GTAH
|
| |