Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tcur_4716 |
Symbol | |
ID | 8606078 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermomonospora curvata DSM 43183 |
Kingdom | Bacteria |
Replicon accession | NC_013510 |
Strand | - |
Start bp | 5342238 |
End bp | 5343839 |
Gene Length | 1602 bp |
Protein Length | 533 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003302275 |
Protein GI | 269128905 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCAGCA AGAGGTGGCT GCGCGGCGCC GCCGGGGCGC TCGCGGTCGT GGCGCTGACG GCGGCATGCA CCTCGGGCGG AAGCGGCGAC GGCCAGAGCA AGGCCCAGCC CCAGGCCACC TACAAGGTCG GCATGATCGG CACCGACTCC GGCGGCACCC CGGTCAAGGG CGGGACGCTG ACGTTCAGCA CCTACACCGA GGCCGGCCTG CTCGACCCGG CGCAGACCAT CGTCGCCGGC TCCACCGGCG GCATCGAGAT GGCCGCGGTC TACGACGTGC TGCTGCGCTG GGACAGCGCC TCCGGCGAGG TCGTCCCGCA ACTGGCCAAG GACATGAAGG CCAGTGACGA CAACAAGACC TGGACGCTCA CCCTGCGCGA GGGCGTGAAG TTCAGCGACG GCAAGCCGCT GGACGCCGAG GCCGTCAAGT GGAGCATCGA GCGTTACGTC GACAAGGGCG GCGACGACGC GATGCTGTGG AAGCGCAACG TCGAGGCCGT CGAGACGCCC GACGACCTGA CCGTGGTCTT CAAGCTCAAG AAGGGCTGGG CCGACTTCGA CTTCATGCTC ACCGGCGGCC TCGGCATGAT CGTCGCCAAG TCCGCCGACG CAGGCAAGGA GTTCAAGCCG GTCGGCGCGG GCGCCTTCAC CTTCGGCAGC TACAAGCCGA AGGAGGAGAT GATCCTCAAC GCCCGCGAGG ACTACTGGGA CGGGCGGCCC AACCTCGACA AGATCCGCAT CACCTACATC GCCGACCCCA ACGCCGCGCT GGACACCCTC AAGTCCGGCC AGACCCAGGT CGGCTTCTTC CGTGACCCGC TGGTCGTCCA GAAGGCCCTG GACGCCGGCT ACCACGGCTT CATGAACATG GTGGCGCTCG GCAACGTCGC CGTCATCAAC GCCGCCAAGG GCCGGCCCGG CGCCGACGCG CGGGTGCGGC AGGCGATGTT CCACGCCATC AACCCGGACG TGATCTACCA GCGCGCCTAC TCGGGCGTCC TGCAGGGCAG CTCGGAGATC TTCCCGAGCT TCTCGCGCTG GCACACCGGC ACCAAGCCGC TGGCCTACGA CCCCGACAAG GCCAAGCAGC TGCTGGAGCA GGCCAAGGCC GACGGCTTCG ACGGCAAGAT CAAGTACCTG GACGCCCAGG ACCCGGCCTC CCAGGCCACC GCGTTGGCGA TCAAGGCGAT GCTGGAGAGC GTCGGCTTCC AGGTGGAGCT GGACCTGGTG CGCACCCCCA CCGACCAGAT CACCCGGGTG GCGGCCAAGC GCGACTACGA CATGTCGGGC TGGGGCCTGA GCTGGCGCGA GGCCGGCCCC TACGGCCGGA TGTTCACCAC CTTGCACAGC GAGGGCAACG CCACGGTCGG CATGGCGACC ACCCCCGAGA TGGACGCCCT GATCGAGGAG TTCCAGGGCG CCGCCACCCC TGAGGAGCAG CGCGCCGTCA TGGCACGCAT CCAGGAGCAG TGGAACAAGG ACGTCCCCGC CCTGGTGTTC GGCCCGATGC CGGAGTTCGT GGCCTGGCCG AAGACCGTGC ACGGCGTCGA AGGCACCGTC AACAGCATGA TCCTGCTGGA CGACGCCTGG ATCGGCCAGT AA
|
Protein sequence | MSSKRWLRGA AGALAVVALT AACTSGGSGD GQSKAQPQAT YKVGMIGTDS GGTPVKGGTL TFSTYTEAGL LDPAQTIVAG STGGIEMAAV YDVLLRWDSA SGEVVPQLAK DMKASDDNKT WTLTLREGVK FSDGKPLDAE AVKWSIERYV DKGGDDAMLW KRNVEAVETP DDLTVVFKLK KGWADFDFML TGGLGMIVAK SADAGKEFKP VGAGAFTFGS YKPKEEMILN AREDYWDGRP NLDKIRITYI ADPNAALDTL KSGQTQVGFF RDPLVVQKAL DAGYHGFMNM VALGNVAVIN AAKGRPGADA RVRQAMFHAI NPDVIYQRAY SGVLQGSSEI FPSFSRWHTG TKPLAYDPDK AKQLLEQAKA DGFDGKIKYL DAQDPASQAT ALAIKAMLES VGFQVELDLV RTPTDQITRV AAKRDYDMSG WGLSWREAGP YGRMFTTLHS EGNATVGMAT TPEMDALIEE FQGAATPEEQ RAVMARIQEQ WNKDVPALVF GPMPEFVAWP KTVHGVEGTV NSMILLDDAW IGQ
|
| |