Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tcur_2325 |
Symbol | |
ID | 8603662 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermomonospora curvata DSM 43183 |
Kingdom | Bacteria |
Replicon accession | NC_013510 |
Strand | + |
Start bp | 2716313 |
End bp | 2717647 |
Gene Length | 1335 bp |
Protein Length | 444 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003299929 |
Protein GI | 269126559 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0032194 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGGGACG GAAGACGTTC GATGGGCCGG CGGCGGCTGC TGGGCGCCGT CCTGGCCGCC GGGCTGGCGG TCTCCACCGC CGCGGCCTGC GGCGGCGGAG ACGACTCCGG CTCCGGCGAC GGCGCCCTGA AGGGCCAGAA ACTCACCGTG GCGGCGATCT GGACCGGCTC CAACGAGGGC GCCAAGTTCC AGAAGGTCCT CGACGAGTTC GAAAAGCGCA CCGGCGCCGA GGTCACCTAC ACCCCCACCG GCGACAACGT CGCCGCCTAC CTCGGCTCCA AGGTGGAGGG CAACGCGCCG CCGGACGTGG CGTTCCTGCC GCAGCAGGGC GTGCTGGTGG AGTTCGCCGA AAAGGGCTGG ATCAAGCCGC TGCCGGATGA GGCCAAGTCC CTGGTGCAGC AGAACTTCGC CAAGGTCTGG CAGGACCTCG GCTCCCACGA GGGCACCGTC TACGGCGTCT ACTTCAAGGC CTCCAGCAAG TCCACCGTCT GGTACCGCAA GCAGGCCTTC GCCGACGCCG GCATCACCAC GCCCCCCGCC TCCTGGGACG ACTTCGTCAA GACCGCCCAG ACCCTCTCCG ACGCCGGCAC CACCCCCATC TCCGTCGGCG CCGCCGACGG CTGGGTGCTG ACCGACTGGT TCGAGAACGT GTACCTGTCG GTGGCCGGGC CGGAGAAGTA CGACCAGCTC TCCAAGCACG AGATCAAGTA CACCGACCCC ACGGTCAAGG AGGCGCTGCG CAAGCTGGCC GAGATCTGGG GCAAGGACTC CTACCTGGCA GGCGGCGCCA AGGGCGCGCT GCAGACCGAG TTCCCCGCCT CGGTGCCGGC GGTGTTCGGC GATGAGCCCA AGGCCGCCAT GCTGCCCGGC GCCGACTTCG TCGCCTCCGA GATCACCTCG GCCACCAACT CCAAGGTCGG CACCGACGCC GACTTCTTCC CCTTCCCCGC CGCCGGCAGC ACCACTCCCG TGGTGGGCGC CGGTGACGTG GCGGTGATGA TGAAGGACAC CCCCGCGGCG CGGGAGCTGA TGAAGTTCCT GGCCACCCCC GAGGCCGCCA AGGTCTGGGC CGAGCAGGGC GGGTTCATCT CCCCCAACAA GGCGCTGGAC CTGTCGGTCT ACCCCGACGA GGTGCAGCGG CGGATCGCCA AGGCGGTGAT CGACGCCGGT GACGCCTTCC GCTTCGACAT GTCCGACCTG GCCCCGGCCG CCTTCGGCGG CACCAAGGGC TCCGGCGAGT GGAAGATCCT GCAGGACTTC CTGGCCGACC CCGACGATGT GGACGGCACC GCCGAGAAGC TGGAGAAGGC CGCAGCCGCC GCCTTCAAGT CCTGA
|
Protein sequence | MRDGRRSMGR RRLLGAVLAA GLAVSTAAAC GGGDDSGSGD GALKGQKLTV AAIWTGSNEG AKFQKVLDEF EKRTGAEVTY TPTGDNVAAY LGSKVEGNAP PDVAFLPQQG VLVEFAEKGW IKPLPDEAKS LVQQNFAKVW QDLGSHEGTV YGVYFKASSK STVWYRKQAF ADAGITTPPA SWDDFVKTAQ TLSDAGTTPI SVGAADGWVL TDWFENVYLS VAGPEKYDQL SKHEIKYTDP TVKEALRKLA EIWGKDSYLA GGAKGALQTE FPASVPAVFG DEPKAAMLPG ADFVASEITS ATNSKVGTDA DFFPFPAAGS TTPVVGAGDV AVMMKDTPAA RELMKFLATP EAAKVWAEQG GFISPNKALD LSVYPDEVQR RIAKAVIDAG DAFRFDMSDL APAAFGGTKG SGEWKILQDF LADPDDVDGT AEKLEKAAAA AFKS
|
| |