Gene Information |
Locus tag | Tcur_2296 |
Symbol | |
ID | 8603633 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermomonospora curvata DSM 43183 |
Kingdom | Bacteria |
Replicon accession | NC_013510 |
Strand | + |
Start bp | 2679992 |
End bp | 2681746 |
Gene Length | 1755 bp |
Protein Length | 584 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003299900 |
Protein GI | 269126530 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
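The coordinate and length fields above are mutually consistent, and the relationship can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon (the listed gene sequence ends in TGA). The function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers introduced for illustration, not part of any database API.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return cds_length_bp // 3 - 1


length = gene_length_bp(2679992, 2681746)
print(length)                     # 1755
print(protein_length_aa(length))  # 584
```

Both results match the Gene Length (1755 bp) and Protein Length (584 aa) fields of this record.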
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0000797709 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAACTC CCAAAGTCTG GGCAGCGCTG ACGGCGGGCG CCGTCGCGCT CAGTCTGGGC CTGACCGCCT GCGGCGGCGG CGATGACGGC GACAGCGACA GCTCGGGCCT TAAGTTCGAC GCCGGGACCA ACGGCATCGT CAACCCGTCC GACAAAAAGG GCGGCACGCT GAAGGTGGGG ATGAGCGACG ACTTCGACTC CACCGACCCC GGCGACACCT ACTACGCCTT CGGCAACAAC TTCATCCGGC TGTACACCCG GACGCTGATG ACCTACACCT CCCAGCCGGG CGCTGCGGGC CTCAAGCCCT CGCCCGACCT GGCCGAGGCC CCGGGTGTGC CCAGCGACGA CAACAAGACC TGGACCTTCA AGCTCAAGAA GGGCCTGAAG TACGAAGACG GCACGGAGAT CAAGGCCGAG CACATCAAGT ACGCGGTCGC CCGCACCTAC GACCGCGGCG TCCTCGGCCA CGGCCCGGCC TACTTCCCCC AGCTGCTGGA CGCCGACGGC TACAAGGGCC CCTACAAGGA CAAGAACCTC GACAACTTCA AGGGCATCGA GACCCCCGAC GACTACACCC TGATCTTCAA GCTCAAGGAG CCCTTCCCCG AGTTCAACGA GCTGGTGACC TTCTCCGGCC AGACCGCCCC CGTGCCGCCG GACAAGGACA AGGGCGCCCA GTACCGGCTG CGTCCGCTCT CCTCCGGCCC CTACAAGTGG GAGGGCAACT ACCAGCCGAA GAAGGGCGGC GTGCTGGTCC GCAACGAGCA CTGGGACCCC AGCACCGACC CCAACCGCAA GGCGCTGCCG GACCGCATCG AGGTCATCGC CGGCATTGAG GCCAACGAGG TCGACAACCG CCTGATGAAC GGCGAGCTCC ACGTCGACCT GGCCGGCAGC GGCGTGCAGG ACGCCGCCCG GCAGAAGATC CTCACCAACC CGGACCTGAA GGCCAAGGCC GACAACCCGC TGGCCGGCTT CCACTGGTAC ATCCCGATCA ACCTCAAGAC CATCCCCAAC CTGGAGTGCC GCAAGGCGAT CGTGTACGCC GCCGACCGGG ACGCCATGTG GCGCGCCTAC GGCGGTGACG TCGGCGGCGA GCGGGCCACC TCCATCCAGC CGCCGAACAT CGCCGGCCGC CAGAAGGGCA CCGACTTCTA CACCTCCACC GCCCCCGGCT ACAAGGGCGA TGTGGACAAG GCCAAGGAGG CCCTGCAGAA GTGCGGCAAG CCCGACGGCT TCTCGACCAC CATGGTCTAC CGCAGCGACC GGCCCAAGGA GAAGGCCGTC GCCGAGGCCC TGGAGCAGTC GCTGGCCCGG GTCGGCATCA AGCTGACCCT CAAGGGCTAC CCGGCCGGCA CCTACACCGG TGAGCAGCTC GGCTCCCCGT CCTTCGTCAA GAAGGAGAAC ATCGGCCTGG GCACCTACGG CTGGGCGCCC GACTGGCCCA CCGGCTACGG CTACCTGCAG GCGCTCACCG ACGGCAAGGC GATCGTCGAG GCCGGCAACA CCAACGTCTC CGAGCTGGAC GACCCTGAGA TCAACAAGCT CTGGAACGAC GTGGTGAAGA TCACCGACGC CGCCGAGCGC GAGAAGATCT ACAACCGGAT CGACGAGAAG GCGCGCGAGC TGGCCGCCAT CCTGCCCAAC GTCTACGCCA AGTCCCTGCT GTACCGGCCG GAGACGCTGA CCAACGTCTA CTTCCACCAG GGCTTCGGCA TGTACGACTA CGCCAACCTC GGTGTGACCG GCTGA
|
Protein sequence | MKTPKVWAAL TAGAVALSLG LTACGGGDDG DSDSSGLKFD AGTNGIVNPS DKKGGTLKVG MSDDFDSTDP GDTYYAFGNN FIRLYTRTLM TYTSQPGAAG LKPSPDLAEA PGVPSDDNKT WTFKLKKGLK YEDGTEIKAE HIKYAVARTY DRGVLGHGPA YFPQLLDADG YKGPYKDKNL DNFKGIETPD DYTLIFKLKE PFPEFNELVT FSGQTAPVPP DKDKGAQYRL RPLSSGPYKW EGNYQPKKGG VLVRNEHWDP STDPNRKALP DRIEVIAGIE ANEVDNRLMN GELHVDLAGS GVQDAARQKI LTNPDLKAKA DNPLAGFHWY IPINLKTIPN LECRKAIVYA ADRDAMWRAY GGDVGGERAT SIQPPNIAGR QKGTDFYTST APGYKGDVDK AKEALQKCGK PDGFSTTMVY RSDRPKEKAV AEALEQSLAR VGIKLTLKGY PAGTYTGEQL GSPSFVKKEN IGLGTYGWAP DWPTGYGYLQ ALTDGKAIVE AGNTNVSELD DPEINKLWND VVKITDAAER EKIYNRIDEK ARELAAILPN VYAKSLLYRP ETLTNVYFHQ GFGMYDYANL GVTG
|
| |
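The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any computation. The following is a minimal sketch of how the gene sequence could be cleaned, its GC fraction computed, and its translation compared with the listed protein. It assumes that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons, which does not matter here because this gene starts with ATG). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing used on this page and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate a DNA sequence codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three ten-base blocks of the gene sequence listed above.
prefix = "ATGAAAACTC CCAAAGTCTG GGCAGCGCTG"
print(translate(prefix))  # MKTPKVWAAL
```

The translated prefix matches the first block of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the whole protein and the GC content field to be checked the same way.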