Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tcur_4973 |
Symbol | |
ID | 8606339 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermomonospora curvata DSM 43183 |
Kingdom | Bacteria |
Replicon accession | NC_013510 |
Strand | - |
Start bp | 5626566 |
End bp | 5628239 |
Gene Length | 1674 bp |
Protein Length | 557 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003302525 |
Protein GI | 269129155 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 49 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGGACG GCTCTTGGAC GGGGTCACGG ATGCGGGCGG CCGGAGGCTG CGCGGCGGCG CTCGTGCTGG CCGGTGCGCT GACCGGGTGC GGCGCCGCGA TGGACGAGGG GTACGGGCCG GGAAAGCCGG GCGGCGTGCT CCGCATCGTC GGTGACGCCG ACGTCGAGCG CCTCGACCCC GCCGCCGCGA GCGAGCCTCC CGCTTACGCG CTGAACCGCG TCCTGGCGCG GACGCTTTTC ACCACCAGGG CGTCCAACAA CTTCGAAGAG AGCCTGCCGG TCCAGGCCGA CATGGCCGAG GCGGTCCCCT CCAAGGAGAA CGGCGGGATC GGCAAGGGCG GCAGGAAGTA CACCGTCCGG CTGCGCGGGA ACGTCCACTG GGACACCAGC CCGCCCCGGC CGGTCGTGGC CGGGGATTTC GTGCGCGGTT TCAAGCGGAT GTGCAACCCG GCGGCGCCGT CGCCCCACCG CGCGCACTAC ATCGCCACGA TCAAGGGGAT GAAGGCGTTC TGCGACGGCT ACGCCAAGGT CGACGCCGAC GACGCCGAGG CGATGGCCGC CTACCAGAAC GGGCACTCGA TCTCGGGACT GCGGGCCGCC GACGAGACCA CGCTGGTGTT CGAGCTGACC CACCCGGCCG CCGACTTCCT CAACATCCTC ACCCTGGCTC CGGCGGTGGC GGCGCCCAAG GAGTACGACC GGTACGTGCC CGACAGCCGC GAGTTCCGGC AGAACATCGT CTCCAACGGC CCCTACCGGA TCGCCTCTTA TGAGCCGGGC CGCTCCTACG TCCTGGAGCA CAACCCCGCC TGGCGGGCCG AGACCGACCC GATGCGGGAA CGGTTCGCCG ACCGCATCCA GATCACCCTC GGCGTGGGCT CCGCGCAGGA GGTGCGGCGC CGCATCGAGC GGGGCGAGGC CGACCTGTCG TGGGACCGGC CGGTCCCGGC CGCCGACATC GAGGAGCTGA GCGGCACGGC GGGCTTTGCG CTCCGCCAGC TCCCCGGCCG CGGCCCGTAC CTGGTGGCGG CCGGCGTGGC GGACCACCGG GTGCGGCGGG CCCTGCAGTA CACGGTGAAC CGGACGGCCG TGATCGAGGC TCTGGGCGGG CACGAGGCGG CGCGCCCGCA GCACACCCTG ATCGCCCCGG GCAACGCCGG GTACTTCGAA CACAACGCCT ACCCCACGCC CGACGACGCC GGAGACCCCG GCAAGTGCCG CGAGCTGCTC GCCGAGGCCG GCCACCGCGG CGGGCTGCGG CTGACGCTCG ACCCCGGGGA CGAGCCGGAG AAGGTCGTCC GGGCCGTCCG AAAGAGCCTG TCGGACTGTG ACATCGAAGT CACGATCCGT AAGAGCGGGG CGGAGCTGAA ACTGGTCTCC CCCGACCCCG AGTGGTTCGG CCTCAACGGC CGCTCGGCCC TGGCGCCGCT GCTCGACGGG CACGGCGACC CCCAGATCCG GCAGCTCCTC CAGGAGGCCC TGCGAACCCC CGACACCGCC CGCGCCACCG GTCTGTGGAA CCAGCTCGAC CGCCTGGTCA TGCAGGACGC CTCGATCGTG CCGCTGGCCG ACCGGGCCTT CCCCATCCAG CACTCCGCCC GGGTGCGCGG CACCCGGTTC CTGCCGCAGG CCAGGACCTA CGACTACAGC CGCATCTGGC TCACCGAGGA GTGA
|
Protein sequence | MADGSWTGSR MRAAGGCAAA LVLAGALTGC GAAMDEGYGP GKPGGVLRIV GDADVERLDP AAASEPPAYA LNRVLARTLF TTRASNNFEE SLPVQADMAE AVPSKENGGI GKGGRKYTVR LRGNVHWDTS PPRPVVAGDF VRGFKRMCNP AAPSPHRAHY IATIKGMKAF CDGYAKVDAD DAEAMAAYQN GHSISGLRAA DETTLVFELT HPAADFLNIL TLAPAVAAPK EYDRYVPDSR EFRQNIVSNG PYRIASYEPG RSYVLEHNPA WRAETDPMRE RFADRIQITL GVGSAQEVRR RIERGEADLS WDRPVPAADI EELSGTAGFA LRQLPGRGPY LVAAGVADHR VRRALQYTVN RTAVIEALGG HEAARPQHTL IAPGNAGYFE HNAYPTPDDA GDPGKCRELL AEAGHRGGLR LTLDPGDEPE KVVRAVRKSL SDCDIEVTIR KSGAELKLVS PDPEWFGLNG RSALAPLLDG HGDPQIRQLL QEALRTPDTA RATGLWNQLD RLVMQDASIV PLADRAFPIQ HSARVRGTRF LPQARTYDYS RIWLTEE
|
| |