Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tcur_3539 |
Symbol | |
ID | 8604890 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermomonospora curvata DSM 43183 |
Kingdom | Bacteria |
Replicon accession | NC_013510 |
Strand | - |
Start bp | 4060432 |
End bp | 4061766 |
Gene Length | 1335 bp |
Protein Length | 444 aa |
Translation table | 11 |
GC content | 78% |
IMG OID | |
Product | cellulose-binding family II |
Protein accession | YP_003301112 |
Protein GI | 269127742 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0054545 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGACGGAGT TGAGCACCGA CGAACCGGGC TACGTCCCGC CCGACCACGA GACGACCGTG GAGATCCCGC TGCCCGGCAA GCCGCCGGCC GGACATCCCG GCCAGGCCGA GGACACCGTG GCCGACCACG GGCTCCCCGA TGGCCGGGAC GAGGCGGAGC AGACCCTCGC AGACCCGCCC GGCCCGCGGG CCGGCGGCGC CGCGCCGGAT GCGACGGTGC GGGATCCGCA GGACCCCTGG GCGCCTGCGC GCCATGACCT CCCAGGCGAT TCCGGGGAAC GCAATGCGGA GGCCGGACAG GGAGAGTGGA CCGAGCTGTT CGGCAGCGAG GACGCCCGCC GGGAGGCGGC TGCGCCCCAG CCCTCGGAAC GGCCCGCGGA ACCGGCCGCC ACGGCCTCAC TCAGCCCGCT CGCCGCGGCG GCCGTGCCTC CCGGTGCCAC CGTGCCCGAC CGCGCGGACG CCTCCCGCCC GGAGCCGCAG CCGGATGCAC CGGATTCCAC CGGTGCCCGG CCGTTGCCGC CGCCCATCGC CTCCCATGAA CAGGCTTCCC CCGCGCCGGA CGCGCAGGCC CCGGCGACCG ATACGGCCCC CGCCGTGCTT CCGCTTCCCG ACCGCCCCGT CGCCTCGGCG AGGCCGGTGA ACGCCCCCGG CGCGCAGCGG CCGCCCGCGG GCGGCCGGCG CGGCTCACGG GCACCGCTGG CCGTGGCGGC GGCCCTGGTG CTGCTCTTCG CCGTGGTCGC CGGGGTGTCG GCGCTCACCC TGATGCGCGG CGGGAAGGAC GGCGAGGCGA CCACCGCCAA GCCCCCGGCC GGCGGCGCCT CCAGCGCGCC CGGTGAGGAC GGGTCCGGAG GGACCGGCGC GCCGGCGGGA GAGGCGCCAC CGGGAGCGGG CGGCTCCGGC GTCCCCGCAC CGGCGCAGGA CGCCCCGGCG CCCGGGGCGT CACCGCAGGA CGGCGCGCCG GCCCCCGGGC GGGCGCCGGC CGACCCCACG CCGCCGCCTC GCGACCCCAT CGGCCCGGTG CTGCGCGGCA AAGGGCTGAC CTACCAGCTC GTCCAGCACG ATCCCGGCTA CTACGAGGGA CTGCTGATCA TCACCAACCA CGGTGCCGAG CCCATGCGGG AGTGGACGAT CACCTTCGAG ACGCCCGGCG CCGACGTCAA GCACGTCTGG GGCGGTGAGC TGGTGCGCGG CGGCGACCGC GTGCAGATCC GCAGCCTGGA CGGCGCTCCG CAGATCCCGC CGGGCGGCAC CTGGGAGGTC CGCTTCGGCG CCGCCGGCAG CCCGGTCGAG CCGAGGAAAT GCCGCTTCAA CGACCGCGAG TGCGGCCTGG AGTGA
|
Protein sequence | MTELSTDEPG YVPPDHETTV EIPLPGKPPA GHPGQAEDTV ADHGLPDGRD EAEQTLADPP GPRAGGAAPD ATVRDPQDPW APARHDLPGD SGERNAEAGQ GEWTELFGSE DARREAAAPQ PSERPAEPAA TASLSPLAAA AVPPGATVPD RADASRPEPQ PDAPDSTGAR PLPPPIASHE QASPAPDAQA PATDTAPAVL PLPDRPVASA RPVNAPGAQR PPAGGRRGSR APLAVAAALV LLFAVVAGVS ALTLMRGGKD GEATTAKPPA GGASSAPGED GSGGTGAPAG EAPPGAGGSG VPAPAQDAPA PGASPQDGAP APGRAPADPT PPPRDPIGPV LRGKGLTYQL VQHDPGYYEG LLIITNHGAE PMREWTITFE TPGADVKHVW GGELVRGGDR VQIRSLDGAP QIPPGGTWEV RFGAAGSPVE PRKCRFNDRE CGLE
|
| |