Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tcur_3602 |
Symbol | |
ID | 8604953 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermomonospora curvata DSM 43183 |
Kingdom | Bacteria |
Replicon accession | NC_013510 |
Strand | + |
Start bp | 4142790 |
End bp | 4143881 |
Gene Length | 1092 bp |
Protein Length | 363 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | polysaccharide biosynthesis protein CelD |
Protein accession | YP_003301174 |
Protein GI | 269127804 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.317396 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTATAA CCGTCATCCG TCCCCGCGAG CTGGGAAGCG CCGAGCTGGC GGCCTGGCGT GCGATGCAGG CGGCCGGCCC GCGGCTGGCC AACCCGTTCA TGTCCCCCGA GTACGCCCAG GCGGTGGACC GGGTGCTGAA GGGACGAGCC CGGGTCGCGG TCTTAGAGGA CGGCCCCGAC CTGGTGGGCT TCTTCCCCTT CGAGCTGAAC GGCCCGGGGG TCGGCTCCGC CATCGGCGGC TGGCTGTCGC TGTGCCAGGG GCTGATCCAC GTTCCCGGCC TGATGCGGCT GGACGCCCGG GAGCTGCTGC GCGGCTGCGG GCTGGGGGTG TGGGAGTACG GGACGCTGGC GGCCGGGCAG CCGTGGTTCG AGCCCTACAC CACCAAGACC CTCGGCTCGG TGATCATGGA TCTGAGCGGC GGGTTCGAGG GCTACCTCAA AGGGCTGCGG GAGAGGGGCT CGAAGGTCGT CAAGCAGACC CGCTACAAGG AGCGCCGGAT GGGGCGCGAG GTCGGCGAGG TGCGCTTTGA GTTCGACGTG CGCGACGCCG GCGCGCTGCG CCTGGTGCGG CAGTGGAAGT CCGCCCAATA CCGGGCGATG GGCCGCGTCG ACCGTTTCTC CCGCCGCTGG GTGGTGGAGC TGGTCGAGCT GCTGCACGGG ATGCACGGGG AGGACTTCGC CGGGTCGCTG TCCATGCTGT ACGCCGGCGA CCGCCCGGTG GCCGGGCACT TCGGGCTGCG CAGCAGGCAC ACGCTGATCA CCTGGTTCCC GGTCTACGAC CCGGCCTATG CCAAGTACTC CCCCGGCCTG GCGCTGCACC TGCACATGGC CGAGGAGGCG GCCAAGCTGG GCATCCGGGA GATGGATCTG GGGCCGGGCG TCGGCTGGCG CTACAAGGAG GAGCTGAAAA GCCACGAGAC CCCGGTCGGC GAAGGGGTGG TGCGCCGCCC CTGCCTGAGC GCCGCCGCCC ACTGGGTCCG GCGCGCGCCG CTGGCCCGGG CGCGCCGGAT GATCCTGGAC AACGAGCGCC TGTACGGCAT GGCCGACCGC GCCATGCGCC GGTACGGGGC GTGGCGCACC CGGTCCCGGT GA
|
Protein sequence | MRITVIRPRE LGSAELAAWR AMQAAGPRLA NPFMSPEYAQ AVDRVLKGRA RVAVLEDGPD LVGFFPFELN GPGVGSAIGG WLSLCQGLIH VPGLMRLDAR ELLRGCGLGV WEYGTLAAGQ PWFEPYTTKT LGSVIMDLSG GFEGYLKGLR ERGSKVVKQT RYKERRMGRE VGEVRFEFDV RDAGALRLVR QWKSAQYRAM GRVDRFSRRW VVELVELLHG MHGEDFAGSL SMLYAGDRPV AGHFGLRSRH TLITWFPVYD PAYAKYSPGL ALHLHMAEEA AKLGIREMDL GPGVGWRYKE ELKSHETPVG EGVVRRPCLS AAAHWVRRAP LARARRMILD NERLYGMADR AMRRYGAWRT RSR
|
| |