Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_A3897 |
Symbol | yhjU |
ID | 6485778 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | + |
Start bp | 3780973 |
End bp | 3782652 |
Gene Length | 1680 bp |
Protein Length | 559 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 642739161 |
Product | cellulose synthase operon protein YhjU |
Protein accession | YP_002042872 |
Protein GI | 194446406 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR03368] cellulose synthase operon protein YhjU |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 67 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTCAGC ATACTCAAAC TCCTTCAATG CCTTCTCCGC TCTGGCAATA CTGGCGCGGT CTTTCCGGCT GGAACTTCTA TTTTCTGGTC AAGTTTGGCC TGCTGTGGGC AGGCTATCTG AATTTTCATC CTTTACTGAA TCTGGTATTC ATGGCGTTTC TGCTCATGCC AATACCAAAG TATCGCCTTC ACCGGTTGCG CCACTGGATT GCCATTCCCG TCGGCTTCGC GCTGTTCTGG CATGATACCT GGCTGCCCGG CCCGCAAAGC ATTATGAGCC AGGGGACGCA GGTGGCGGAA TTCAGCTCCG GTTATCTGCT CGATCTGATC GCCCGTTTTA TTAACTGGCA AATGATCGGC GCCATCTTCG TACTGCTGGT TGCCTGGCTT TTTTTATCAC AGTGGATTCG GGTCACGGTG TTTGTGGTCG CCATCATGGT ATGGCTGAAT GTCCTGACAT TAACCGGCCC GGTTTTTACG CTGTGGCCGG CAGGCCAGCC AACCGATACG GTGACGACGA CTGGCGGTAA TGCGGCCGCT ACCGTCGCGA CAGCGGGCGA TAAGCCGGTC ATCGGCGATA TGCCTGCGCA AACCGCGCCG CCGACGACCG CGAATCTGAA CGCCTGGTTG AACACCTTCT ATGCCGCGGA AGAAAAGCGG AAAACGACGT TCCCGGCGCA GCTTCCGCCT GATGCGCAGC CGTTCGACCT ATTGGTCATC AATATTTGTT CGCTCTCCTG GTCGGATGTC GAAGCGGCAG GCTTGATGTC ACATCCGCTA TGGTCGCACT TTGACATTTT GTTTAAACAC TTTAATTCCG GTACGTCTTA CAGCGGCCCG GCGGCCATTC GTCTGCTGCG CGCCAGCTGT GGTCAACCAT CGCATACCCG ACTTTATCAA CCAGCCGATA ACGAATGTTA TCTGTTTGAT AATCTGGCGA AGCTGGGCTT TACTCAGCAT CTGATGATGG ATCATAACGG TGAATTTGGC GGCTTCCTGA AAGAAGTTCG CGAAAACGGC GGTATGCAGA GCGAACTGAT GAACCAGTCC GGCCTGCCAA CCGCCCTGCT GTCATTCGAC GGCTCGCCGG TATATGACGA TCTGGCGGTC CTGAACCGCT GGTTGACAGG GGAAGAACGT GAAGCCAATT CCCGCTCCGC GACTTTCTTT AACCTGCTGC CGCTGCACGA TGGCAACCAC TTCCCCGGCG TCAGCAAAAC GGCGGATTAT AAAATCCGCG CGCAGAAACT GTTCGATGAA CTGGACGCCT TCTTCACCGA ACTGGAGAAA TCCGGGCGTA AGGTGATGGT GGTCGTCGTA CCGGAGCACG GCGGCGCGCT GAAGGGCGAC AGAATGCAGA TCTCAGGCCT GCGCGATATT CCCAGCCCCT CCATCACCAA CGTCCCGGCG GGCGTGAAAT TTTTTGGCAT GAAAGCCCCG CATGAGGGCG CGCCGATTGA TATTAACCAG CCGAGCAGCT ACCTGGCAAT TTCCGAACTG GTCGTACGCG CCGTGGACGG TAAGCTCTTT ACCGAAGACA GTGTGAACTG GAACAAGCTG ACCAGCAATC TGCCGCAAAC CGCGCCGGTT TCAGAAAACG CTAATGCGGT GGTGATTCAG TATCAGGGTA AGCCCTACGT TCGTCTGAAT GGCGGCGACT GGGTGCCTTA CCCGCAGTAA
|
Protein sequence | MTQHTQTPSM PSPLWQYWRG LSGWNFYFLV KFGLLWAGYL NFHPLLNLVF MAFLLMPIPK YRLHRLRHWI AIPVGFALFW HDTWLPGPQS IMSQGTQVAE FSSGYLLDLI ARFINWQMIG AIFVLLVAWL FLSQWIRVTV FVVAIMVWLN VLTLTGPVFT LWPAGQPTDT VTTTGGNAAA TVATAGDKPV IGDMPAQTAP PTTANLNAWL NTFYAAEEKR KTTFPAQLPP DAQPFDLLVI NICSLSWSDV EAAGLMSHPL WSHFDILFKH FNSGTSYSGP AAIRLLRASC GQPSHTRLYQ PADNECYLFD NLAKLGFTQH LMMDHNGEFG GFLKEVRENG GMQSELMNQS GLPTALLSFD GSPVYDDLAV LNRWLTGEER EANSRSATFF NLLPLHDGNH FPGVSKTADY KIRAQKLFDE LDAFFTELEK SGRKVMVVVV PEHGGALKGD RMQISGLRDI PSPSITNVPA GVKFFGMKAP HEGAPIDINQ PSSYLAISEL VVRAVDGKLF TEDSVNWNKL TSNLPQTAPV SENANAVVIQ YQGKPYVRLN GGDWVPYPQ
|
| |