Gene SNSL254_A3897 details


Gene Information

Locus tag: SNSL254_A3897
Symbol: yhjU
ID: 6485778
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Newport str. SL254
Kingdom: Bacteria
Replicon accession: NC_011080
Strand:
Start bp: 3780973
End bp: 3782652
Gene Length: 1680 bp
Protein Length: 559 aa
Translation table: 11
GC content: 56%
IMG OID: 642739161
Product: cellulose synthase operon protein YhjU
Protein accession: YP_002042872
Protein GI: 194446406
COG category:
COG ID:
TIGRFAM ID: [TIGR03368] cellulose synthase operon protein YhjU
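
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. Both are assumptions, although they agree with the values listed here. The function names are hypothetical and chosen for illustration only.

```python
# Hypothetical helpers; the numbers passed in below are copied from the record.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    n = gene_length(3780973, 3782652)
    print(n, protein_length(n))  # prints "1680 559"
```

Under these assumptions the result matches the stated Gene Length (1680 bp) and Protein Length (559 aa).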


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 67
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTCAGC ATACTCAAAC TCCTTCAATG CCTTCTCCGC TCTGGCAATA CTGGCGCGGT 
CTTTCCGGCT GGAACTTCTA TTTTCTGGTC AAGTTTGGCC TGCTGTGGGC AGGCTATCTG
AATTTTCATC CTTTACTGAA TCTGGTATTC ATGGCGTTTC TGCTCATGCC AATACCAAAG
TATCGCCTTC ACCGGTTGCG CCACTGGATT GCCATTCCCG TCGGCTTCGC GCTGTTCTGG
CATGATACCT GGCTGCCCGG CCCGCAAAGC ATTATGAGCC AGGGGACGCA GGTGGCGGAA
TTCAGCTCCG GTTATCTGCT CGATCTGATC GCCCGTTTTA TTAACTGGCA AATGATCGGC
GCCATCTTCG TACTGCTGGT TGCCTGGCTT TTTTTATCAC AGTGGATTCG GGTCACGGTG
TTTGTGGTCG CCATCATGGT ATGGCTGAAT GTCCTGACAT TAACCGGCCC GGTTTTTACG
CTGTGGCCGG CAGGCCAGCC AACCGATACG GTGACGACGA CTGGCGGTAA TGCGGCCGCT
ACCGTCGCGA CAGCGGGCGA TAAGCCGGTC ATCGGCGATA TGCCTGCGCA AACCGCGCCG
CCGACGACCG CGAATCTGAA CGCCTGGTTG AACACCTTCT ATGCCGCGGA AGAAAAGCGG
AAAACGACGT TCCCGGCGCA GCTTCCGCCT GATGCGCAGC CGTTCGACCT ATTGGTCATC
AATATTTGTT CGCTCTCCTG GTCGGATGTC GAAGCGGCAG GCTTGATGTC ACATCCGCTA
TGGTCGCACT TTGACATTTT GTTTAAACAC TTTAATTCCG GTACGTCTTA CAGCGGCCCG
GCGGCCATTC GTCTGCTGCG CGCCAGCTGT GGTCAACCAT CGCATACCCG ACTTTATCAA
CCAGCCGATA ACGAATGTTA TCTGTTTGAT AATCTGGCGA AGCTGGGCTT TACTCAGCAT
CTGATGATGG ATCATAACGG TGAATTTGGC GGCTTCCTGA AAGAAGTTCG CGAAAACGGC
GGTATGCAGA GCGAACTGAT GAACCAGTCC GGCCTGCCAA CCGCCCTGCT GTCATTCGAC
GGCTCGCCGG TATATGACGA TCTGGCGGTC CTGAACCGCT GGTTGACAGG GGAAGAACGT
GAAGCCAATT CCCGCTCCGC GACTTTCTTT AACCTGCTGC CGCTGCACGA TGGCAACCAC
TTCCCCGGCG TCAGCAAAAC GGCGGATTAT AAAATCCGCG CGCAGAAACT GTTCGATGAA
CTGGACGCCT TCTTCACCGA ACTGGAGAAA TCCGGGCGTA AGGTGATGGT GGTCGTCGTA
CCGGAGCACG GCGGCGCGCT GAAGGGCGAC AGAATGCAGA TCTCAGGCCT GCGCGATATT
CCCAGCCCCT CCATCACCAA CGTCCCGGCG GGCGTGAAAT TTTTTGGCAT GAAAGCCCCG
CATGAGGGCG CGCCGATTGA TATTAACCAG CCGAGCAGCT ACCTGGCAAT TTCCGAACTG
GTCGTACGCG CCGTGGACGG TAAGCTCTTT ACCGAAGACA GTGTGAACTG GAACAAGCTG
ACCAGCAATC TGCCGCAAAC CGCGCCGGTT TCAGAAAACG CTAATGCGGT GGTGATTCAG
TATCAGGGTA AGCCCTACGT TCGTCTGAAT GGCGGCGACT GGGTGCCTTA CCCGCAGTAA
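
The gene sequence is laid out in blocks of 10 bases, six blocks per line, so the whitespace has to be removed before any analysis. As a minimal sketch, the GC content field could be recomputed from such a listing as follows. The function names are hypothetical, and the fraction is taken over all bases in the cleaned string.

```python
# Hypothetical helpers for working with the blocked sequence layout above.

def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


if __name__ == "__main__":
    # First two blocks of the gene sequence, used only as a short example.
    example = clean_sequence("ATGACTCAGC ATACTCAAAC")
    print(len(example), gc_fraction(example))
```

Pasting the full listing into `clean_sequence` and passing the result to `gc_fraction` gives a value that can be compared with the GC content reported in the Gene Information table.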
 
Protein sequence
MTQHTQTPSM PSPLWQYWRG LSGWNFYFLV KFGLLWAGYL NFHPLLNLVF MAFLLMPIPK 
YRLHRLRHWI AIPVGFALFW HDTWLPGPQS IMSQGTQVAE FSSGYLLDLI ARFINWQMIG
AIFVLLVAWL FLSQWIRVTV FVVAIMVWLN VLTLTGPVFT LWPAGQPTDT VTTTGGNAAA
TVATAGDKPV IGDMPAQTAP PTTANLNAWL NTFYAAEEKR KTTFPAQLPP DAQPFDLLVI
NICSLSWSDV EAAGLMSHPL WSHFDILFKH FNSGTSYSGP AAIRLLRASC GQPSHTRLYQ
PADNECYLFD NLAKLGFTQH LMMDHNGEFG GFLKEVRENG GMQSELMNQS GLPTALLSFD
GSPVYDDLAV LNRWLTGEER EANSRSATFF NLLPLHDGNH FPGVSKTADY KIRAQKLFDE
LDAFFTELEK SGRKVMVVVV PEHGGALKGD RMQISGLRDI PSPSITNVPA GVKFFGMKAP
HEGAPIDINQ PSSYLAISEL VVRAVDGKLF TEDSVNWNKL TSNLPQTAPV SENANAVVIQ
YQGKPYVRLN GGDWVPYPQ
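
The protein sequence can be related to the gene sequence by translating it codon by codon. The record specifies translation table 11, which uses the same amino acid assignments as the standard genetic code and differs in the start codons it permits. The sketch below is a plain-Python illustration, not the database's own method: it builds the codon table from the conventional T, C, A, G ordering, stops at the first stop codon, and does not handle alternative start codons, which is sufficient here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
# Amino acids for all 64 codons in TCAG order (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        residues.append(amino_acid)
    return "".join(residues)


if __name__ == "__main__":
    # First three blocks of the gene sequence listed above.
    print(translate("ATGACTCAGC ATACTCAAAC TCCTTCAATG"))  # prints "MTQHTQTPSM"
```

The first 30 bases translate to MTQHTQTPSM, which matches the first block of the protein sequence, and the final codons CCG CAG TAA give the terminal PQ followed by a stop.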