Gene SeAg_B3833 details


Gene Information

Locus tag: SeAg_B3833
Symbol: yhjU
ID: 6797071
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Agona str. SL483
Kingdom: Bacteria
Replicon accession: NC_011149
Strand:
Start bp: 3723688
End bp: 3725367
Gene Length: 1680 bp
Protein Length: 559 aa
Translation table: 11
GC content: 55%
IMG OID: 642777957
Product: cellulose synthase operon protein YhjU
Protein accession: YP_002148553
Protein GI: 197247918
COG category:
COG ID:
TIGRFAM ID: [TIGR03368] cellulose synthase operon protein YhjU
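
The start, end, and length fields above can be cross-checked with simple arithmetic. The Python sketch below is illustrative only. It assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length; the record states neither convention explicitly, and the variable names are hypothetical.

```python
# Values copied from the gene record.
start_bp = 3723688
end_bp = 3725367
gene_length_bp = 1680
protein_length_aa = 559

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codons = span_bp // 3
expected_protein_length = codons - 1

print(span_bp, codons, expected_protein_length)  # 1680 560 559
```

Under these assumptions the computed span equals the listed gene length, and the codon count minus the stop codon equals the listed protein length.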


Plasmid Coverage information

Num covering plasmid clones: 42
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACTCAGC ATACTCAAAC TCCTTCAATG CCTTCTCCGC TCTGGCAGTA CTGGCGCGGT 
CTTTCCGGCT GGAACTTCTA TTTTCTGGTC AAGTTTGGCC TGCTGTGGGC AGGCTATCTG
AATTTTCATC CTTTACTGAA TTTGGTATTC ATGGCGTTTC TGCTCATGCC AATACCAAAG
TATCGCCTTC ACCGGTTGCG CCACTGGATT GCCATTCCCG TCGGCTTCGC GCTGTTCTGG
CATGATACCT GGCTGCCCGG CCCGCAAAGC ATTATGAGTC AGGGGACGCA GGTGGCGGAA
TTCAGCTCCG GTTATCTGCT CGATCTGATC GCCCGTTTTA TTAACTGGCA AATGATCGGC
GCCATCTTCG TACTGCTGGT TGCCTGGCTT TTTTTATCAC AGTGGATTCG GGTCACGGTG
TTTGTGGTCG CCATCATGGT ATGGCTGAAT GTCCTGACAT TAACCGGCCC GGTTTTTACG
CTGTGGCCGG CAGGCCAGCC AACCGATACG GTGACGACGA CTGGCGGTAA TGCGGCCGCT
ACCGTCGCGA CAGCGGGCGA TAAGCCGGTC ATCGGCGATA TGCCTGCGCA AACCGCGCCG
CCGACGACCG CGAATCTGAA CGCCTGGTTG AACACCTTCT ATGCCGCGGA AGAAAAGCGG
AAAACGACGT TCCCGGCGCA GCTTCCGCCT GATGCGCAGC CGTTCGACCT ATTGGTCATC
AATATTTGTT CGCTCTCCTG GTCGGATGTC GAAGCGGCAG GCTTGATGTC ACATCCGCTA
TGGTCGCACT TTGACATTTT GTTTAAACAC TTTAATTCCG GCACGTCTTA CAGCGGCCCG
GCGGCCATTC GTCTGCTGCG CGCCAGCTGT GGTCAACCAT CGCATACCCG ACTTTATCAA
CCAGCCGATA ACGAATGTTA TCTGTTTGAT AATCTGGCGA AACTGGGCTT TACTCAGCAT
CTGATGATGG ATCATAACGG TGAATTTGGC GGCTTCCTGA AAGAAGTTCG CGAAAACGGC
GGTATGCAGA GCGAACTGAT GAACCAGTCC GGCCTGCCAA CCGCCCTGCT GTCATTCGAC
GGCTCGCCGG TATATGACGA TTTGGCGGTC CTGAACCGCT GGTTGGCAGG GGAAGAACGT
GAAGCCAATT CCCGCTCCGC GACTTTCTTT AACCTGCTGC CGCTGCACGA TGGCAACCAT
TTCCCCGGCG TCAGCAAAAC GGCGGATTAT AAAATCCGCG CGCAGAAACT GTTCGATGAA
CTGGACGCCT TCTTTACCGA ACTGGAGAAA TCCGGGCGTA AGGTGATGGT GGTCGTCGTA
CCGGAGCACG GCGGCGCGCT GAAGGGCGAC AGAATGCAGA TATCAGGCCT GCGCGATATT
CCCAGCCCCT CCATCACCAA CGTCCCGGCG GGCGTGAAAT TTTTTGGCAT GAAAGCCCCG
CATGAGGGCG CGCCGATTGA TATTAATCAG CCGAGCAGCT ACCTGGCGAT TTCCGAACTG
GTCGTACGCG CCGTGGACGG TAAGCTCTTT ACCGAAGACA GTGTGAACTG GAACAAGCTG
ACCAGCAATC TGCCGCAAAC CGCGCCGGTT TCAGAAAACG CTAATGCGGT GGTGATTCAG
TACCAGGGTA AGCCCTACGT TCGTCTGAAT GGCGGCGACT GGGTGCCTTA CCCGCAGTAA
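
The gene sequence is laid out in space-separated blocks of ten bases, so it has to be joined before any length or GC calculation. A minimal Python sketch follows. `clean_sequence` and `gc_fraction` are hypothetical helper names, not part of the source database, and only the first row of the sequence is used as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First row of the gene sequence, copied as displayed in the record.
first_row = "ATGACTCAGC ATACTCAAAC TCCTTCAATG CCTTCTCCGC TCTGGCAGTA CTGGCGCGGT"
seq = clean_sequence(first_row)
print(len(seq), round(gc_fraction(seq), 3))
```

Running the same two functions on all rows of the gene sequence gives a value that can be compared with the GC content reported in the record.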
 
Protein sequence
MTQHTQTPSM PSPLWQYWRG LSGWNFYFLV KFGLLWAGYL NFHPLLNLVF MAFLLMPIPK 
YRLHRLRHWI AIPVGFALFW HDTWLPGPQS IMSQGTQVAE FSSGYLLDLI ARFINWQMIG
AIFVLLVAWL FLSQWIRVTV FVVAIMVWLN VLTLTGPVFT LWPAGQPTDT VTTTGGNAAA
TVATAGDKPV IGDMPAQTAP PTTANLNAWL NTFYAAEEKR KTTFPAQLPP DAQPFDLLVI
NICSLSWSDV EAAGLMSHPL WSHFDILFKH FNSGTSYSGP AAIRLLRASC GQPSHTRLYQ
PADNECYLFD NLAKLGFTQH LMMDHNGEFG GFLKEVRENG GMQSELMNQS GLPTALLSFD
GSPVYDDLAV LNRWLAGEER EANSRSATFF NLLPLHDGNH FPGVSKTADY KIRAQKLFDE
LDAFFTELEK SGRKVMVVVV PEHGGALKGD RMQISGLRDI PSPSITNVPA GVKFFGMKAP
HEGAPIDINQ PSSYLAISEL VVRAVDGKLF TEDSVNWNKL TSNLPQTAPV SENANAVVIQ
YQGKPYVRLN GGDWVPYPQ
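
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The Python sketch below assumes that the codon-to-amino-acid assignments of translation table 11 match those of the standard genetic code (the two tables differ in which start codons they permit). `translate` is a hypothetical helper written for illustration, not a tool provided by the source database.

```python
from itertools import product

# Codon table in the conventional TCAG order. Assumption: table 11 uses the
# same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a (possibly blocked) DNA sequence up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence, copied as displayed in the record.
first_row = "ATGACTCAGC ATACTCAAAC TCCTTCAATG CCTTCTCCGC TCTGGCAGTA CTGGCGCGGT"
print(translate(first_row))  # MTQHTQTPSMPSPLWQYWRG
```

The output matches the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` allows the same comparison for the whole protein.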