Gene SeD_A4001 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A4001 
Symbol 
ID6871722 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp3846195 
End bp3847874 
Gene Length1680 bp 
Protein Length559 aa 
Translation table11 
GC content56% 
IMG OID642786957 
Productendoglucanase BcsG 
Protein accessionYP_002217585 
Protein GI198246095 
COG category 
COG ID 
TIGRFAM ID[TIGR03368] cellulose synthase operon protein YhjU 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones72 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTCAGC ATACTCAAAC TCCATCAATG CCTTCTCCGC TCTGGCAGTA CTGGCGCGGT 
CTTTCCGGCT GGAACTTCTA TTTTCTGGTC AAGTTTGGCC TGCTGTGGGC AGGCTATCTG
AATTTTCATC CTTTACTGAA TCTGGTATTC ATGGCGTTTC TGCTCATGCC AATACCAAAG
TATCGCCTTC ACCGGTTGCG CCACTGGATT GCCATTCCCG TCGGCTTCGC GCTGTTCTGG
CACGATACCT GGCTGCCCGG CCCGCAAAGC ATTATGAGCC AGGGGACGCA GGTGGCGGAA
TTCAGCTCCG GTTATCTGCT CGATCTGATC GCCCGTTTTA TTAACTGGCA AATGATCGGC
GCCATCTTCG TACTGCTGGT TGCCTGGCTT TTTTTATCAC AGTGGATTCG GGTCACGGTG
TTTGTGGTCG CCATCATGGT ATGGCTGAAT GTCCTGACAT TAACCGGCCC GGTTTTTACG
CTGTGGCCGG CAGGCCAGCC AACCGATACG GTGACGACAA CTGGCGGTAA CGCGGCCGCT
ACCGTCGCGA CAGCGGGCGA TAAGCCGGTC ATCGGCGATA TGCCTGCGCA AACCGCGCCG
CCGACGACCG CGAATCTGAA CGCCTGGTTG AACACCTTCT ATGCCGCGGA AGAAAAGCGG
AAAACGACGT TCCCGGCGCA ACTTCCGCCT GATGCGCAGC CGTTCGACCT ATTGGTCATC
AATATTTGTT CGCTCTCCTG GTCGGATGTC GAAGCGGCAG GCCTGATGTC ACATCCGCTA
TGGTCGCACT TTGACATTTT GTTTAAACAC TTTAATTCCG GTACGTCTTA CAGCGGCCCG
GCGGCCATTC GTCTGCTACG CGCCAGCTGT GGTCAACCTT CGCATACCCG ACTTTATCAA
CCAGCCAATA ACGAATGTTA TCTGTTTGAT AATCTGGCGA AACTGGGCTT TACTCAGCAT
CTGATGATGG ATCATAACGG TGAATTTGGC GGCTTCCTGA AAGAAGTTCG CGAAAACGGC
GGTATGCAGA GCGAACTGAT GAACCAGTCC GGCCTGCCAA CCGCCCTGCT GTCATTCGAC
GGCTCGCCGG TATATGACGA TCTGGCGGTC CTGAACCGCT GGTTGACAGG GGAAGAACGT
GAAGCCAATT CCCGCTCCGC GACTTTCTTT AACCTGCTGC CGCTGCACGA TGGCAACCAC
TTCCCCGGCG TCAGCAAAAC GGCGGATTAT AAAATCCGCG CGCAGAAACT GTTCGATGAA
CTGGACGCCT TCTTCACCGA ACTGGAGAAA TCCGGGCGTA AGGTGATGGT GGTCGTCGTA
CCGGAGCACG GCGGCGCGCT GAAGGGCGAC AGAATGCAGA TCTCAGGCCT GCGCGATATT
CCCAGCCCCT CCATCACCAA CGTCCCGGCG GGCGTGAAAT TTTTTGGCAT GAAAGCCCCG
CATGAGGGCG CGCCGATTGA TATTAACCAG CCGAGCAGCT ACCTGGCGAT TTCCGAACTG
GTCGTACGCG CCGTGGACGG TAAGCTCTTT ACCGAAGACA GTGTGAACTG GAACAAGCTG
ACCAGCAATC TGCCGCAAAC CGCGCCGGTT TCAGAAAACG CTAATGCGGT GGTGATTCAG
TATCAGGGTA AGCCCTACGT TCGTCTGAAT GGCGGCGACT GGGTGCCTTA CCCGCAGTAA
 
Protein sequence
MTQHTQTPSM PSPLWQYWRG LSGWNFYFLV KFGLLWAGYL NFHPLLNLVF MAFLLMPIPK 
YRLHRLRHWI AIPVGFALFW HDTWLPGPQS IMSQGTQVAE FSSGYLLDLI ARFINWQMIG
AIFVLLVAWL FLSQWIRVTV FVVAIMVWLN VLTLTGPVFT LWPAGQPTDT VTTTGGNAAA
TVATAGDKPV IGDMPAQTAP PTTANLNAWL NTFYAAEEKR KTTFPAQLPP DAQPFDLLVI
NICSLSWSDV EAAGLMSHPL WSHFDILFKH FNSGTSYSGP AAIRLLRASC GQPSHTRLYQ
PANNECYLFD NLAKLGFTQH LMMDHNGEFG GFLKEVRENG GMQSELMNQS GLPTALLSFD
GSPVYDDLAV LNRWLTGEER EANSRSATFF NLLPLHDGNH FPGVSKTADY KIRAQKLFDE
LDAFFTELEK SGRKVMVVVV PEHGGALKGD RMQISGLRDI PSPSITNVPA GVKFFGMKAP
HEGAPIDINQ PSSYLAISEL VVRAVDGKLF TEDSVNWNKL TSNLPQTAPV SENANAVVIQ
YQGKPYVRLN GGDWVPYPQ