Gene SeSA_A2066 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeSA_A2066 
Symbol 
ID6515735 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633 
KingdomBacteria 
Replicon accessionNC_011094 
Strand
Start bp1984324 
End bp1985463 
Gene Length1140 bp 
Protein Length379 aa 
Translation table11 
GC content53% 
IMG OID642747146 
Productglycosyl hydrolase, family 88 
Protein accessionYP_002114947 
Protein GI194736201 
COG category[R] General function prediction only 
COG ID[COG4225] Predicted unsaturated glucuronyl hydrolase involved in regulation of bacterial surface properties, and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.329645 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.0470712 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGGTTT ATCCAGTCAA ACACAGTCCG CTATTGCGTC AGCCTGAGCA TTTTATCGCC 
AGAGATGAAC TAAAAGCCCT GGTGCAAAAG GTGACGCATA ACCTGGTCAA TATTAAAGAT
GAGACAGGCG AATTTTTATT GCGACTGGAC GACGGACGCG TGATCGATAC TAAAGGCTGG
GCCGGATGGG AGTGGACGCA CGGCGTGGGC CTGTACGGAA TGTATCACTA TTACCAACAG
ACCGGCGACC AGACGATGCG TAAGATCATT GATGACTGGT TTGCCGATCG TTTTGCGGAA
GGCGCGACGA CGAAAAACGT TAATACGATG GCGCCATTTT TAACGCTGGC GTATCGCTAT
GAAGAGACGC GTAATCCAGC GTATTTACCG TGGCTGGAAA CGTGGGCCGA ATGGGCGATG
AATGAAATGC CCCGAACCGA TCACGGTGGA ATGCAGCACA TCACGCTGGC GGAGGAAAAT
CATCAGCAGA TGTGGGACGA CACGCTAATG ATGACGGTGC TGCCGCTGGC GAAAATCGGT
AAGCTGCTGA ACCGGCCGGA ATATGTGGAA GAGGCAACCT ATCAGTTCCT GCTACACGTG
CAGAATTTGA TGGATAAAGA GACGGGGCTG TGGTTCCACG GCTGGAGCTA TGACGGTCAT
CATAACTTCG CTAACGCTCG CTGGGCGCGC GGCAACAGTT GGCTGACCAT TGTGATCCCG
GATTTTCTTG AACTGCTGGA CTTGCCGGAA AATAATGCCG TGCGCCGTTA CCTGGTCCAG
GTACTGAATG CGCAGATCGC CGCGCTGGCG AAATGTCAGG ATGAAAGCGG TTTGTGGCAT
ACGCTGCTTG ACGATCCGCA CTCTTATCTT GAGGCGTCGG CGACGGCGGG TTTTGCCTAC
GGTATTCTTA AAGCGGTGCG CAAACGCTAT GTCGGGCGGC ACTATGCGCA GGTGGCGGAA
AAAGCTATTC GGGGGATAGT GAAACATATC TCGCCGGAAG GTGAACTGCT GCAAACGTCA
TTTGGCACTG GCATGGGCCA CGATCTCGAT TTTTACCGTC ATATTCCGTT GACCTCTATG
CCTTACGGGC AGGCGATGGC TATGCTGTGT TTGACGGAAT ATCTGCGTAA CTATTTCTGA
 
Protein sequence
MMVYPVKHSP LLRQPEHFIA RDELKALVQK VTHNLVNIKD ETGEFLLRLD DGRVIDTKGW 
AGWEWTHGVG LYGMYHYYQQ TGDQTMRKII DDWFADRFAE GATTKNVNTM APFLTLAYRY
EETRNPAYLP WLETWAEWAM NEMPRTDHGG MQHITLAEEN HQQMWDDTLM MTVLPLAKIG
KLLNRPEYVE EATYQFLLHV QNLMDKETGL WFHGWSYDGH HNFANARWAR GNSWLTIVIP
DFLELLDLPE NNAVRRYLVQ VLNAQIAALA KCQDESGLWH TLLDDPHSYL EASATAGFAY
GILKAVRKRY VGRHYAQVAE KAIRGIVKHI SPEGELLQTS FGTGMGHDLD FYRHIPLTSM
PYGQAMAMLC LTEYLRNYF