Gene SNSL254_A2071 details


Gene Information

Locus tag: SNSL254_A2071
Symbol:
ID: 6483914
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Newport str. SL254
Kingdom: Bacteria
Replicon accession: NC_011080
Strand:
Start bp: 2011386
End bp: 2012525
Gene Length: 1140 bp
Protein Length: 379 aa
Translation table: 11
GC content: 52%
IMG OID: 642737427
Product: glycosyl hydrolase, family 88
Protein accession: YP_002041177
Protein GI: 194442446
COG category: [R] General function prediction only
COG ID: [COG4225] Predicted unsaturated glucuronyl hydrolase involved in regulation of bacterial surface properties, and related proteins
TIGRFAM ID:
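
The coordinate and length fields in this record are consistent with one another, and the arithmetic can be checked with a short sketch. It assumes the start and end coordinates are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; the variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2011386
end_bp = 2012525

# Assumption: coordinates are 1-based and inclusive, so both ends are counted.
gene_length_bp = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be a stop codon,
# which does not contribute an amino acid to the protein.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)     # 1140
print(codon_count)        # 380
print(protein_length_aa)  # 379
```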


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value: 0.00000233336
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGATGGTTT ATCCAGTCAA ACACAGTCCG CTATTGCGTC AGCCTGAGCA TTTTATCGCC 
AGAGATGAAC TAAAAGCCCT GGTGCAAAAG GTGACGCATA ACCTGGTCAA TATTAAAGAT
GAGACAGGCG AATTTTTATT GCGACTGGAC GACGGACGCG TGATCGATAC TAAAGGCTGG
GCCGGATGGG AGTGGACGCA CGGCGTGGGC CTGTACGGAA TGTATCACTA TTACCAACAG
ACCGGCGACC AGACGATGCG TAAGATCATT GATGACTGGT TTGCCGATCG TTTTGCGGAA
GGCGCGACGA CGAAAAACGT TAATACGATG GCGCCGTTTT TAACGTTGGC GTATCGCTAT
GAAGAGACGC GTAATCCAGC GTATTTACCG TGGCTGGAAA CGTGGGCCGA ATGGGCGATG
AATGAAATGC CCCGAACCGA TCACGGCGGA ATGCAGCACA TCACGCTGGC GGAGGAAAAT
CATCAGCAGA TGTGGGACGA CACGCTAATG ATGACGGTGC TGCCGCTGGC GAAAATCGGT
AAACTGTTGA ACCGGCCGGA ATATGTGGAA GAGGCAACCT ATCAGTTCCT GCTGCACGTG
CAGAATTTGA TGGATAAAGA GACGGGGCTG TGGTTCCACG GCTGGAGCTA TGACGGTCAT
CATAACTTCG CTAATGCTCG CTGGGCGCGC GGCAACAGCT GGCTGACCAT TGTGATCCCG
GATTTTCTTG AACTGCTGGA CTTGCCGGAA AATAATGCCG TGCGCCGTTA CCTGGTCCAG
GTACTGAATG CGCAGATCGC CGCGCTGGCG AAATGTCAGG ATAAAAGCGG TTTGTGGCAT
ACGCTGCTTG ACGATCCGCA CTCTTATCTT GAGGCGTCGG CGACGGCGGG TTTTGCCTAC
GGTATTCTTA AAGCGGTGCG CAAACGCTAT GTCGAACGGC ACTATGCGCA GGTGGCGGAA
AAAGCTATTC GGGGGATAGT GAAACATATC TCGCCGGAAG GCGAACTGCT GCAAACGTCA
TTTGGCACTG GCATGGGCCA CGATCTCGAT TTTTACCGTC ATATTCCGTT GACCTCTATG
CCTTACGGGC AGGCGATGGC AATGCTGTGT TTGACGGAAT ATCTGCGTAA CTATTTCTGA
 
Protein sequence
MMVYPVKHSP LLRQPEHFIA RDELKALVQK VTHNLVNIKD ETGEFLLRLD DGRVIDTKGW 
AGWEWTHGVG LYGMYHYYQQ TGDQTMRKII DDWFADRFAE GATTKNVNTM APFLTLAYRY
EETRNPAYLP WLETWAEWAM NEMPRTDHGG MQHITLAEEN HQQMWDDTLM MTVLPLAKIG
KLLNRPEYVE EATYQFLLHV QNLMDKETGL WFHGWSYDGH HNFANARWAR GNSWLTIVIP
DFLELLDLPE NNAVRRYLVQ VLNAQIAALA KCQDKSGLWH TLLDDPHSYL EASATAGFAY
GILKAVRKRY VERHYAQVAE KAIRGIVKHI SPEGELLQTS FGTGMGHDLD FYRHIPLTSM
PYGQAMAMLC LTEYLRNYF
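
The relationship between the gene sequence and the protein sequence above can be illustrated with a minimal sketch that translates DNA codon by codon and computes GC content. It is not the tool used to produce this record. It assumes the sequence is given on the coding strand in frame, and it relies on the fact that translation table 11 assigns sense codons the same amino acids as the standard genetic code (the two differ in which codons may act as start codons, which does not matter here because the gene begins with ATG). The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence shown in this record.
first_line = "ATGATGGTTT ATCCAGTCAA ACACAGTCCG CTATTGCGTC AGCCTGAGCA TTTTATCGCC"
print(translate(first_line))  # MMVYPVKHSPLLRQPEHFIA
```

The printed peptide matches the first 20 residues of the protein sequence listed above. Passing the full gene sequence to `translate` and `gc_percent` would allow the Protein Length and GC content fields to be compared against the record in the same way.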