Gene SNSL254_A4246 details

Gene Information

Locus tag: SNSL254_A4246
Symbol:
ID: 6483911
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Newport str. SL254
Kingdom: Bacteria
Replicon accession: NC_011080
Strand:
Start bp: 4137157
End bp: 4138452
Gene Length: 1296 bp
Protein Length: 431 aa
Translation table: 11
GC content: 54%
IMG OID: 642739496
Product: chondroitin sulfate/heparin utilization regulation protein
Protein accession: YP_002043195
Protein GI: 194443254
COG category: [R] General function prediction only
COG ID: [COG0641] Arylsulfatase regulator (Fe-S oxidoreductase)
TIGRFAM ID:
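
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon that is not counted in the protein length. Neither assumption is stated on this page, and the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 4137157
end_bp = 4138452
protein_length_aa = 431

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codons = gene_length_bp // 3
expected_protein_length = codons - 1

print(gene_length_bp)           # 1296
print(expected_protein_length)  # 431
```

Under these assumptions, both derived values agree with the Gene Length and Protein Length fields.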


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.598382
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 77
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGGTAG CAGGTTGTCA CGTAATGGCC AAACCCGGCG GCGCGATCTG CAATATTGAT 
TGCACATATT GCTTCTATCT TGAAAAAGAG GCGCTGTACC CGGAACGCAA TAAAAACTGG
CGGATGTCGG ACGAGACGCT GGAACAATTT ATACGCCAGC ATATTGCCGC GCAGAGTGGC
GACCGCATTG ACTTTGCCTG GCAGGGCGGC GAACCGACCA TGATGGGGCT GCCATTTTTC
CGCCGGGTTG TCGCATTGTG TGAAAAGTAC GGCGATGGGC GAAAAATCAC TCATGCGTTA
CAGACGAACG GCATCCTGGT GAATGACGAG TGGGCGCGCT TTTTCGCTGA ACAGCATTTT
CTCATCGGTC TCTCTATCGA CGGTCCGGCG TCGTTACACA ACCACTATCG GCTTAATCGC
GCTGGAAAAG GAACTCATGA ACAGGTCGTC GCAGCCATGG CGCGGCTTAA AGCGCACCAT
GTCGACTTTA ATACCTTAAC CGTCGTGGGA AAACATAACG TCGGTCATGC AGCAGACGTC
TACGAATTTC TTCTGGCGGC GGGATCGCGT TTTATTCAGT TTATCCCGCT GGTAGAGCGA
ATGAGCACCG ATAACTCATC GGTACTTAAT CTGGTGATGC CCGGCGAAAG CGCGGCAAAG
CTGGCGCCAT GGACGGTACC GTCGTGGCAA TATGGCGAAT TTCTCAACCA GATCTTTGAT
ATCTGGGTTC TTCGCGACGT AGACCGCGTC TATGTGCAGA TGTTTGACGT GGCGTTAGCC
GCCTGGACGG CGCAGCAGCC GGTACTGTGT GTACATTCCG AGACTTGTGG ACATGCCTTC
GCGTTGGAGT CGAACGGCGA TCTCTACAAC TGCGACCACT TTGTCTACCC GGAGCATCTG
CTGGGGAATA TCCACCAGCA CAGCATCAAA ACCTTAAATA ATAGCGAGCG GGCTATTGCC
TTTGGTGAGG CCAAGCGGGA GACCCTGACC GCCGATTGTC GTCGCTGTGA CTACCGCTTT
GCGTGTCATG GCGGCTGTCC GAAGCATCGC TTTGCCGTCT CGCCGTCCGG TCATCCTGCG
CATAATTACT TGTGTGCGGG CTATAAGCAT TTTTTCCAGC ACGTTACGCC GTATATGAAT
GTCTGGCGGG AGCTGCTGGC GCAAGGCTAT CCGATGGCAT CGATCATGCG CTGGCTGGCG
CAGGACGCGC GTAAAGACAC AGGAGCCGTC AGTCGTAACC ATCTCTGTCC CTGCGGCAGC
GGCAAAAAAT ATAAAAAATG CTGTGGTAAA GCATAG
 
Protein sequence
MAVAGCHVMA KPGGAICNID CTYCFYLEKE ALYPERNKNW RMSDETLEQF IRQHIAAQSG 
DRIDFAWQGG EPTMMGLPFF RRVVALCEKY GDGRKITHAL QTNGILVNDE WARFFAEQHF
LIGLSIDGPA SLHNHYRLNR AGKGTHEQVV AAMARLKAHH VDFNTLTVVG KHNVGHAADV
YEFLLAAGSR FIQFIPLVER MSTDNSSVLN LVMPGESAAK LAPWTVPSWQ YGEFLNQIFD
IWVLRDVDRV YVQMFDVALA AWTAQQPVLC VHSETCGHAF ALESNGDLYN CDHFVYPEHL
LGNIHQHSIK TLNNSERAIA FGEAKRETLT ADCRRCDYRF ACHGGCPKHR FAVSPSGHPA
HNYLCAGYKH FFQHVTPYMN VWRELLAQGY PMASIMRWLA QDARKDTGAV SRNHLCPCGS
GKKYKKCCGK A
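
Both sequences above are printed in blocks of 10 characters separated by spaces, so they need to be cleaned before any downstream use. The following is a minimal sketch of that cleanup plus a GC-fraction calculation. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used as sample input, so the result is not expected to match the 54% reported for the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied as displayed on the page.
first_line = "ATGGCGGTAG CAGGTTGTCA CGTAATGGCC AAACCCGGCG GCGCGATCTG CAATATTGAT "

seq = clean_sequence(first_line)
print(len(seq))          # 60
print(gc_fraction(seq))  # GC fraction of this 60-base fragment only
```

Pasting the full gene sequence into `clean_sequence` would allow the same check against the Gene Length and GC content fields.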