Gene SNSL254_A0040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A0040 
Symbol 
ID6483436 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp41717 
End bp42904 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content50% 
IMG OID642735484 
Productchondroitin sulfate/heparin utilization regulation protein 
Protein accessionYP_002039266 
Protein GI194443637 
COG category[R] General function prediction only 
COG ID[COG0641] Arylsulfatase regulator (Fe-S oxidoreductase) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones59 
Fosmid unclonability p-value0.295568 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTGGGA AAAGTTGTCA GGTCATGGTT AAACCGACCG GATCGGTGTG TAACCTTGAC 
TGTAAGTACT GTTTTTATCT GGAGAAAGAA AAGCTCTATC CGGATCGAAA AAACCATTAC
AAAATGTCGG AAGAGACCCT CGAACTCTTC ATCAGGCAGC AGATTGCCGC ACAAGATATT
GATGAGGTCA TTTTTGCGTG GCAGGGCGGG GAACCCACAT TAATGGGCAT CCCGTTTTAT
CGTAAAGCCG TTGAGTTTCA GCAGCGCTAT TGTGGCGGCA AAACCATCGT CAATACCTTC
CAGACCAACG GCATCCTGAT CAACGATGAC TGGGCGACCT TCTTCCGGGA GCATGATTTT
CTGGTTGGCG TCTCTATTGA TGGCGATGCC GCGTTACACG ATGAATGGCG AGTGACGCGC
TCCGGAAAGC CGACGCATGA AAAAGTAGAA AATGCGGTGC GTTGTCTGGC GCAGCACGAC
GTAGAATTTA ATACCCTCAC GGTGGTTAAC CGTACCAATA TGCATCATCC TGTTCAGGTC
TATCGCTACC TGAAAAGCAT TGGTAGCCGC TATATGCAAT TTATCCCTTT AGTTGAACGC
TGTGGGGAAA ATGGGCTGGC GCAGCCGCAG GATAAACATA TCGCGATGAC GCCGTGGTCG
GTCGATAGCC TGCAATTTGG TCAGTTTCTG AATGCGGTAT TTGATATCTG GATCCGTGAG
GATATCGGCG ATATCGGCAT CCAGCTATTT GAACAGACGC TGGCGGCCTG GTGCGGCCTG
CCGCCGCAGG TTTGCGTTTT TGCTCCCACC TGCGGCAGCG CGTTTGCGAT GGAAATGAAC
GGCGATGTTT ATAACTGCGA TCACTTCGTA TATCCGCAAT TTAAACTGGG GAATATCCAC
CAGAAGACGC TGCGTCAAAT GAATCAGGGC GAACAAAATC GCCAGTTCGG CAGCGATAAA
CAGCGTTCAA TGGCGCAGGA GTGTCATCGC TGTCAATGGA AGTTCGCCTG CTATGGCGGC
TGTCCGAAAC ATCGTTTTTT ACCCTCTGCG TCAGGCGCAA CCAATCATAA CTATCTGTGT
GCAGGTTATC AGGCTTTTTT CTCGCATACC GCGACGGCGA TGAGTGCCAT GCGAACCCTG
TATGAAAAAG GCATCTCACC TGCAGAAATA AAGTCAATAT TTGTTTGA
 
Protein sequence
MFGKSCQVMV KPTGSVCNLD CKYCFYLEKE KLYPDRKNHY KMSEETLELF IRQQIAAQDI 
DEVIFAWQGG EPTLMGIPFY RKAVEFQQRY CGGKTIVNTF QTNGILINDD WATFFREHDF
LVGVSIDGDA ALHDEWRVTR SGKPTHEKVE NAVRCLAQHD VEFNTLTVVN RTNMHHPVQV
YRYLKSIGSR YMQFIPLVER CGENGLAQPQ DKHIAMTPWS VDSLQFGQFL NAVFDIWIRE
DIGDIGIQLF EQTLAAWCGL PPQVCVFAPT CGSAFAMEMN GDVYNCDHFV YPQFKLGNIH
QKTLRQMNQG EQNRQFGSDK QRSMAQECHR CQWKFACYGG CPKHRFLPSA SGATNHNYLC
AGYQAFFSHT ATAMSAMRTL YEKGISPAEI KSIFV