Gene SNSL254_A3372 details


Gene Information

Locus tag: SNSL254_A3372
Symbol:
ID: 6485299
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Newport str. SL254
Kingdom: Bacteria
Replicon accession: NC_011080
Strand:
Start bp: 3271728
End bp: 3272912
Gene Length: 1185 bp
Protein Length: 394 aa
Translation table: 11
GC content: 50%
IMG OID: 642738663
Product: arylsulfatase-activating protein AtsB
Protein accession: YP_002042383
Protein GI: 194443597
COG category: [R] General function prediction only
COG ID: [COG0641] Arylsulfatase regulator (Fe-S oxidoreductase)
TIGRFAM ID:
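
The coordinate and length fields in this record are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon and that the last codon of the CDS is a stop codon not counted in Protein Length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3271728
end_bp = 3272912

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# A CDS is read in triplets; the final codon is assumed to be a stop codon,
# which does not contribute an amino acid.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)     # 1185
print(protein_length_aa)  # 394
```

Both results agree with the Gene Length (1185 bp) and Protein Length (394 aa) fields.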


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 69
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTGAATC TCAACACATT ACGGCAACAA CAAATCCCCG TAATGACGGA ATATCGCGCG 
CAGATACCCT TTCATATATT GGCAAAACCC ATAGGCCCTG CATGTAATCT GGCCTGCCGC
TATTGTTATT ACCCACAGGG CGAAACGCCC GTAGAAAAAA TGAATGAATC AACGCTGGAG
GTTTTTATTT GTCGCTATAT TGCGGCGCAA CCTGCCAGTG CGCGTGAAAT TAATTTTGTC
TGGCAAGGCG GTGAACCGCT TTTAGCCGGA ATCGGTTTTT ATAAAAAGGT AATAGCGCTT
CAACAACGAT ATGCGCCTGA CGGCGTGACG ATCAGTAATA GTCTGCAAAC GAATGCGACG
TTGTTAAACG ATGCCTGGTG CCGTCTGTTT CGCGACAATA ATTTTACTAT TGGCATCAGT
CTTGAGGGCA GTGAAGACTT GCAAAATCAT CATCGTCCGG GCAAACGCGG CGAGGCCAGC
TATCCGGCGG TGTTGCGGGG AATCACATTG TTACAACACT ATCGAGTCGA TTTTAATGTA
CTGATTGTCG TGCATGATGA CATGGCTCGC CATGCGGCAG CCATCTACGA TCATGTTGTT
AGCCTTGGCG CTCGTTATCT GCAATTTCAG CCACTGATGG ACGAAGGCAA CGCCCTACAG
CAACGTTACC AATTGAGTGC GGATAACTGG GGACGTTTCA TGATTGATAT CTGGCGTCAA
TGGCGCAAAC GCGGTGATAT GGGACGGGTT TTTGTGATCA ACATTGAACA GGCATGGGCA
CAATATTTTA CGCATATCAG CGCCACCTGT GTCCATTCCG CCCGCTGCGG CACGAATCTG
GTCATGGAGC CGGACGGCAA ACTCTATGCC TGCGATCATC TGATTAATAG CCAGCATTAC
CTGGGACAGC TTTCTAATAA TACGTTAGCG CCAGCCGTAG ATTCCGCAAC CCGGCTTCCC
TTTGGTATTA AGAAAAGCCA GCGCCGGGAG TGTCAACGGT GTTCTGTGAA AATAGTCTGC
CAGGGAGGCT GCCCCGCACA TATCAACAGT GCCGGCTACA ACCGACTTTG TAGCGGCTAT
TACTCTTTTT TCACGGAGAT TCTGGCTCCG CTACGCGCCT GGCCCCGGAA TCTGAATGGA
CTGAAGGCCT GGCGTGCTGA CGTTATGGGC AGATTTTCGG GCTGA
 
Protein sequence
MLNLNTLRQQ QIPVMTEYRA QIPFHILAKP IGPACNLACR YCYYPQGETP VEKMNESTLE 
VFICRYIAAQ PASAREINFV WQGGEPLLAG IGFYKKVIAL QQRYAPDGVT ISNSLQTNAT
LLNDAWCRLF RDNNFTIGIS LEGSEDLQNH HRPGKRGEAS YPAVLRGITL LQHYRVDFNV
LIVVHDDMAR HAAAIYDHVV SLGARYLQFQ PLMDEGNALQ QRYQLSADNW GRFMIDIWRQ
WRKRGDMGRV FVINIEQAWA QYFTHISATC VHSARCGTNL VMEPDGKLYA CDHLINSQHY
LGQLSNNTLA PAVDSATRLP FGIKKSQRRE CQRCSVKIVC QGGCPAHINS AGYNRLCSGY
YSFFTEILAP LRAWPRNLNG LKAWRADVMG RFSG
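
The protein sequence is the translation of the gene sequence under translation table 11. The sketch below shows one way to reproduce that relationship without third-party libraries. It assumes that table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ in permitted start codons, not in these assignments), and the helper names `clean`, `translate` and `gc_fraction` are hypothetical. Only the first and last lines of the gene sequence are copied in to keep the example short; this is not necessarily how the database computed its values.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces/newlines used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 0; '*' marks a stop codon. Trailing partial codons are ignored."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    dna = clean(dna)
    if not dna:
        return 0.0
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGTTGAATC TCAACACATT ACGGCAACAA CAAATCCCCG TAATGACGGA ATATCGCGCG"
last_line = "CTGAAGGCCT GGCGTGCTGA CGTTATGGGC AGATTTTCGG GCTGA"

print(translate(first_line))  # MLNLNTLRQQQIPVMTEYRA
print(translate(last_line))   # LKAWRADVMGRFSG*
```

The two outputs match the first 20 and the last 14 residues of the protein sequence, with `*` standing for the terminal TGA stop codon. The last line starts in frame because the 19 preceding lines hold 60 bases each. Pasting the full gene sequence into `translate` and `gc_fraction` would allow the complete protein and the GC content field to be checked the same way; note that this sketch raises a `KeyError` on ambiguous bases such as N.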