Gene SeSA_A3298 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeSA_A3298 
Symbol 
ID6516693 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633 
KingdomBacteria 
Replicon accessionNC_011094 
Strand
Start bp3182388 
End bp3183572 
Gene Length1185 bp 
Protein Length394 aa 
Translation table11 
GC content50% 
IMG OID642748298 
Productarylsulfatase-activating protein AtsB 
Protein accessionYP_002116071 
Protein GI194735302 
COG category[R] General function prediction only 
COG ID[COG0641] Arylsulfatase regulator (Fe-S oxidoreductase) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTGAATC TCAACACATT ACGGCAACAA CAAATCCCCG TAATGACTGA ATATCGCGCG 
CAGATACCCT TTCATATTCT GGCAAAACCC ATAGGCCCTG CGTGTAATCT GGCCTGCCGC
TATTGTTATT ACCCACAGGG CGAAACGCCC GTAGAAAAAA TGAATGAATC AACGCTGGAG
GTTTTTATTT GTCGCTATAT TGCGGCGCAA CCTGCCAGTG CGCGTGAAAT TAATTTTGTC
TGGCAAGGCG GTGAACCGCT TTTAGCCGGA ATCGGTTTTT ATAAAAAGGT AATAGCGCTT
CAACAACGAT ATGCGCCTGA CGACGTGACG ATCAGTAATA GTCTGCAAAC GAATGCGACG
TTGTTAAACG ATGCCTGGTG TCGTCTGTTT CGCGACAATA ATTTTACTAT TGGCATCAGT
CTTGAGGGCA GTGAAGACTT GCAAAATTAC CATCGTCCGG GCAAACGCGG CGAGTCCAGT
TATCCGGCGG TGTTGCGGGG AATCACATTG TTACAACACT ATCGAGTCGA TTTTAACGTA
CTGATTGTCG TGCATGATGA CATGGCTCGC CATGCGGCAG CCATCTACGA TCATGTTGTT
AGCCTTGGCG CTCGTTATCT GCAATTTCAG CCACTGATGA ACGAAGGCAA CGCCCTACAG
CAACGTTACC AATTGAGTGC GGATAACTGG GGACGTTTCA TGATTGATAT CTGGCGTCAA
TGGCGCAAAC GCGGCGATAT GGGGCGGGTC TTTGTGATCA ACATTGAACA GGCATGGGCA
CAATATTTTA CGCATATCAG CGCCACCTGT GTCCATTCCG CCCGCTGCGG CACGAATCTG
GTCATGGAGC CGGACGGCAA ACTCTATGCC TGCGATCATC TGATTAATAG CCAGCATTAC
CTGGGACAGC TTTCTAATAA TATGTTAGCG CCAGCCGTAG ATACCGCAAC CCGGCTTCCC
TTTGGTATTA AGAAAAGCCA GCGCCGGGAG TGTCAACGGT GTTCTGTGAA AATAGTCTGC
CAGGGAGGCT GCCCCGCACA TATCAACAGT GCCGGCTACA ACCGACTTTG TAGCGGCTAT
TACTCTTTTT TCACTGAGAT TCTGGCTCCG CTACGCGCCT GGCCCCGGAA TCTGAATGGA
CTGAAAGCCT GGCGTGCTGA CGTTATGGGC AGATTTTCGG GCTGA
 
Protein sequence
MLNLNTLRQQ QIPVMTEYRA QIPFHILAKP IGPACNLACR YCYYPQGETP VEKMNESTLE 
VFICRYIAAQ PASAREINFV WQGGEPLLAG IGFYKKVIAL QQRYAPDDVT ISNSLQTNAT
LLNDAWCRLF RDNNFTIGIS LEGSEDLQNY HRPGKRGESS YPAVLRGITL LQHYRVDFNV
LIVVHDDMAR HAAAIYDHVV SLGARYLQFQ PLMNEGNALQ QRYQLSADNW GRFMIDIWRQ
WRKRGDMGRV FVINIEQAWA QYFTHISATC VHSARCGTNL VMEPDGKLYA CDHLINSQHY
LGQLSNNMLA PAVDTATRLP FGIKKSQRRE CQRCSVKIVC QGGCPAHINS AGYNRLCSGY
YSFFTEILAP LRAWPRNLNG LKAWRADVMG RFSG