Gene SeHA_C3365 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C3365 
Symbol 
ID6491810 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp3274761 
End bp3275945 
Gene Length1185 bp 
Protein Length394 aa 
Translation table11 
GC content50% 
IMG OID642743498 
Productarylsulfatase-activating protein AtsB 
Protein accessionYP_002047113 
Protein GI194449090 
COG category[R] General function prediction only 
COG ID[COG0641] Arylsulfatase regulator (Fe-S oxidoreductase) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones73 
Fosmid unclonability p-value0.812959 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTGAATC TCAACACATT ACGGCAACAA CAAATCCCCG TAATGACGGA ATATCGCGCG 
CAGATACCCT TTCATATTCT GGCAAAGCCC ATAGGCCCTG CATGTAATCT GGCCTGCCGC
TATTGTTATT ATCCACAGGG CGAAACGCCC GTAGAAAAAA TGAATGAATC AACGCTGGAG
ATTTTTATTT GTCGCTATAT TGCGGCGCAA CCTGCCAGTG CGCGTGAAAT TAATTTTGTC
TGGCAAGGCG GTGAACCGCT TTTAGCCGGA ATCGGTTTTT ATAAAAAGGT AATAGCGCTT
CAACAACGAT ATGCGCCTGA CGGCGTGACG ATCAGTAATA GTCTGCAAAC GAATGCGACG
TTGTTAAACG ATGCCTGGTG CCGTCTGTTT CGCGACAATA ATTTTACTAT TGGCATCAGT
CTTGAGGGCA GTGAAGACTT GCAAAATCAT CATCGTCCGG GCAAACGCGG CGAGGCCAGC
TATCCGGCGG TGTTGCGGGG AATCACATTG TTACAACACT ATCGAGTCGA TTTTAATGTA
CTGATTGTCG TGCATGATGA CATGGCTCGC CATGCGGCAG CCATCTACGA TCATGTTGTT
AGCCTTGGCG TTCGTTATCT GCAATTTCAG CCACTGATGA ACGAAGGCAA CGCCCTACAG
CAACGTTACC AATTGAGTGC GGATAACTGG GGACGTTTCA TGATTGATAT CTGGCGTCAA
TGGCGCAAAC GCGGCGATAT GGGACGGGTT TTTGTGATCA ACATTGAACA GGCATGGGCA
CAATATTTTA CGCATATCAG CGCCACCTGT GTCCATTCCG CCCGCTGCGG AACGAATCTG
GTCATGGAGC CGGACGGCAA ACTCTATGCC TGCGATCATC TGATTAATAG CCAGCATTAC
CTGGGACAGC TTGCTAATAA TACGTTAGCG CCAGCCGTAG ATTCCGCAAC CCGGCTTCCC
TTTGGTATTA AGAAAAGCCA GCGCCGGGAG TGTCAACGGT GTTCTGTGAA AATAGTCTGC
CAGGGAGGCT GCCCCGCACA TATCAACAGT GCCGGCTACA ACCGACTTTG TAGCGGCTAT
TACTCTTTTT TCACGGAGAT TCTGGCTCCG CTACGCGCCT GGCCCCGGGA TCTGAATGGA
CTGAAAGCCT GGCGTGCTGA CGTTATGGGT AGATTTTCGG GCTGA
 
Protein sequence
MLNLNTLRQQ QIPVMTEYRA QIPFHILAKP IGPACNLACR YCYYPQGETP VEKMNESTLE 
IFICRYIAAQ PASAREINFV WQGGEPLLAG IGFYKKVIAL QQRYAPDGVT ISNSLQTNAT
LLNDAWCRLF RDNNFTIGIS LEGSEDLQNH HRPGKRGEAS YPAVLRGITL LQHYRVDFNV
LIVVHDDMAR HAAAIYDHVV SLGVRYLQFQ PLMNEGNALQ QRYQLSADNW GRFMIDIWRQ
WRKRGDMGRV FVINIEQAWA QYFTHISATC VHSARCGTNL VMEPDGKLYA CDHLINSQHY
LGQLANNTLA PAVDSATRLP FGIKKSQRRE CQRCSVKIVC QGGCPAHINS AGYNRLCSGY
YSFFTEILAP LRAWPRDLNG LKAWRADVMG RFSG