Gene SeHA_C4307 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C4307 
Symbol 
ID6489644 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp4194493 
End bp4196316 
Gene Length1824 bp 
Protein Length607 aa 
Translation table11 
GC content47% 
IMG OID642744396 
Productarylsulfotransferase 
Protein accessionYP_002047990 
Protein GI194451107 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones75 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATTTA AATATGCTTT AACTTCTCTC GCATTATCTG TTGCAATTTT GTCATCAGTA 
CCTTCTACTG CTTTCGCCAT CGGCGGCGCC AGCGGCGCTA AAGTGGACTA TCAGGTCCAG
GGAAAAATTG GCGAAGTTGT TATGAACCCC TATGATATCG CGCCGCTAAC CGCCGTTATT
CGTAATGGCG GTTACCAGTT ACGTGACGTG CATGTACGGA TTGTCCCCAA AGAAAATGGC
CAGGAGATCG CGTATAAAGT TAATAATAAA TACCTTTTAA CGTATGGCGG TATCCCCGTC
TTTGGTCTTT ACCCAGATTA TGTCAATACC GTTGAAGTTG AATATACAAG GATCCAGGGT
AGTAAAACCG AAAATGTAAA AGAAAGCTAT AAAATGTATG CGCCGCCTGC CTATATTGAA
TCGGCGGGTA CAAAAGAAGA ACAATCAGCA CTCTTTACTA TCGATGTTAA AAAGGTTTCC
CCAGAATTTA AAGATCGCTT GTATCTTTTG AATAATACGA AAGATAAGTC TGGGAATGGA
ACGCGTACTG TCTGGAACAA CCCTACTGGG GGTGCATTAG AATGGAACTT CACTACAGCT
AACGCTATTA TCGACACCTC CGGTGATATT CGTTGGTTTA TGAATCCAAG TTCAATTTAT
GATTTAAAGT CAATTTATCG TGCTGGCGTT ATGATGGGCT TTAAACAAAA CCAGGATGGC
GCACTATCGT GGGGCTACGG TCAGCGTTAT GTGAAATACG ATATCATGGG GCGTGAAATC
TTTAACCGCC GTCTGCCGGA TAATTATAAC GATTTTTCAC ACTCAATGGA TAACGCGGCC
AACGGTCACT ACTTCCTGCG TGTAGCCAGC TCTAACTATA AACGCCCTGA TGGGAAAAAT
GTTCGTACCG TGCGTGATGT GATTGCCGAA GTTGATCAGA ACGGCGTGGT AGTGGATGAA
TGGCGTCTGT TTGATATCCT CGATCCTTAT CGTGATGTGA TAATGAAAAC CCTCGATCAG
GGCGCTGTGT GCCTGAATAT CGACGCCAGC CAGTCCGGCC ATACGTTGAG CGAAGAAGAT
CTGGCGGCGC TGGACTCCTC CGACAAATTC GGGGATATCG TGGGTAGTGG GGCTGGCCGC
AACTGGGCGC ATGTCAACAG CGTCGACTAT GACAGTGAAG ATGATTCCAT CATCATCAGC
TCCCGCCACC AGAGTGCGAT TATCAAAATC GGCCGCGATA AGAAAGTGAA GTGGATACTG
GGTACGCCTG CTGGCTGGAA AGCGCCATTT AATGCCGCAA TTCTGACGCC AGTGGATAGC
AAAGGCCAAA AAATTGCTTG CCAGGACAGT GGCTGCGAGG GTGACTTCGA CTGGACATGG
ACGCAACATA CGGCCTTTAA AATTGATAGT AAGAGTAAAG GCGATATCTT ATACCTTTCC
GCTTTCGACA ATGGCGATGG CCGCGGCTTA GAACAGCCTG CTATGCAGAG TATGAAATAC
AGCCGCTCCG TGATTTACAA AATCGACCAG AAAAACAAGA CAGTCCAACA GATCTGGCAA
TACGGTAAAG AGCGCGGGAA CGAGTGGTTT AGCCCGGTAA CCTCTATCAC CGAGTACCAG
ACTGACAAGA ATTCTGTGTT CGTGTATTCC GCAACAGCAG GTGGTGCGTT TGATTTGTCG
GTAGGCGCAT TTACCAGCTT GCCTAATCCG TATCTGGAAG AGTTCAAATG GGGAGAAAAA
GAGCCTGCGG TCGAAATGCA AATACATGGT GCGCGTGGAT ATCAGGCTAT GCCATTTAGC
CTGACCAAAG CGCTTACTGA GTAG
 
Protein sequence
MKFKYALTSL ALSVAILSSV PSTAFAIGGA SGAKVDYQVQ GKIGEVVMNP YDIAPLTAVI 
RNGGYQLRDV HVRIVPKENG QEIAYKVNNK YLLTYGGIPV FGLYPDYVNT VEVEYTRIQG
SKTENVKESY KMYAPPAYIE SAGTKEEQSA LFTIDVKKVS PEFKDRLYLL NNTKDKSGNG
TRTVWNNPTG GALEWNFTTA NAIIDTSGDI RWFMNPSSIY DLKSIYRAGV MMGFKQNQDG
ALSWGYGQRY VKYDIMGREI FNRRLPDNYN DFSHSMDNAA NGHYFLRVAS SNYKRPDGKN
VRTVRDVIAE VDQNGVVVDE WRLFDILDPY RDVIMKTLDQ GAVCLNIDAS QSGHTLSEED
LAALDSSDKF GDIVGSGAGR NWAHVNSVDY DSEDDSIIIS SRHQSAIIKI GRDKKVKWIL
GTPAGWKAPF NAAILTPVDS KGQKIACQDS GCEGDFDWTW TQHTAFKIDS KSKGDILYLS
AFDNGDGRGL EQPAMQSMKY SRSVIYKIDQ KNKTVQQIWQ YGKERGNEWF SPVTSITEYQ
TDKNSVFVYS ATAGGAFDLS VGAFTSLPNP YLEEFKWGEK EPAVEMQIHG ARGYQAMPFS
LTKALTE