Gene SeD_A4353 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A4353 
Symbol 
ID6874492 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp4199474 
End bp4200769 
Gene Length1296 bp 
Protein Length431 aa 
Translation table11 
GC content54% 
IMG OID642787278 
Productchondroitin sulfate/heparin utilization regulation protein 
Protein accessionYP_002217894 
Protein GI198242498 
COG category[R] General function prediction only 
COG ID[COG0641] Arylsulfatase regulator (Fe-S oxidoreductase) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones70 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGGTAG CAGGTTGTCA CGTAATGGCC AAACCCGGCG GCGCGATCTG CAATATTGAT 
TGCACATATT GCTTCTATCT TGAAAAAGAG GCGCTGTACC CGGAACGCAA TAAAAACTGG
CGGATGTCGG ACGAGACGCT GGAACAATTT ATACGCCAGC ATATTGCCGC GCAGAGTGGC
GACCGCATTG ACTTTGCCTG GCAGGGCGGC GAACCGACCA TGATGGGGCT ACCGTTTTTC
CGCCGGGTTG TCACATTATG TGAAAAGTAC GGCGATGGGC GAAAAATCAC TCATGCGTTG
CAGACGAACG GCATCCTGGT GAATGACGAG TGGGCGCGCT TTTTCGCTGA ACAGCATTTT
CTCATCGGTC TCTCTATCGA CGGCCCGGCG TCGTTACACA ACCACTATCG GCTTAATCGC
GCTGGAAAAG GAACTCATGA ACAGGTCGTC GCAGCAATGG CGCGGCTTAA AGCGCACCAT
GTCGACTTTA ATACCTTAAC CGTCGTGGGA AAACATAACG TCGGTCATGC AGCAGACGTC
TACGAATTTC TTCTGGCGGC GGGATCGCGT TTTATTCAGT TTATCCCGCT GGTGGAGCGA
ATGAGCACCG ATAACTCATC GGTACTTAAT CTGGTGATGC CCGGCGAAAG CGCGGCGACG
CTGGCGCCAT GGACGGTACC GTCGTGGCAA TATGGCGAAT TTCTCAACCA GATCTTTGAT
ATCTGGGTTC GTCGCGACGT AGACCGCGTC TATGTGCAGA TGTTTGACGT GGCGTTAGCC
GCCTGGACGG CGCAGCAGCC GGTACTGTGT GTACATTCCG AGACTTGTGG ACATGCCTTC
GCGTTGGAGT CGAACGGCGA TCTCTACAAC TGCGACCACT TTGTCTACCC GGAGCATCTG
CTGGGAAATA TCCACCAGCA CAGCATCAAA GCCTTAAATA ATAGCGAGCG GGCTATTGCG
TTTGGCGAGG CCAAGCGGGA GACCCTGACC GCCGATTGTC GTCGCTGTGA CTACCGCTTT
GCGTGTCATG GCGGCTGTCC GAAGCATCGC TTTGCCGTCT CGCCGTCCGG TCATCCTGCG
CATAATTACT TGTGTGCGGG CTATAAGCAT TTTTTCCAGC ACGTTACGCC GTATATGAAT
GTCTGGCGGG AGCTGCTGGC GCAAGGCTAT CCGATGGCAT CGATCATGCG CTGGCTGGCG
CAGGACGCGC GTAAAGACAC AGGAGCCGTC AGTCGTAACC ATCTCTGTCC CTGTGGCAGC
GGCAAAAAAT ATAAAAAATG CTGTGGTAAA GCATAG
 
Protein sequence
MAVAGCHVMA KPGGAICNID CTYCFYLEKE ALYPERNKNW RMSDETLEQF IRQHIAAQSG 
DRIDFAWQGG EPTMMGLPFF RRVVTLCEKY GDGRKITHAL QTNGILVNDE WARFFAEQHF
LIGLSIDGPA SLHNHYRLNR AGKGTHEQVV AAMARLKAHH VDFNTLTVVG KHNVGHAADV
YEFLLAAGSR FIQFIPLVER MSTDNSSVLN LVMPGESAAT LAPWTVPSWQ YGEFLNQIFD
IWVRRDVDRV YVQMFDVALA AWTAQQPVLC VHSETCGHAF ALESNGDLYN CDHFVYPEHL
LGNIHQHSIK ALNNSERAIA FGEAKRETLT ADCRRCDYRF ACHGGCPKHR FAVSPSGHPA
HNYLCAGYKH FFQHVTPYMN VWRELLAQGY PMASIMRWLA QDARKDTGAV SRNHLCPCGS
GKKYKKCCGK A