Gene SeD_A2069 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A2069 
Symbol 
ID6875225 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp2000566 
End bp2001762 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content50% 
IMG OID642785182 
Productchondroitin sulfate/heparin utilization regulation protein 
Protein accessionYP_002215848 
Protein GI198243291 
COG category[R] General function prediction only 
COG ID[COG0641] Arylsulfatase regulator (Fe-S oxidoreductase) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value0.000256382 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGCATAG TATTCAATAC CGTAGCCAAG CCCAGCGGCA GTCTGTGTAA CTTATCCTGC 
AAGTACTGTT TCTATCTTGA TAAACCCCGG GGGCAGCGCG TCATGTCTGA CGATGTGCTG
GAGACGTATA TCCGCCGGGT AATTGATGAT ACGCCATCCT CAGAGGTCTC GTTTTGCTGG
CAGGGGGGAG AGCCGACGCT ATGCGGTCTT TCTTTTTACC AAAAAGTGGT GCGCTTGCAG
CAACGCTATG CCAACGGCAA AACTATCTAC AACAGTCTGC AAACCAATGG CGTATTAATC
AATGAAGAGT GGGCGGCTTT CTTTGCGCAG CACCAGTTCC TGATTGGTAT ATCGATTGAT
GGGCCGCAAG TCGTTCATGA TAATTACCGG AAAACGCCGT CAGGGCGGGC GTCTTTTTCC
CGAGTCGTTA ATGCTATCCG CCTTCTGCAG GCAAATGATG TCGAGTTCAA CACGCTCACT
GTCGTGAATG ATGCGTCATG CCGTCATGGC AACGCTATTT ATCATTTTTT GACGCAGGAA
CTGGAAAGTA AACACCTGCA ATTTATTCCC ATTGTTGAGC CGCTCGCGCA AAAAGCGCAG
CGTTCTTTGA CGTTATCTGA CAATGAGGAT TCGCCTTCGC TGATGCCCTT TTCCGTCACG
CCTGAAGGGT GGGGCGCCTT TATGTGCGAT GTTTTTGATC AATGGATACG TCACGATGTC
GGACGCATAT TCGTACAGCT TTTTGATAAC TTACTTGGCG TCTGGATGGG GGAGCCCGCC
ACGCTTTGTA CGATGCAGTC GACCTGCGGG CAAAGTTTGC TGGTGGAGCA GAATGGCGAC
GTGTTTAGCT GTGACCATTT TGTTTTTCCC GCCTATAAAC TGGGCAATCT GCAGCAACAC
TCTTTAGAAG AAATGGCGGC CTCTCCTTTT CAGCAGCAGT TTGGCGCGGC TAAAGCAAAC
CTTTCCTCAC GCTGCCAGAA CTGTACGTGG CGCTTTGCCT GTCACGGCGG TTGTCCGAAA
CATCGAATTT GTATGGACGG CGGCGAACGG CAAAATTATC TCTGTAAAGG ATATCTGGAG
TTCTTTCAAC ATGTGACGCC CTATATGAAT GTGATGCGTC AATTATTACT AAATCAGCGA
CCCGCCGCGC ATATTACCCG CATCGTCGAC ATGATTGCGG ATGACGTTCG TCAGTGA
 
Protein sequence
MSIVFNTVAK PSGSLCNLSC KYCFYLDKPR GQRVMSDDVL ETYIRRVIDD TPSSEVSFCW 
QGGEPTLCGL SFYQKVVRLQ QRYANGKTIY NSLQTNGVLI NEEWAAFFAQ HQFLIGISID
GPQVVHDNYR KTPSGRASFS RVVNAIRLLQ ANDVEFNTLT VVNDASCRHG NAIYHFLTQE
LESKHLQFIP IVEPLAQKAQ RSLTLSDNED SPSLMPFSVT PEGWGAFMCD VFDQWIRHDV
GRIFVQLFDN LLGVWMGEPA TLCTMQSTCG QSLLVEQNGD VFSCDHFVFP AYKLGNLQQH
SLEEMAASPF QQQFGAAKAN LSSRCQNCTW RFACHGGCPK HRICMDGGER QNYLCKGYLE
FFQHVTPYMN VMRQLLLNQR PAAHITRIVD MIADDVRQ