Gene SeD_A0039 details


Gene Information

Locus tag: SeD_A0039
Symbol:
ID: 6873165
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 41721
End bp: 42911
Gene Length: 1191 bp
Protein Length: 396 aa
Translation table: 11
GC content: 50%
IMG OID: 642783297
Product: chondroitin sulfate/heparin utilization regulation protein
Protein accession: YP_002213991
Protein GI: 198243215
COG category: [R] General function prediction only
COG ID: [COG0641] Arylsulfatase regulator (Fe-S oxidoreductase)
TIGRFAM ID:
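
The coordinate and length fields in this record are mutually consistent, which a short check can confirm. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon. The function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS ending in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(41721, 42911)
print(length, protein_length(length))  # prints "1191 396"
```

Both values agree with the Gene Length and Protein Length fields.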


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.499783
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 45
Fosmid unclonability p-value: 0.00726967
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGATGTTTG GGAAAAGTTG TCAGGTCATG GTTAAACCAA CCGGATCGGT GTGTAACCTT 
GACTGTAAGT ACTGTTTTTA TCTGGAGAAA GAAAAGCTCT ATCCGGATCG AAAAAACCAT
TACAAAATGT CGGAAGAGAC CCTCGAACTC TTCATCAGGC AGCAGATTGC CGCACAGGAT
ATTGATGAGG TCATTTTTGC GTGGCAGGGC GGGGAACCCA CATTAATGGG CATCCCGTTT
TATCGTAAAG CCGTTGAATT TCAGCAGCGC TATTGTGGCG GCAAAACCAT CGTCAATACC
TTCCAGACCA ACGGCATCCT GATCAACGAT GACTGGGCGA CCTTCTTCCG GGAGCATGAT
TTTCTGGTTG GCGTCTCTAT TGATGGCGAT GCCGCGTTAC ACGATGAATG GCGAGTGACG
CGCTCCGGAA AGCCGACGCA TGAAAAAGTA GAAAATGCGG TGCGTTGTCT GGCGCAGCAC
GACGTAGAAT TTAATACCCT CACGGTGGTT AACCGTACCA ATATGCATCA TCCTGTTCAG
GTCTATCGCT ACCTGAAAAG CATTGGTAGC CGCTATATGC AATTTATCCC TTTAGTTGAA
CGCTGCGGGG AAAATGGACT GGCGCAGCCG CAGGATAAAC ATATCGCGAT GACGCCGTGG
TCGGTCGATA GCCTGCAATT TGGTCAGTTT CTGAATGCGG TATTTGATAT CTGGATCCGT
GAGGATATCG GCGATATCGG CATTCAGCTA TTTGAACAGA CGCTGGCGGC CTGGTGCGGC
CTGCCGCCGC AGGTTTGCGT TTTTGCGCCC ACCTGCGGCA GCGCGTTTGC GATGGAAATG
AACGGCGATG TTTATAACTG CGATCACTTC GTATATCCGC AATTTAAACT GGGGAATATC
CACCAGAAGA CGCTGCGTCA AATGAATCAG GGCGAACAAA ATCGCCAGTT CGGCAGCGAT
AAACAGCGTT CAATGGCGCA GGAGTGTCAT CGCTGTCAAT GGAAGTTCGC CTGCTATGGC
GGCTGTCCGA AACATCGTTT TTTACCCTCT GCGTCAGGCG CAACCAATCA TAACTATCTG
TGTGCAGGTT ATCAGGCTTT TTTCTCGCAT ACCGCGACGG CGATGAGTGC CATGCGAACC
CTGTATGAAA AAGGCATCTC ACCTGCAGAA ATAAAGTCAA TATTTGTTTG A
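
The gene sequence is displayed in blocks of ten bases, so it has to be joined before it can be measured. The following is a minimal sketch of how the blocks could be cleaned up and the GC fraction computed. The helper names are hypothetical, and only the first displayed line is used as input here; the full sequence would be needed to compare against the GC content field.

```python
def clean_sequence(text: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence in this record, as displayed.
first_line = "ATGATGTTTG GGAAAAGTTG TCAGGTCATG GTTAAACCAA CCGGATCGGT GTGTAACCTT"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```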
 
Protein sequence
MMFGKSCQVM VKPTGSVCNL DCKYCFYLEK EKLYPDRKNH YKMSEETLEL FIRQQIAAQD 
IDEVIFAWQG GEPTLMGIPF YRKAVEFQQR YCGGKTIVNT FQTNGILIND DWATFFREHD
FLVGVSIDGD AALHDEWRVT RSGKPTHEKV ENAVRCLAQH DVEFNTLTVV NRTNMHHPVQ
VYRYLKSIGS RYMQFIPLVE RCGENGLAQP QDKHIAMTPW SVDSLQFGQF LNAVFDIWIR
EDIGDIGIQL FEQTLAAWCG LPPQVCVFAP TCGSAFAMEM NGDVYNCDHF VYPQFKLGNI
HQKTLRQMNQ GEQNRQFGSD KQRSMAQECH RCQWKFACYG GCPKHRFLPS ASGATNHNYL
CAGYQAFFSH TATAMSAMRT LYEKGISPAE IKSIFV
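
The protein sequence follows from the gene sequence under translation table 11. Table 11 assigns the same amino acids to codons as the standard code and differs from it only in the permitted start codons, which does not matter here because this gene starts with ATG. The sketch below builds the codon table by hand and translates the first displayed line of the gene sequence. The `translate` helper is illustrative and is not the database's own tooling.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (third base varies fastest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_line = "ATGATGTTTG GGAAAAGTTG TCAGGTCATG GTTAAACCAA CCGGATCGGT GTGTAACCTT"
print(translate(first_line))  # prints "MMFGKSCQVMVKPTGSVCNL"
```

The output matches the first twenty residues of the protein sequence shown in this record.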