Gene Information |
Locus tag | SeD_A0039 |
Symbol | |
ID | 6873165 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | + |
Start bp | 41721 |
End bp | 42911 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 642783297 |
Product | chondroitin sulfate/heparin utilization regulation protein |
Protein accession | YP_002213991 |
Protein GI | 198243215 |
COG category | [R] General function prediction only |
COG ID | [COG0641] Arylsulfatase regulator (Fe-S oxidoreductase) |
TIGRFAM ID | |
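The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the record itself. It assumes the start and end coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 41721
end_bp = 42911

# Assumption: coordinates are 1-based and inclusive,
# so both end points count toward the length.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the
# final codon is a stop codon that is not translated.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1191 0 396
```

Under these assumptions the result agrees with the listed Gene Length (1191 bp) and Protein Length (396 aa).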
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.499783 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 0.00726967 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGATGTTTG GGAAAAGTTG TCAGGTCATG GTTAAACCAA CCGGATCGGT GTGTAACCTT GACTGTAAGT ACTGTTTTTA TCTGGAGAAA GAAAAGCTCT ATCCGGATCG AAAAAACCAT TACAAAATGT CGGAAGAGAC CCTCGAACTC TTCATCAGGC AGCAGATTGC CGCACAGGAT ATTGATGAGG TCATTTTTGC GTGGCAGGGC GGGGAACCCA CATTAATGGG CATCCCGTTT TATCGTAAAG CCGTTGAATT TCAGCAGCGC TATTGTGGCG GCAAAACCAT CGTCAATACC TTCCAGACCA ACGGCATCCT GATCAACGAT GACTGGGCGA CCTTCTTCCG GGAGCATGAT TTTCTGGTTG GCGTCTCTAT TGATGGCGAT GCCGCGTTAC ACGATGAATG GCGAGTGACG CGCTCCGGAA AGCCGACGCA TGAAAAAGTA GAAAATGCGG TGCGTTGTCT GGCGCAGCAC GACGTAGAAT TTAATACCCT CACGGTGGTT AACCGTACCA ATATGCATCA TCCTGTTCAG GTCTATCGCT ACCTGAAAAG CATTGGTAGC CGCTATATGC AATTTATCCC TTTAGTTGAA CGCTGCGGGG AAAATGGACT GGCGCAGCCG CAGGATAAAC ATATCGCGAT GACGCCGTGG TCGGTCGATA GCCTGCAATT TGGTCAGTTT CTGAATGCGG TATTTGATAT CTGGATCCGT GAGGATATCG GCGATATCGG CATTCAGCTA TTTGAACAGA CGCTGGCGGC CTGGTGCGGC CTGCCGCCGC AGGTTTGCGT TTTTGCGCCC ACCTGCGGCA GCGCGTTTGC GATGGAAATG AACGGCGATG TTTATAACTG CGATCACTTC GTATATCCGC AATTTAAACT GGGGAATATC CACCAGAAGA CGCTGCGTCA AATGAATCAG GGCGAACAAA ATCGCCAGTT CGGCAGCGAT AAACAGCGTT CAATGGCGCA GGAGTGTCAT CGCTGTCAAT GGAAGTTCGC CTGCTATGGC GGCTGTCCGA AACATCGTTT TTTACCCTCT GCGTCAGGCG CAACCAATCA TAACTATCTG TGTGCAGGTT ATCAGGCTTT TTTCTCGCAT ACCGCGACGG CGATGAGTGC CATGCGAACC CTGTATGAAA AAGGCATCTC ACCTGCAGAA ATAAAGTCAA TATTTGTTTG A
|
Protein sequence | MMFGKSCQVM VKPTGSVCNL DCKYCFYLEK EKLYPDRKNH YKMSEETLEL FIRQQIAAQD IDEVIFAWQG GEPTLMGIPF YRKAVEFQQR YCGGKTIVNT FQTNGILIND DWATFFREHD FLVGVSIDGD AALHDEWRVT RSGKPTHEKV ENAVRCLAQH DVEFNTLTVV NRTNMHHPVQ VYRYLKSIGS RYMQFIPLVE RCGENGLAQP QDKHIAMTPW SVDSLQFGQF LNAVFDIWIR EDIGDIGIQL FEQTLAAWCG LPPQVCVFAP TCGSAFAMEM NGDVYNCDHF VYPQFKLGNI HQKTLRQMNQ GEQNRQFGSD KQRSMAQECH RCQWKFACYG GCPKHRFLPS ASGATNHNYL CAGYQAFFSH TATAMSAMRT LYEKGISPAE IKSIFV
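The sequences above are displayed in space-separated blocks of 10 characters, so the spaces have to be removed before any analysis. The sketch below is one possible way to do that, and to translate codons and compute a GC fraction, using only the Python standard library. The function names (`clean`, `translate`, `gc_fraction`) are hypothetical helpers, not part of any database tool. The amino acid assignments of translation table 11 are the same as those of the standard code, which is what the lookup string encodes. Only the first 30 and last 21 bases of the gene sequence are used here.

```python
# Codon lookup in the conventional T, C, A, G base order.
# Translation table 11 uses the same amino acid assignments
# as the standard genetic code; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)


def clean(seq):
    """Remove the display spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a DNA string codon by codon, starting at its first base."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        b1, b2, b3 = (BASES.index(base) for base in seq[i:i + 3])
        protein.append(AMINO_ACIDS[16 * b1 + 4 * b2 + b3])
    return "".join(protein)


def gc_fraction(seq):
    """Return the fraction of G and C bases in a DNA string."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three and last three display blocks of the gene sequence.
head = "ATGATGTTTG GGAAAAGTTG TCAGGTCATG"
tail = "ATAAAGTCAA TATTTGTTTG A"

print(translate(head))  # MMFGKSCQVM
print(translate(tail))  # IKSIFV*
```

The two translated fragments match the start and end of the listed protein sequence, with the final TGA codon read as a stop. Applying `gc_fraction` to the full gene sequence would be the way to compare against the listed GC content.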
|
| |