Gene Information |
Locus tag | SeD_A2069 |
Symbol | |
ID | 6875225 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | - |
Start bp | 2000566 |
End bp | 2001762 |
Gene Length | 1197 bp |
Protein Length | 398 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 642785182 |
Product | chondroitin sulfate/heparin utilization regulation protein |
Protein accession | YP_002215848 |
Protein GI | 198243291 |
COG category | [R] General function prediction only |
COG ID | [COG0641] Arylsulfatase regulator (Fe-S oxidoreductase) |
TIGRFAM ID | |
|
|
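The coordinate and length fields above can be cross-checked against each other. The sketch below is a minimal consistency check that assumes the Start bp and End bp values are 1-based and inclusive (an assumption, though it agrees with the listed Gene Length), and that the final codon of this unspliced CDS is a stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 2000566
end_bp = 2001762
gene_length_bp = 1197
protein_length_aa = 398

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons; the last codon is the stop.
codons, remainder = divmod(span_bp, 3)
expected_protein_aa = codons - 1

print(span_bp, codons, remainder, expected_protein_aa)
```

Under these assumptions the span matches the listed Gene Length, and the codon count minus the stop codon matches the listed Protein Length.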
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 0.000256382 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGCATAG TATTCAATAC CGTAGCCAAG CCCAGCGGCA GTCTGTGTAA CTTATCCTGC AAGTACTGTT TCTATCTTGA TAAACCCCGG GGGCAGCGCG TCATGTCTGA CGATGTGCTG GAGACGTATA TCCGCCGGGT AATTGATGAT ACGCCATCCT CAGAGGTCTC GTTTTGCTGG CAGGGGGGAG AGCCGACGCT ATGCGGTCTT TCTTTTTACC AAAAAGTGGT GCGCTTGCAG CAACGCTATG CCAACGGCAA AACTATCTAC AACAGTCTGC AAACCAATGG CGTATTAATC AATGAAGAGT GGGCGGCTTT CTTTGCGCAG CACCAGTTCC TGATTGGTAT ATCGATTGAT GGGCCGCAAG TCGTTCATGA TAATTACCGG AAAACGCCGT CAGGGCGGGC GTCTTTTTCC CGAGTCGTTA ATGCTATCCG CCTTCTGCAG GCAAATGATG TCGAGTTCAA CACGCTCACT GTCGTGAATG ATGCGTCATG CCGTCATGGC AACGCTATTT ATCATTTTTT GACGCAGGAA CTGGAAAGTA AACACCTGCA ATTTATTCCC ATTGTTGAGC CGCTCGCGCA AAAAGCGCAG CGTTCTTTGA CGTTATCTGA CAATGAGGAT TCGCCTTCGC TGATGCCCTT TTCCGTCACG CCTGAAGGGT GGGGCGCCTT TATGTGCGAT GTTTTTGATC AATGGATACG TCACGATGTC GGACGCATAT TCGTACAGCT TTTTGATAAC TTACTTGGCG TCTGGATGGG GGAGCCCGCC ACGCTTTGTA CGATGCAGTC GACCTGCGGG CAAAGTTTGC TGGTGGAGCA GAATGGCGAC GTGTTTAGCT GTGACCATTT TGTTTTTCCC GCCTATAAAC TGGGCAATCT GCAGCAACAC TCTTTAGAAG AAATGGCGGC CTCTCCTTTT CAGCAGCAGT TTGGCGCGGC TAAAGCAAAC CTTTCCTCAC GCTGCCAGAA CTGTACGTGG CGCTTTGCCT GTCACGGCGG TTGTCCGAAA CATCGAATTT GTATGGACGG CGGCGAACGG CAAAATTATC TCTGTAAAGG ATATCTGGAG TTCTTTCAAC ATGTGACGCC CTATATGAAT GTGATGCGTC AATTATTACT AAATCAGCGA CCCGCCGCGC ATATTACCCG CATCGTCGAC ATGATTGCGG ATGACGTTCG TCAGTGA
|
Protein sequence | MSIVFNTVAK PSGSLCNLSC KYCFYLDKPR GQRVMSDDVL ETYIRRVIDD TPSSEVSFCW QGGEPTLCGL SFYQKVVRLQ QRYANGKTIY NSLQTNGVLI NEEWAAFFAQ HQFLIGISID GPQVVHDNYR KTPSGRASFS RVVNAIRLLQ ANDVEFNTLT VVNDASCRHG NAIYHFLTQE LESKHLQFIP IVEPLAQKAQ RSLTLSDNED SPSLMPFSVT PEGWGAFMCD VFDQWIRHDV GRIFVQLFDN LLGVWMGEPA TLCTMQSTCG QSLLVEQNGD VFSCDHFVFP AYKLGNLQQH SLEEMAASPF QQQFGAAKAN LSSRCQNCTW RFACHGGCPK HRICMDGGER QNYLCKGYLE FFQHVTPYMN VMRQLLLNQR PAAHITRIVD MIADDVRQ
|
| |
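The relationship between the Gene sequence and Protein sequence fields can be illustrated with a short translation sketch. Although the Strand field is "-", the gene sequence as listed begins with ATG and ends with TGA, so it appears to be given in coding orientation already; the code below assumes that and does not reverse-complement. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code (it differs in the permitted start codons), so a standard table is used here. The function names `clean`, `translate`, and `gc_fraction` are hypothetical helpers, and only the first 30 bases of the record are used as input; paste the full Gene sequence to check the whole protein or the GC content field.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...); "*" marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces used to group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-orientation DNA sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three ten-base blocks of the Gene sequence field above.
gene_prefix = "ATGAGCATAG TATTCAATAC CGTAGCCAAG"

print(translate(gene_prefix))    # compare with the start of the Protein sequence field
print(gc_fraction(gene_prefix))  # GC fraction of this prefix only, not the whole gene
```

The translated prefix can be compared with the first ten residues of the Protein sequence field (MSIVFNTVAK).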