Gene Information |
Locus tag | SeD_A4353 |
Symbol | |
ID | 6874492 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | + |
Start bp | 4199474 |
End bp | 4200769 |
Gene Length | 1296 bp |
Protein Length | 431 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 642787278 |
Product | chondroitin sulfate/heparin utilization regulation protein |
Protein accession | YP_002217894 |
Protein GI | 198242498 |
COG category | [R] General function prediction only |
COG ID | [COG0641] Arylsulfatase regulator (Fe-S oxidoreductase) |
TIGRFAM ID | |
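The coordinate and length fields in this table can be cross-checked against each other. The short sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the last codon of the CDS is a stop codon; neither assumption is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 4199474
end_bp = 4200769
protein_length_aa = 431

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# One codon is three bases. Assumption: the final codon is a stop codon
# and therefore does not contribute an amino acid.
codons = gene_length_bp // 3
expected_protein_length = codons - 1

print("gene length (bp):", gene_length_bp)
print("length is a multiple of 3:", gene_length_bp % 3 == 0)
print("matches listed protein length:", expected_protein_length == protein_length_aa)
```

Under these assumptions the listed gene length of 1296 bp and protein length of 431 aa agree with the start and end coordinates.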
| |
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 70 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGGTAG CAGGTTGTCA CGTAATGGCC AAACCCGGCG GCGCGATCTG CAATATTGAT TGCACATATT GCTTCTATCT TGAAAAAGAG GCGCTGTACC CGGAACGCAA TAAAAACTGG CGGATGTCGG ACGAGACGCT GGAACAATTT ATACGCCAGC ATATTGCCGC GCAGAGTGGC GACCGCATTG ACTTTGCCTG GCAGGGCGGC GAACCGACCA TGATGGGGCT ACCGTTTTTC CGCCGGGTTG TCACATTATG TGAAAAGTAC GGCGATGGGC GAAAAATCAC TCATGCGTTG CAGACGAACG GCATCCTGGT GAATGACGAG TGGGCGCGCT TTTTCGCTGA ACAGCATTTT CTCATCGGTC TCTCTATCGA CGGCCCGGCG TCGTTACACA ACCACTATCG GCTTAATCGC GCTGGAAAAG GAACTCATGA ACAGGTCGTC GCAGCAATGG CGCGGCTTAA AGCGCACCAT GTCGACTTTA ATACCTTAAC CGTCGTGGGA AAACATAACG TCGGTCATGC AGCAGACGTC TACGAATTTC TTCTGGCGGC GGGATCGCGT TTTATTCAGT TTATCCCGCT GGTGGAGCGA ATGAGCACCG ATAACTCATC GGTACTTAAT CTGGTGATGC CCGGCGAAAG CGCGGCGACG CTGGCGCCAT GGACGGTACC GTCGTGGCAA TATGGCGAAT TTCTCAACCA GATCTTTGAT ATCTGGGTTC GTCGCGACGT AGACCGCGTC TATGTGCAGA TGTTTGACGT GGCGTTAGCC GCCTGGACGG CGCAGCAGCC GGTACTGTGT GTACATTCCG AGACTTGTGG ACATGCCTTC GCGTTGGAGT CGAACGGCGA TCTCTACAAC TGCGACCACT TTGTCTACCC GGAGCATCTG CTGGGAAATA TCCACCAGCA CAGCATCAAA GCCTTAAATA ATAGCGAGCG GGCTATTGCG TTTGGCGAGG CCAAGCGGGA GACCCTGACC GCCGATTGTC GTCGCTGTGA CTACCGCTTT GCGTGTCATG GCGGCTGTCC GAAGCATCGC TTTGCCGTCT CGCCGTCCGG TCATCCTGCG CATAATTACT TGTGTGCGGG CTATAAGCAT TTTTTCCAGC ACGTTACGCC GTATATGAAT GTCTGGCGGG AGCTGCTGGC GCAAGGCTAT CCGATGGCAT CGATCATGCG CTGGCTGGCG CAGGACGCGC GTAAAGACAC AGGAGCCGTC AGTCGTAACC ATCTCTGTCC CTGTGGCAGC GGCAAAAAAT ATAAAAAATG CTGTGGTAAA GCATAG
|
Protein sequence | MAVAGCHVMA KPGGAICNID CTYCFYLEKE ALYPERNKNW RMSDETLEQF IRQHIAAQSG DRIDFAWQGG EPTMMGLPFF RRVVTLCEKY GDGRKITHAL QTNGILVNDE WARFFAEQHF LIGLSIDGPA SLHNHYRLNR AGKGTHEQVV AAMARLKAHH VDFNTLTVVG KHNVGHAADV YEFLLAAGSR FIQFIPLVER MSTDNSSVLN LVMPGESAAT LAPWTVPSWQ YGEFLNQIFD IWVRRDVDRV YVQMFDVALA AWTAQQPVLC VHSETCGHAF ALESNGDLYN CDHFVYPEHL LGNIHQHSIK ALNNSERAIA FGEAKRETLT ADCRRCDYRF ACHGGCPKHR FAVSPSGHPA HNYLCAGYKH FFQHVTPYMN VWRELLAQGY PMASIMRWLA QDARKDTGAV SRNHLCPCGS GKKYKKCCGK A
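The sequences in this record are printed in space-separated blocks of ten residues, so the spaces need to be removed before any analysis. The following is a minimal sketch of that cleanup plus a GC-fraction calculation; the function names `normalize` and `gc_fraction` are hypothetical helpers, and the sample string is only the first two blocks of the gene sequence, not the full 1296 bp.

```python
def normalize(seq: str) -> str:
    """Remove the whitespace that groups residues into blocks of ten."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    seq = normalize(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two blocks of the gene sequence in this record, used as a short sample.
sample = "ATGGCGGTAG CAGGTTGTCA"

print(len(normalize(sample)))
print(gc_fraction(sample))
```

Passing the full gene sequence to `gc_fraction` instead of the sample should give a value comparable to the GC content field in the Gene Information table, assuming that field was computed over the same span.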