Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeAg_B1888 |
Symbol | |
ID | 6793063 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Agona str. SL483 |
Kingdom | Bacteria |
Replicon accession | NC_011149 |
Strand | - |
Start bp | 1846858 |
End bp | 1848054 |
Gene Length | 1197 bp |
Protein Length | 398 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 642776118 |
Product | chondroitin sulfate/heparin utilization regulation protein |
Protein accession | YP_002146752 |
Protein GI | 197249407 |
COG category | [R] General function prediction only |
COG ID | [COG0641] Arylsulfatase regulator (Fe-S oxidoreductase) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.422634 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCATAG TATTCAATAC CGTAGCCAAG CCCAGCGGCA GTCTGTGTAA CTTATCCTGC AAGTACTGTT TCTATCTTGA TAAACCCCGG GGCCAGCGCG TCATGTCTGA CGATGTGCTG GAGACGTATA TCCGCCGGGT AATTGATGAT ACGCCATCAT CAGAGGTCTC GTTTTGTTGG CAGGGGGGAG AGCCGACGCT ATGCGGTCTT TCTTTTTACC AAAAAGTGGT GTGCTTGCAG CAACGCTATG CCAACGGCAA AACTATTTAC AACAGTCTGC AAACCAATGG CGTATTAATC AATGAAGAGT GGGCGGCTTT CTTTGCGCAG CACCAGTTCC TGATTGGTAT ATCGATTGAT GGGCCGCAAG TCGTTCATGA TAATTACCGG AGAACGCCGT CAGGGCGGGC GTCTTTTTCC CGAGTCGTGA ATGCTATCCG CCTTCTGCAG GCAAATGATG TCGAGTTCAA TACGCTCACT GTCGTGAATG ATGCATCATG CCGTCATGGC AACGCTATTT ATCATTTTTT GACGCAGGAA CTGGAAAGTA AACACCTGCA ATTTATTCCC ATTGTTGAGC CGCTCGCGCA AAAAGCGCAG CGTTCTTTGA CGTTATCTGA CAATGAAGAT TCGCCTTCGC TGATGCCCTT TTCCGTCACG CCTGAAGGGT GGGGCGCCTT TATGTGCGAT GTTTTTGATC AATGGATACG TCACGATGTC GGACGCATAT TCGTACAGCT TTTTGACAAC TTACTTGGCG TCTGGATGGG GGAGCCCGCC ACGCTTTGTA CGATGCAGTC GACCTGCGGG CAAAGTTTGC TGGTGGAGCA GAATGGCGAC GTGTTTAGCT GCGACCATTT TGTTTTTCCC GCCTATAAAC TGGGCAATCT GCAGCAACAC TCTTTAGAAG AAATGGCGGC CTCTCCTTTT CAGCAGCAGT TTGGCGCGGC TAAAGCAAAC CTTTCCTCAC GCTGCCAGAA CTGTACGTGG CGCTTTGCCT GTCACGGCGG TTGTCCGAAA CATCGAATTT GCATGGACGG CGGCGAACGG CAAAATTATC TCTGTAAAGG ATATCTGGAG TTCTTTCAAC ATGTGACGCC CTATATGAAT GTGATGCGGC AATTATTACT GAATCAGCGA CCCGCCGCGC ATATTACTCG CATCGTCGAC ATGATTGCGG ATGACGTTCG TCAGTGA
|
Protein sequence | MSIVFNTVAK PSGSLCNLSC KYCFYLDKPR GQRVMSDDVL ETYIRRVIDD TPSSEVSFCW QGGEPTLCGL SFYQKVVCLQ QRYANGKTIY NSLQTNGVLI NEEWAAFFAQ HQFLIGISID GPQVVHDNYR RTPSGRASFS RVVNAIRLLQ ANDVEFNTLT VVNDASCRHG NAIYHFLTQE LESKHLQFIP IVEPLAQKAQ RSLTLSDNED SPSLMPFSVT PEGWGAFMCD VFDQWIRHDV GRIFVQLFDN LLGVWMGEPA TLCTMQSTCG QSLLVEQNGD VFSCDHFVFP AYKLGNLQQH SLEEMAASPF QQQFGAAKAN LSSRCQNCTW RFACHGGCPK HRICMDGGER QNYLCKGYLE FFQHVTPYMN VMRQLLLNQR PAAHITRIVD MIADDVRQ
|
| |