Gene Information |
Locus tag | SeAg_B0041 |
Symbol | |
ID | 6794279 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Agona str. SL483 |
Kingdom | Bacteria |
Replicon accession | NC_011149 |
Strand | + |
Start bp | 41723 |
End bp | 42910 |
Gene Length | 1188 bp |
Protein Length | 395 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 642774353 |
Product | chondroitin sulfate/heparin utilization regulation protein |
Protein accession | YP_002145017 |
Protein GI | 197247602 |
COG category | [R] General function prediction only |
COG ID | [COG0641] Arylsulfatase regulator (Fe-S oxidoreductase) |
TIGRFAM ID | |
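
The length fields above can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, that the CDS is unspliced (as the record states), and that the terminal stop codon is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length of an unspliced CDS, excluding the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp 41723, End bp 42910.
gene_len = gene_length_bp(41723, 42910)
protein_len = protein_length_aa(gene_len)
print(gene_len, protein_len)
```

Under these assumptions the results agree with the Gene Length (1188 bp) and Protein Length (395 aa) fields.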
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTTGGGA AAAGTTGTCA GGTCATGGTT AAACCAACCG GATCGGTGTG TAACCTTGAC TGTAAGTACT GTTTTTATCT GGAGAAAGAA AAGCTCTATC CGGATCGAAA AAACCATTAC AAAATGTCGG AAGAGACCCT CGAACTCTTC ATCAGGCAGC AGATTGCCGC ACAGGATATT GATGAGGTCA TTTTTGCGTG GCAGGGCGGG GAACCCACAT TAATGGGCAT CCCGTTTTAT CGTAAAGCCG TTGAATTTCA GCAGCGCTAT TGTGGCGGCA AAACCATCGT CAATACCTTC CAGACCAACG GCATCCTGAT CAACGACGAC TGGGCGACCT TCTTCCGGGA GCATGATTTT CTGGTTGGCG TCTCTATTGA TGGCGATGCC GCGTTACACG ATGAATGGCG AGTGACGCGC TCCGGAAAGC CGACGCATGA AAAAGTAGAA AATGCGGTGC GTTGTCTGGC GCAGCACAAC GTGGAATTTA ATACCCTCAC GGTGGTTAAC CGTACCAATA TGCACCATCC TGTTCAGGTC TATCGCTACC TGAAAAGCAT TGGTAGCCGC TATATGCAAT TTATCCCTTT AGTTGAACGA TGCGGGGAAA ATGGGCTGGC GCAGCCGCAG GATAAACATA TCGCGATGAC GCCGTGGTCG GTCGATAGCC TGCAATTTGG TCAGTTTCTG AATGCGGTAT TTGATGTCTG GATCCGTGAG GATATCGGCG ATATCGGCAT CCAGCTATTT GAACAGACGC TGGCAGCCTG GTGCGGCCTG CCGCCGCAGG TTTGCGTTTT TGCTCCCACC TGCGGCAGCG CGTTTGCGAT GGAAATGAAC GGCGATGTTT ATAACTGCGA TCACTTCGTA TATCCGCAAT TTAAACTGGG GAATATCCAC CAGAAGACGC TGCGTCAAAT GAATCAGGGC GAACAAAATC GCCAGTTCGG CAGCGATAAA CAGCATTCAA TGGCGCAGGA GTGCCATCGC TGTCAATGGA AGTTCGCCTG CTATGGCGGC TGTCCGAAAC ATCGTTTTTT ACCCTCCGCG TCAGGCGCAA CCAATCATAA CTATCTGTGT GCAGGTTATC AGGCTTTTTT CTCGCATACC GCGACGGCGA TGAGTGCCAT GCGAACCCTG TATGAAAAAG GCATCTCACC TGCAGAAATA AAGTCAATAT TTGTTTGA
|
Protein sequence | MFGKSCQVMV KPTGSVCNLD CKYCFYLEKE KLYPDRKNHY KMSEETLELF IRQQIAAQDI DEVIFAWQGG EPTLMGIPFY RKAVEFQQRY CGGKTIVNTF QTNGILINDD WATFFREHDF LVGVSIDGDA ALHDEWRVTR SGKPTHEKVE NAVRCLAQHN VEFNTLTVVN RTNMHHPVQV YRYLKSIGSR YMQFIPLVER CGENGLAQPQ DKHIAMTPWS VDSLQFGQFL NAVFDVWIRE DIGDIGIQLF EQTLAAWCGL PPQVCVFAPT CGSAFAMEMN GDVYNCDHFV YPQFKLGNIH QKTLRQMNQG EQNRQFGSDK QHSMAQECHR CQWKFACYGG CPKHRFLPSA SGATNHNYLC AGYQAFFSHT ATAMSAMRTL YEKGISPAEI KSIFV
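
The sequences above are printed in space-separated blocks of 10 residues, so they need to be joined before any computation. The following is a minimal sketch of how the block formatting could be stripped and a GC fraction computed. The helper names are hypothetical, and it is an assumption that the GC content field is a rounded percentage of G and C bases over the whole gene sequence.

```python
def clean_sequence(blocked: str) -> str:
    """Remove whitespace from a block-formatted sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two blocks of the gene sequence, used only as a short illustration.
example = clean_sequence("ATGTTTGGGA AAAGTTGTCA")
example_gc = gc_fraction(example)
print(len(example), example_gc)
```

Passing the full Gene sequence field through `clean_sequence` and then `gc_fraction` would give a value to compare with the GC content field, and `len()` of the cleaned string can be compared with the Gene Length field.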