Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_A0040 |
Symbol | |
ID | 6483436 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | + |
Start bp | 41717 |
End bp | 42904 |
Gene Length | 1188 bp |
Protein Length | 395 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 642735484 |
Product | chondroitin sulfate/heparin utilization regulation protein |
Protein accession | YP_002039266 |
Protein GI | 194443637 |
COG category | [R] General function prediction only |
COG ID | [COG0641] Arylsulfatase regulator (Fe-S oxidoreductase) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 59 |
Fosmid unclonability p-value | 0.295568 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTGGGA AAAGTTGTCA GGTCATGGTT AAACCGACCG GATCGGTGTG TAACCTTGAC TGTAAGTACT GTTTTTATCT GGAGAAAGAA AAGCTCTATC CGGATCGAAA AAACCATTAC AAAATGTCGG AAGAGACCCT CGAACTCTTC ATCAGGCAGC AGATTGCCGC ACAAGATATT GATGAGGTCA TTTTTGCGTG GCAGGGCGGG GAACCCACAT TAATGGGCAT CCCGTTTTAT CGTAAAGCCG TTGAGTTTCA GCAGCGCTAT TGTGGCGGCA AAACCATCGT CAATACCTTC CAGACCAACG GCATCCTGAT CAACGATGAC TGGGCGACCT TCTTCCGGGA GCATGATTTT CTGGTTGGCG TCTCTATTGA TGGCGATGCC GCGTTACACG ATGAATGGCG AGTGACGCGC TCCGGAAAGC CGACGCATGA AAAAGTAGAA AATGCGGTGC GTTGTCTGGC GCAGCACGAC GTAGAATTTA ATACCCTCAC GGTGGTTAAC CGTACCAATA TGCATCATCC TGTTCAGGTC TATCGCTACC TGAAAAGCAT TGGTAGCCGC TATATGCAAT TTATCCCTTT AGTTGAACGC TGTGGGGAAA ATGGGCTGGC GCAGCCGCAG GATAAACATA TCGCGATGAC GCCGTGGTCG GTCGATAGCC TGCAATTTGG TCAGTTTCTG AATGCGGTAT TTGATATCTG GATCCGTGAG GATATCGGCG ATATCGGCAT CCAGCTATTT GAACAGACGC TGGCGGCCTG GTGCGGCCTG CCGCCGCAGG TTTGCGTTTT TGCTCCCACC TGCGGCAGCG CGTTTGCGAT GGAAATGAAC GGCGATGTTT ATAACTGCGA TCACTTCGTA TATCCGCAAT TTAAACTGGG GAATATCCAC CAGAAGACGC TGCGTCAAAT GAATCAGGGC GAACAAAATC GCCAGTTCGG CAGCGATAAA CAGCGTTCAA TGGCGCAGGA GTGTCATCGC TGTCAATGGA AGTTCGCCTG CTATGGCGGC TGTCCGAAAC ATCGTTTTTT ACCCTCTGCG TCAGGCGCAA CCAATCATAA CTATCTGTGT GCAGGTTATC AGGCTTTTTT CTCGCATACC GCGACGGCGA TGAGTGCCAT GCGAACCCTG TATGAAAAAG GCATCTCACC TGCAGAAATA AAGTCAATAT TTGTTTGA
|
Protein sequence | MFGKSCQVMV KPTGSVCNLD CKYCFYLEKE KLYPDRKNHY KMSEETLELF IRQQIAAQDI DEVIFAWQGG EPTLMGIPFY RKAVEFQQRY CGGKTIVNTF QTNGILINDD WATFFREHDF LVGVSIDGDA ALHDEWRVTR SGKPTHEKVE NAVRCLAQHD VEFNTLTVVN RTNMHHPVQV YRYLKSIGSR YMQFIPLVER CGENGLAQPQ DKHIAMTPWS VDSLQFGQFL NAVFDIWIRE DIGDIGIQLF EQTLAAWCGL PPQVCVFAPT CGSAFAMEMN GDVYNCDHFV YPQFKLGNIH QKTLRQMNQG EQNRQFGSDK QRSMAQECHR CQWKFACYGG CPKHRFLPSA SGATNHNYLC AGYQAFFSHT ATAMSAMRTL YEKGISPAEI KSIFV
|
| |