Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_A4246 |
Symbol | |
ID | 6483911 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | + |
Start bp | 4137157 |
End bp | 4138452 |
Gene Length | 1296 bp |
Protein Length | 431 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 642739496 |
Product | chondroitin sulfate/heparin utilization regulation protein |
Protein accession | YP_002043195 |
Protein GI | 194443254 |
COG category | [R] General function prediction only |
COG ID | [COG0641] Arylsulfatase regulator (Fe-S oxidoreductase) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.598382 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 77 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGGTAG CAGGTTGTCA CGTAATGGCC AAACCCGGCG GCGCGATCTG CAATATTGAT TGCACATATT GCTTCTATCT TGAAAAAGAG GCGCTGTACC CGGAACGCAA TAAAAACTGG CGGATGTCGG ACGAGACGCT GGAACAATTT ATACGCCAGC ATATTGCCGC GCAGAGTGGC GACCGCATTG ACTTTGCCTG GCAGGGCGGC GAACCGACCA TGATGGGGCT GCCATTTTTC CGCCGGGTTG TCGCATTGTG TGAAAAGTAC GGCGATGGGC GAAAAATCAC TCATGCGTTA CAGACGAACG GCATCCTGGT GAATGACGAG TGGGCGCGCT TTTTCGCTGA ACAGCATTTT CTCATCGGTC TCTCTATCGA CGGTCCGGCG TCGTTACACA ACCACTATCG GCTTAATCGC GCTGGAAAAG GAACTCATGA ACAGGTCGTC GCAGCCATGG CGCGGCTTAA AGCGCACCAT GTCGACTTTA ATACCTTAAC CGTCGTGGGA AAACATAACG TCGGTCATGC AGCAGACGTC TACGAATTTC TTCTGGCGGC GGGATCGCGT TTTATTCAGT TTATCCCGCT GGTAGAGCGA ATGAGCACCG ATAACTCATC GGTACTTAAT CTGGTGATGC CCGGCGAAAG CGCGGCAAAG CTGGCGCCAT GGACGGTACC GTCGTGGCAA TATGGCGAAT TTCTCAACCA GATCTTTGAT ATCTGGGTTC TTCGCGACGT AGACCGCGTC TATGTGCAGA TGTTTGACGT GGCGTTAGCC GCCTGGACGG CGCAGCAGCC GGTACTGTGT GTACATTCCG AGACTTGTGG ACATGCCTTC GCGTTGGAGT CGAACGGCGA TCTCTACAAC TGCGACCACT TTGTCTACCC GGAGCATCTG CTGGGGAATA TCCACCAGCA CAGCATCAAA ACCTTAAATA ATAGCGAGCG GGCTATTGCC TTTGGTGAGG CCAAGCGGGA GACCCTGACC GCCGATTGTC GTCGCTGTGA CTACCGCTTT GCGTGTCATG GCGGCTGTCC GAAGCATCGC TTTGCCGTCT CGCCGTCCGG TCATCCTGCG CATAATTACT TGTGTGCGGG CTATAAGCAT TTTTTCCAGC ACGTTACGCC GTATATGAAT GTCTGGCGGG AGCTGCTGGC GCAAGGCTAT CCGATGGCAT CGATCATGCG CTGGCTGGCG CAGGACGCGC GTAAAGACAC AGGAGCCGTC AGTCGTAACC ATCTCTGTCC CTGCGGCAGC GGCAAAAAAT ATAAAAAATG CTGTGGTAAA GCATAG
|
Protein sequence | MAVAGCHVMA KPGGAICNID CTYCFYLEKE ALYPERNKNW RMSDETLEQF IRQHIAAQSG DRIDFAWQGG EPTMMGLPFF RRVVALCEKY GDGRKITHAL QTNGILVNDE WARFFAEQHF LIGLSIDGPA SLHNHYRLNR AGKGTHEQVV AAMARLKAHH VDFNTLTVVG KHNVGHAADV YEFLLAAGSR FIQFIPLVER MSTDNSSVLN LVMPGESAAK LAPWTVPSWQ YGEFLNQIFD IWVLRDVDRV YVQMFDVALA AWTAQQPVLC VHSETCGHAF ALESNGDLYN CDHFVYPEHL LGNIHQHSIK TLNNSERAIA FGEAKRETLT ADCRRCDYRF ACHGGCPKHR FAVSPSGHPA HNYLCAGYKH FFQHVTPYMN VWRELLAQGY PMASIMRWLA QDARKDTGAV SRNHLCPCGS GKKYKKCCGK A
|
| |