Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_A3940 |
Symbol | |
ID | 6484852 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | + |
Start bp | 3820503 |
End bp | 3821681 |
Gene Length | 1179 bp |
Protein Length | 392 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 642739200 |
Product | xylose operon regulatory protein |
Protein accession | YP_002042910 |
Protein GI | 194444174 |
COG category | [K] Transcription |
COG ID | [COG1609] Transcriptional regulators [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 64 |
Fosmid unclonability p-value | 0.688005 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTGATA AACGTCACCG CATCACTCTG TTATTTAACG CGAATAAAGC CTATGACCGT CAGGTAGTGG AGGGGGTGGG TGAATATTTA CAAGCCTCGC AATCCGAATG GGATATATTT ATTGAGGAAG ATTTCCGTGC CCGTATCGAT AACATTAAAG AGTGGTTAGG CGACGGCGTT ATTGCCGATT ACGATGATGA CGATATCGCG CAATTATTGG CCGATGTCGA CGTCCCCATT GTCGGGGTCG GCGGTTCTTA CCATCTTGCT GAAAATTATC CCGCCGTTCA TTACATCGCC ACCGATAATC ATGCGCTCGT TGAAAGCGCT TTCCTGCATT TAAAAGAAAA AGGCGTTAAC CGCTTCGCGT TTTACGGTTT GCCCGACTCC AGCCGCAAAC ATTGGGCGGC GGAACGGGAA TACGCCTTTC GCCAGCTGGT CGCCGAGGAA AAATACCGCG GCGTAGTCTA TCAGGGACTG GAAACCGCGC CGGAAAACTG GCAGCACGCG CAAAATCGCC TCGCCGACTG GCTTCAGACG CTGCCGCCGC AAACCGGCAT CATTGCCGTA ACGGATGCCC GCGCCCGTCA CGTATTGCAG GCCTGTGAAC ACCTGCATAT TCCGGTGCCG GAAAAACTTT GCGTTATCGG TATTGATAAC GAAGAGTTAA CCCGTTATCT GTCGCGCGTC GCGCTTTCCT CCGTCGCGCA GGGGGCGCGG CAAATGGGCT ATCAGGCGGC GAAGCTGCTG CACCGTTTGC TGGCGCGCGA AGAGATGCCG TTACAGCGCA TTCTGGTGCC GCCGGTGCGC GTCATTGCGC GCCGCTCGAC AGACTATCGC TCCCTGACCG ATCCGGCGGT TATCCAGGCG ATGCACTTTA TTCGTAACCA TGCCTGTAAG GGCATTAAAG TCGAGCAGGT GCTGGACGCG GTTGGGATTT CACGTTCAAA CCTGGAAAAA CGTTTTAAGG AGGAAGTTGG CGAGACGATA CATGCGCTGA TCCACGCCGA AAAGCTGGAA AAAGCGCGTA GTTTGTTGAT TTCTACCACG TTGGCGATAA ACGAAATTTC GCAAATGTGC GGCTACCCGT CACTGCAATA TTTCTATTCG GTGTTTAAAA AGGAGTACGT AACTACGCCT AAGGAGTATC GCGACCAGCA TAGTGAAGCG TTGTTGTAG
|
Protein sequence | MFDKRHRITL LFNANKAYDR QVVEGVGEYL QASQSEWDIF IEEDFRARID NIKEWLGDGV IADYDDDDIA QLLADVDVPI VGVGGSYHLA ENYPAVHYIA TDNHALVESA FLHLKEKGVN RFAFYGLPDS SRKHWAAERE YAFRQLVAEE KYRGVVYQGL ETAPENWQHA QNRLADWLQT LPPQTGIIAV TDARARHVLQ ACEHLHIPVP EKLCVIGIDN EELTRYLSRV ALSSVAQGAR QMGYQAAKLL HRLLAREEMP LQRILVPPVR VIARRSTDYR SLTDPAVIQA MHFIRNHACK GIKVEQVLDA VGISRSNLEK RFKEEVGETI HALIHAEKLE KARSLLISTT LAINEISQMC GYPSLQYFYS VFKKEYVTTP KEYRDQHSEA LL
|
| |