Gene Information |
Locus tag | SNSL254_A4129 |
Symbol | |
ID | 6485645 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | + |
Start bp | 4021499 |
End bp | 4022458 |
Gene Length | 960 bp |
Protein Length | 319 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 642739385 |
Product | DNA-binding transcriptional regulator YidZ |
Protein accession | YP_002043094 |
Protein GI | 194443623 |
COG category | [K] Transcription |
COG ID | [COG0583] Transcriptional regulator |
TIGRFAM ID | |
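The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the final codon is a stop codon that is not counted in the protein length; the function names are hypothetical and used only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the terminal stop codon


# Values taken from the record above.
length = gene_length_bp(4021499, 4022458)
print(length)                     # gene length in bp
print(protein_length_aa(length))  # protein length in aa
```

With the values from this record, the result agrees with the listed Gene Length (960 bp) and Protein Length (319 aa).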
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.176556 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 90 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGAAAT CCCTTACCAA TCTCGACCTT AATTTGCTGT TATGCCTACA GTTACTCATG CAGGAGCGCA GCGTCACTAA AGCTGCAAAA CGGATGAACG TAACGCCCTC GGCGGTCAGC AAATCGCTGG CGAAACTCCG GGCATGGTTT GACGATCCGC TGTTCGTCAA TACGCCGCTG GGCCTGGCGC CGACGCCGCT GATGGTCAGT ATGGAGCAAA GTCTGGCGGA CTGGATGCAG ATGGGGAATC AACTGCTTGA TAAGCCGCAT CATCAAACGC CGCGCGGTTT AAAATTTGAG TTGGCGGCAG AGTCGCCGCT GATGATGATT ATGTTTAATT CACTGTCGCA GCAGATTTAT CAGCGCTATC CGCAGGCTAC CATTAAGGTT CGCAACTGGG ATTATGACTC GCTGGAGGCG ATTACGCGTG GGGAAGTCGA TATCGGATTT ACCGGACGCG AAAGCCATCC CCGCTCGCGA GAATTGCTCA GTCTGTTGCC GCTCGCGATT GATTTTGAGG TGTTGTTTAG CGATTTGCCG TGGGTCTGGC TGCGGGAAGA TCATCCGGCA CTGCGTGAAG CGTGGGATCT GGACACTTTT CTGCGCTACC CGCATATCAG TATTTGCTGG GAGCAAAGCG ATACCTGGGC GCTGGATGAT GTCCTGCAAG AAATGGGACG CAAACGCCAT ATTGCCTTGA GCCTGCCGGG GTTCGAGCAG TCGCTCTTTA TGGCCGCCCA GCCGGATCAC ACCCTGATAG CAACCGCGCC GCGCTATTGC CAGCACTACA ATCAGCTCCA CCAGCTACCG TTAGTGGCGC GCCCCCTGCC TTTTGATGCG CAACAGCGGG AAAAGCTGAT GGTGCCGTTT ACCCTATTAT GGCACAAACG TAATAGTCAT AATCCCAAGA TTGTGTGGCT ACGACAGGCT ATCAACACGC TCTGCCGTCG CCTTATCTGA
|
Protein sequence | MKKSLTNLDL NLLLCLQLLM QERSVTKAAK RMNVTPSAVS KSLAKLRAWF DDPLFVNTPL GLAPTPLMVS MEQSLADWMQ MGNQLLDKPH HQTPRGLKFE LAAESPLMMI MFNSLSQQIY QRYPQATIKV RNWDYDSLEA ITRGEVDIGF TGRESHPRSR ELLSLLPLAI DFEVLFSDLP WVWLREDHPA LREAWDLDTF LRYPHISICW EQSDTWALDD VLQEMGRKRH IALSLPGFEQ SLFMAAQPDH TLIATAPRYC QHYNQLHQLP LVARPLPFDA QQREKLMVPF TLLWHKRNSH NPKIVWLRQA INTLCRRLI
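The protein sequence can be recovered from the gene sequence by translating it codon by codon. The sketch below is a minimal illustration, not the pipeline used to build this record: it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which start codons are permitted), and the names `CODON_TABLE` and `translate` are hypothetical. It translates only the first 30 bases of the gene sequence, copied from the record above.

```python
from itertools import product

# Standard codon-to-amino-acid assignments, listed in NCBI order
# (bases T, C, A, G; first codon position varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()  # the record groups bases in blocks of 10
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence in this record.
first_30_bases = "ATGAAGAAAT CCCTTACCAA TCTCGACCTT"
print(translate(first_30_bases))
```

The ten residues produced match the first block of the protein sequence, MKKSLTNLDL. Passing the full gene sequence to the same function would be expected to stop at the terminal TGA codon.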