Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_A3002 |
Symbol | |
ID | 6483826 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | - |
Start bp | 2926372 |
End bp | 2927706 |
Gene Length | 1335 bp |
Protein Length | 444 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 642738318 |
Product | GntR family transcriptional regulator |
Protein accession | YP_002042047 |
Protein GI | 194442950 |
COG category | [E] Amino acid transport and metabolism [K] Transcription |
COG ID | [COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 0.00227723 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCCGCGCT ATCAGCACAT CGCTCGTCAG TTAAAAACGG CCATTGAGCA AGGAGAACTC GCGCCCGGAA CGCGCTTGCC TTCCAGCCGG ACGTGGGCGC AGGAACTGGG CGTTTCTCGC GCCACGGTGG AAAATGCCTA TGGCGAGCTG GTGGCGCAGG GCTGGCTGGA GCGACGTGGT CAGGCAGGCA CGTTTGTGAG CAACGCTCTA CGGTTTGAGA CGGCGCCGCC GATACCCGCT GTTTTTGCCG GAGAAAGTCC GGAACCGAAA CCCTTTCAGA TGGGGTTACC GGCGCTGGAT CTCTTTCCAC GCGAGAAGTG GGCGCGAGTG ATGGGGCGTC GGTTGCGCAC GCAGACGCGC TTCGATCTGG CATTAGGCGA CGTCTGCGGC GAGGCGATTT TGCGCCAGGC GATAGTCGAT TACCTGCGGG TTTCGCGTAG CATTGAATGC CTGCCGGAAC AGGTATTTAT TACCTCCGGA TATGCGGATT CTATGCGGCT AATCCTGCGT ACATTGTCTG TGCCGGGAGA CAGCATGTGG GTGGAAGATC CCGGTTTTCC GTTAATTCGC CCGGTGATAA CGCAGGAGGG GATTACGCTG GCGCCGATTC CGGTCGATGC CGATGGGCTG AATGTCGCGG CGGGGATGCG GGATTGCCCG CAGGGGCGCT TTGCATTGGT GACGCCCGCC CACCAAAGTC CGTTGGGGGT GGCGCTGTCG TTAACTCGCC GACGGCAACT TCTGGCATGG GCGGCGAATG TGCAGGCCTG GATTATTGAA GATGACTACG ACAGCGAATT TCGTTATCAC GGTAAACCGC TTCCGCCACT CAAGAGTCTG GATGCCCCGC AGCGAGTGAT TTACGCCGGA ACGTTCAGTA AGTCGCTCTT TCCGGCATTA CGTACCGCCT GGCTGGTGGT GCCGATAAAG CAGATTGAGC ATTTCCGCCA GCAGGTGTCG CTGATGCCCT GTAGCGTACC GTTGTTATGG CAGCACACGC TGGCTGATTT TATCCGTGAT GGCCATTTCT GGCGGCATCT GAAAAAGATG CGTCAACATT ATGCTCAGCG ACGGTTATGG ATTGAAGAGG CGCTGGCAGA ACAGGGATTT GTCGTGACAT TACAGAAAGG CGGTATTCAA TTGGTTATTG AGGTTGAAGG CGATGATAAA GCGCAGGTAG CAAAAGCGAA TCAGGCCGGA CTGGCGGTAC AGGCGCTAAG CCGTTGGCGA GTGGTTTCAT CAGGAAAGGG GGGCATTTTA CTGTCGTTTA CCAATATTAC TTCCGCTGGC ATGGCGAAAC AGGTCGCGTG GCAGCTTCGA CAGGCGATAC AGTAA
|
Protein sequence | MPRYQHIARQ LKTAIEQGEL APGTRLPSSR TWAQELGVSR ATVENAYGEL VAQGWLERRG QAGTFVSNAL RFETAPPIPA VFAGESPEPK PFQMGLPALD LFPREKWARV MGRRLRTQTR FDLALGDVCG EAILRQAIVD YLRVSRSIEC LPEQVFITSG YADSMRLILR TLSVPGDSMW VEDPGFPLIR PVITQEGITL APIPVDADGL NVAAGMRDCP QGRFALVTPA HQSPLGVALS LTRRRQLLAW AANVQAWIIE DDYDSEFRYH GKPLPPLKSL DAPQRVIYAG TFSKSLFPAL RTAWLVVPIK QIEHFRQQVS LMPCSVPLLW QHTLADFIRD GHFWRHLKKM RQHYAQRRLW IEEALAEQGF VVTLQKGGIQ LVIEVEGDDK AQVAKANQAG LAVQALSRWR VVSSGKGGIL LSFTNITSAG MAKQVAWQLR QAIQ
|
| |