Gene Information |
Locus tag | SeD_A3108 |
Symbol | |
ID | 6873299 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | - |
Start bp | 2994392 |
End bp | 2995726 |
Gene Length | 1335 bp |
Protein Length | 444 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 642786132 |
Product | GntR family transcriptional regulator |
Protein accession | YP_002216778 |
Protein GI | 198243372 |
COG category | [E] Amino acid transport and metabolism; [K] Transcription |
COG ID | [COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.552212 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 61 |
Fosmid unclonability p-value | 0.675979 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGCGCT ATCAGCACAT TGCTCGTCAG TTAAAAACGG CCATTGAGCA AGGAGAACTC GCGCCCGGAA CGCGTTTGCC TTCCAGTCGG ACGTGGGCGC AGGAACTGGG CGTTTCTCGC GCCACGGTGG AAAATGCCTA TGGCGAGCTG GTGGCGCAGG GCTGGCTGGA GCGACGTGGC CAGGCAGGCA CGTTTGTGAG CAACGCTCTA CGGTTTGAGA CGGCGCCGCC GATACCCGCT GTTTTTGCCG GAGAAAGTCC GGAACCGAAA CCCTTTCAGA TGGGGTTACC GGCGCTGGAT CTCTTTCCAC GCGAGAAGTG GGCGCGAGTG ATGGGACGTC GGTTGCGCAC GCAGACGCGC TTCGATCTGG CATTAGGCGA CGTCTGCGGC GAGGTGATTT TGCGCCAGGC GATAGTCGAT TACCTGCGGG TTTCGCGTAG CATTGAATGC CTGCCGGAAC AGGTATTTAT TACCTCCGGA TATGCGGATT CTATGCGGCT AATCCTGCGT ACATTGTCTG TGCCGGGAGA CAGCATGTGG GTGGAAGATC CCGGTTTTCC GTTAATTCGC CCGGTGATAA CGCAGGAGGG GATTACGCTG GCGCCGATTC CGGTCGATGC GGATGGGCTG AATGTCGCGG CGGGGATGCG GGATTGCCCG CAGGGGCGCT TTGCATTGGT GACGCCCGCC CACCAAAGTC CGTTGGGGGT AGCGCTGTCG TTAACTCGCC GACGGCAACT TCTGGCATGG GCGGCGAATG TGCAGGCCTG GATTATTGAA GATGACTACG ACAGCGAATT TCGTTATCAC GGTAAACCGC TTCCGCCGCT CAAGAGTCTG GATGCCCCGC AGCGAGTGAT TTACGCCGGA ACGTTCAGTA AGTCGCTCTT TCCGGCATTA CGTACCGCCT GGCTGGTGGT GCCGATAAAG CAGATTGAGC ATTTCCGCCA GCAGGCGTCG CTGATGCCCT GTAGCGTACC GTTGTTATGG CAGCACACGC TGGCTGATTT TATCCGTGAT GGCCATTTCT GGCGGCATCT GAAAAAGATG CGCCAACATT ATGCTCAGCG ACGGTTATGG ATTGAAGAGG CGCTGGCAGA ACAGGGATTT GTCGTGACAT TACAGAAAGG CGGTATTCAA TTGGTTATTG AAGTTGAAGG TGATGATAAA GCGCAGGTAG CAAAAGCGAA TCAGGCCGGA CTGGCGGTAC AGGCGCTAAG CCGTTGGCGA GTGGTTTCGT CAGGAAAGGG GGGCATTCTA CTGTCGTTTA CCAATATTAC TTCCGCTGGC ATGGCGAAAC AGGTCGCATG GCAGCTTCGA CAGGTGATAC GGTAA
|
Protein sequence | MPRYQHIARQ LKTAIEQGEL APGTRLPSSR TWAQELGVSR ATVENAYGEL VAQGWLERRG QAGTFVSNAL RFETAPPIPA VFAGESPEPK PFQMGLPALD LFPREKWARV MGRRLRTQTR FDLALGDVCG EVILRQAIVD YLRVSRSIEC LPEQVFITSG YADSMRLILR TLSVPGDSMW VEDPGFPLIR PVITQEGITL APIPVDADGL NVAAGMRDCP QGRFALVTPA HQSPLGVALS LTRRRQLLAW AANVQAWIIE DDYDSEFRYH GKPLPPLKSL DAPQRVIYAG TFSKSLFPAL RTAWLVVPIK QIEHFRQQAS LMPCSVPLLW QHTLADFIRD GHFWRHLKKM RQHYAQRRLW IEEALAEQGF VVTLQKGGIQ LVIEVEGDDK AQVAKANQAG LAVQALSRWR VVSSGKGGIL LSFTNITSAG MAKQVAWQLR QVIR
|
| |
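The coordinate and length fields in this record are related by simple arithmetic, which the following minimal Python sketch illustrates. It assumes that Start bp and End bp are 1-based, inclusive coordinates and that Gene Length includes the stop codon. Both are assumptions, though they agree with the values listed above. The function names (`gene_length`, `expected_protein_length`, `clean_sequence`, `gc_percent`) are hypothetical helpers written for this illustration and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


def clean_sequence(seq: str) -> str:
    """Remove the spaces that split the displayed sequence into blocks of 10."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean_sequence(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


# Values taken from the record above.
length = gene_length(2994392, 2995726)
print(length)                           # 1335
print(expected_protein_length(length))  # 444
```

With the record's coordinates, the sketch reproduces the listed Gene Length (1335 bp) and Protein Length (444 aa). `gc_percent` can be applied to the Gene sequence field after pasting it in as a string; the result should be compared with the listed GC content rather than assumed.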