Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeAg_B3895 |
Symbol | |
ID | 6796869 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Agona str. SL483 |
Kingdom | Bacteria |
Replicon accession | NC_011149 |
Strand | + |
Start bp | 3787550 |
End bp | 3788431 |
Gene Length | 882 bp |
Protein Length | 293 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 642778015 |
Product | putative transcriptional regulator |
Protein accession | YP_002148610 |
Protein GI | 197248758 |
COG category | [G] Carbohydrate transport and metabolism [K] Transcription |
COG ID | [COG1940] Transcriptional regulator/sugar kinase |
TIGRFAM ID | [TIGR00744] ROK family protein (putative glucokinase) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 41 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGCAAT ATATCGGCAT TGATGTGGGA GGAACTCACG TCAAATATGG CGTGATTAAC AGTGACGGCG AAGAATTAAC CCATTATCAA TTCGATACGC CAGAGGACGC CTCCACGTTT ACCCGCAAAT GGCAGGATGT GGTGGCGCGT TGCCAACAGG ACTATGACAT TGCGGCAATC GGGGTCAGTT TCCCCGGCCA TATTAATCCC CATAACGGTC ATGCGGCAAA AGCGGGCGCG CTGGCTTACC TGGATGACGT CAACCTGATG GAGTTGTTCA GCGGGCTGAC CGATCTGCCG CTGGTCGTGG AGAACGACGC GAACTGCGCG GCGCTGGGCG AAATGTGGCG AGGCGCCGGG CAGCATTATG ACAATCTGGT CTGTATCACC ATTGGAACCG GCATTGGCGG CGGTATTATC GTCGGACGAG AACTGTATCG CGGCGCGCAT TTTCACGCCG GTGAATTCGG CGTCATGCCG GTCGGGAATA ATGGCGAAAG TATGCATAAA ATCGCGTCAA CCAGCGGATT AATGGCGTCA TGCCGCCAGG CGCTGGCGCT GCCTGCCGAA GAGATGCCGC CTGCGGATGT GATCTTCGAA CGAATGGCGA CCGATGTTCA TCTGCGTGAG GCGGTCAATG ACTGGGCGCG TTATCTTTCA CGCGGCGTTT ACAGCGTGAT CTCTATGTTT GATCCGGGCG TGGTGCTGAT CGGCGGAGGA ATAAGCGAAC AGGAAAAGCT CTACCCGCTC CTGACGCGGC ATCTTGAAAC GTTTGAAATG TGGGAGGCGC TCCAGGTGCC GATTCAGCCC TGCCAACTGG GAAATCAGGC GGGCAGGCTG GGCGCCGTCT GGCTGGCGCA GCAAAAGCTC GATCGAAGCT AA
|
Protein sequence | MQQYIGIDVG GTHVKYGVIN SDGEELTHYQ FDTPEDASTF TRKWQDVVAR CQQDYDIAAI GVSFPGHINP HNGHAAKAGA LAYLDDVNLM ELFSGLTDLP LVVENDANCA ALGEMWRGAG QHYDNLVCIT IGTGIGGGII VGRELYRGAH FHAGEFGVMP VGNNGESMHK IASTSGLMAS CRQALALPAE EMPPADVIFE RMATDVHLRE AVNDWARYLS RGVYSVISMF DPGVVLIGGG ISEQEKLYPL LTRHLETFEM WEALQVPIQP CQLGNQAGRL GAVWLAQQKL DRS
|
| |