Gene SNSL254_A3959 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A3959 
Symbol 
ID6485208 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp3842198 
End bp3843079 
Gene Length882 bp 
Protein Length293 aa 
Translation table11 
GC content57% 
IMG OID642739219 
Productputative transcriptional regulator 
Protein accessionYP_002042929 
Protein GI194445844 
COG category[G] Carbohydrate transport and metabolism
[K] Transcription 
COG ID[COG1940] Transcriptional regulator/sugar kinase 
TIGRFAM ID[TIGR00744] ROK family protein (putative glucokinase) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0229726 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones64 
Fosmid unclonability p-value0.500225 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGCAAT ATATCGGTAT TGATGTGGGA GGAACTCACG TCAAATATGG CGTGATTAAC 
AGTGACGGCG AAGAATTAAC CCATCATCAA TTCGATACGC CAGAGGACGC CTCCACGTTT
ACCCGCAAAT GGCAGGATGT GGTGGCGCGT TGCCAACAGG ACTATGACAT TGCGGCAATC
GGGGTTAGTT TCCCCGGCCA TATTAATCCC CATAACGGTC ATGCGGCAAA AGCGGGCGCG
CTGGCTTACC TGGATGACGT CAACCTGATG GAGTTGTTCA GCGGGCTGAC CGATCTGCCG
CTGGTCGTGG AGAACGACGC GAACTGTGCG GCGCTGGGCG AAATGTGGCG AGGTGCCGGG
CAGCATTATG ACAATCTGGT CTGTATTACC ATTGGAACCG GCATTGGCGG CGGTATTATC
GTCGGACGAG AACTGTATCG CGGCGCGCAT TTTCACGCCG GTGAATTCGG CGTCATGCCG
GTCGGGAACA ATGGCGAAAG TATGCATAAA ATCGCGTCAA CCAGCGGATT AATGGCGTCG
TGCCGCCAGG CGCTGGCGCT GCCTGCCGAA GAGATGCCGC CTGCGGATGT GATCTTCGAA
CGAATGGCGA CCGATGTTCA TCTGCGTGAG GCGGTCAATG ACTGGGCGCG TTATCTTTCA
CGCGGCGTTT ACAGCGTGAT CTCTATGTTT GATCCGGGCG TGGTGCTGAT CGGCGGAGGA
ATAAGCGAAC AGGAAAAGCT CTACCCGCTC CTGACGCGGC ATCTTGAAAC GTTTGAAATG
TGGGAGGCGC TCCAGGTGCC GATTCAGCCC TGCCAACTGG GAAATCAGGC GGGCAGGCTG
GGCGCCGTCT GGCTGGCGCA GCAAAAGCTC GATCGAAGCT AA
 
Protein sequence
MQQYIGIDVG GTHVKYGVIN SDGEELTHHQ FDTPEDASTF TRKWQDVVAR CQQDYDIAAI 
GVSFPGHINP HNGHAAKAGA LAYLDDVNLM ELFSGLTDLP LVVENDANCA ALGEMWRGAG
QHYDNLVCIT IGTGIGGGII VGRELYRGAH FHAGEFGVMP VGNNGESMHK IASTSGLMAS
CRQALALPAE EMPPADVIFE RMATDVHLRE AVNDWARYLS RGVYSVISMF DPGVVLIGGG
ISEQEKLYPL LTRHLETFEM WEALQVPIQP CQLGNQAGRL GAVWLAQQKL DRS