Gene SNSL254_A3002 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A3002 
Symbol 
ID6483826 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp2926372 
End bp2927706 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content57% 
IMG OID642738318 
ProductGntR family transcriptional regulator 
Protein accessionYP_002042047 
Protein GI194442950 
COG category[E] Amino acid transport and metabolism
[K] Transcription 
COG ID[COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value0.00227723 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCCGCGCT ATCAGCACAT CGCTCGTCAG TTAAAAACGG CCATTGAGCA AGGAGAACTC 
GCGCCCGGAA CGCGCTTGCC TTCCAGCCGG ACGTGGGCGC AGGAACTGGG CGTTTCTCGC
GCCACGGTGG AAAATGCCTA TGGCGAGCTG GTGGCGCAGG GCTGGCTGGA GCGACGTGGT
CAGGCAGGCA CGTTTGTGAG CAACGCTCTA CGGTTTGAGA CGGCGCCGCC GATACCCGCT
GTTTTTGCCG GAGAAAGTCC GGAACCGAAA CCCTTTCAGA TGGGGTTACC GGCGCTGGAT
CTCTTTCCAC GCGAGAAGTG GGCGCGAGTG ATGGGGCGTC GGTTGCGCAC GCAGACGCGC
TTCGATCTGG CATTAGGCGA CGTCTGCGGC GAGGCGATTT TGCGCCAGGC GATAGTCGAT
TACCTGCGGG TTTCGCGTAG CATTGAATGC CTGCCGGAAC AGGTATTTAT TACCTCCGGA
TATGCGGATT CTATGCGGCT AATCCTGCGT ACATTGTCTG TGCCGGGAGA CAGCATGTGG
GTGGAAGATC CCGGTTTTCC GTTAATTCGC CCGGTGATAA CGCAGGAGGG GATTACGCTG
GCGCCGATTC CGGTCGATGC CGATGGGCTG AATGTCGCGG CGGGGATGCG GGATTGCCCG
CAGGGGCGCT TTGCATTGGT GACGCCCGCC CACCAAAGTC CGTTGGGGGT GGCGCTGTCG
TTAACTCGCC GACGGCAACT TCTGGCATGG GCGGCGAATG TGCAGGCCTG GATTATTGAA
GATGACTACG ACAGCGAATT TCGTTATCAC GGTAAACCGC TTCCGCCACT CAAGAGTCTG
GATGCCCCGC AGCGAGTGAT TTACGCCGGA ACGTTCAGTA AGTCGCTCTT TCCGGCATTA
CGTACCGCCT GGCTGGTGGT GCCGATAAAG CAGATTGAGC ATTTCCGCCA GCAGGTGTCG
CTGATGCCCT GTAGCGTACC GTTGTTATGG CAGCACACGC TGGCTGATTT TATCCGTGAT
GGCCATTTCT GGCGGCATCT GAAAAAGATG CGTCAACATT ATGCTCAGCG ACGGTTATGG
ATTGAAGAGG CGCTGGCAGA ACAGGGATTT GTCGTGACAT TACAGAAAGG CGGTATTCAA
TTGGTTATTG AGGTTGAAGG CGATGATAAA GCGCAGGTAG CAAAAGCGAA TCAGGCCGGA
CTGGCGGTAC AGGCGCTAAG CCGTTGGCGA GTGGTTTCAT CAGGAAAGGG GGGCATTTTA
CTGTCGTTTA CCAATATTAC TTCCGCTGGC ATGGCGAAAC AGGTCGCGTG GCAGCTTCGA
CAGGCGATAC AGTAA
 
Protein sequence
MPRYQHIARQ LKTAIEQGEL APGTRLPSSR TWAQELGVSR ATVENAYGEL VAQGWLERRG 
QAGTFVSNAL RFETAPPIPA VFAGESPEPK PFQMGLPALD LFPREKWARV MGRRLRTQTR
FDLALGDVCG EAILRQAIVD YLRVSRSIEC LPEQVFITSG YADSMRLILR TLSVPGDSMW
VEDPGFPLIR PVITQEGITL APIPVDADGL NVAAGMRDCP QGRFALVTPA HQSPLGVALS
LTRRRQLLAW AANVQAWIIE DDYDSEFRYH GKPLPPLKSL DAPQRVIYAG TFSKSLFPAL
RTAWLVVPIK QIEHFRQQVS LMPCSVPLLW QHTLADFIRD GHFWRHLKKM RQHYAQRRLW
IEEALAEQGF VVTLQKGGIQ LVIEVEGDDK AQVAKANQAG LAVQALSRWR VVSSGKGGIL
LSFTNITSAG MAKQVAWQLR QAIQ