Gene SNSL254_A3810 details

Gene Information

Locus tag: SNSL254_A3810
Symbol: gntR
ID: 6484931
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Newport str. SL254
Kingdom: Bacteria
Replicon accession: NC_011080
Strand:
Start bp: 3686680
End bp: 3687741
Gene Length: 1062 bp
Protein Length: 353 aa
Translation table: 11
GC content: 56%
IMG OID: 642739076
Product: transcriptional regulator GntR
Protein accession: YP_002042787
Protein GI: 194443456
COG category: [K] Transcription
COG ID: [COG1609] Transcriptional regulators
TIGRFAM ID:
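
The coordinates and lengths in the Gene Information fields can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. Neither convention is stated in the record itself, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming the final codon is an uncounted stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3686680, 3687741)
print(length)                  # 1062
print(protein_length(length))  # 353
```

Under those assumptions, both values agree with the Gene Length and Protein Length fields of the record.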


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 110
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGTCATTAA TTCCGCACGT CCGTGGTAAA CTGGGCAAAT CTATCCCTTT TATACCTTTC 
AGGACGATGA AAAAGAAAAG ACCCGTACTT CAGGATGTAG CCGACCGCGT GGGCGTGACC
AAAATGACGG TCAGCCGCTT TTTGCGTAAC CCGGAGCAGG TCTCCGTGGC GCTGCGGGGC
AAAATTGCCG CTGCACTTGA TGAGCTGGGT TATATTCCCA ATCGCGCTCC TGATATCCTT
TCTAACGCCA CCAGTCGCGC TATCGGCGTG CTGTTACCGT CTCTCACCAA CCAGGTTTTT
GCGGAAGTGT TACGCGGCAT TGAGGCCGTC ACCGACGCGC ATGGTTATCA AACCATGCTG
GCGCACTACG GCTATAAGCC CGAAATGGAG CAGGAGCGCC TGGAGTCGAT GCTCTCCTGG
AATATCGACG GTCTTATCCT CACTGAGCGT ACCCATACGC CGCGCACCTT AAAAATGATC
GAAGTCGCCG GGATTCCGGT GGTGGAACTG ATGGACAGCC AGTCGCCGTG TCTCGATATT
GCCGTTGGTT TTGATAACTT CGAGGCCGCC CGTCAGATGA CCGCCGCGAT TATCGCGCGT
GGTCATCGTC ATATCGCCTA TCTGGGGGCG CGCCTCGACG AACGTACTAT CATCAAGCAG
AAGGGCTATG AACAGGCGAT GCGGGACGCA GGCCTGGTTC CTTACAGTGT GATGATGGAG
CAATCTTCAT CCTACTCTTC CGGTATCGAA CTCATGCGCC AGGCGCGACG TGAATACCCA
CAGCTTGACG GTATTTTTTG CACCAACGAT GACCTGGCGG TGGGGGCGGC CTTCGAATGC
CAGCGCCTGG GGCTAAAAAT CCCGGACGAC ATGGCGATCG CCGGGTTCCA CGGTCATGAC
ATCGGCCAGG TGATGGAACC GCGTCTGGCA AGCGTCCTGA CGCCGCGCGA GCGAATGGGC
AGCATTGGCG CGGAACGTCT GTTGGCCCGC ATTCGCGGCG AAACGGTCAC GCCGAAAATG
TTAGATTTAG GTTTCACCTT GTCACCGGGC GGATCTATTT AG
 
Protein sequence
MSLIPHVRGK LGKSIPFIPF RTMKKKRPVL QDVADRVGVT KMTVSRFLRN PEQVSVALRG 
KIAAALDELG YIPNRAPDIL SNATSRAIGV LLPSLTNQVF AEVLRGIEAV TDAHGYQTML
AHYGYKPEME QERLESMLSW NIDGLILTER THTPRTLKMI EVAGIPVVEL MDSQSPCLDI
AVGFDNFEAA RQMTAAIIAR GHRHIAYLGA RLDERTIIKQ KGYEQAMRDA GLVPYSVMME
QSSSYSSGIE LMRQARREYP QLDGIFCTND DLAVGAAFEC QRLGLKIPDD MAIAGFHGHD
IGQVMEPRLA SVLTPRERMG SIGAERLLAR IRGETVTPKM LDLGFTLSPG GSI
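
As a quick consistency check between the two sequences above, the first 60 bases of the gene sequence can be translated and compared with the first 20 residues of the protein sequence. This is a minimal sketch, assuming the amino acid assignments and initiator codons of translation table 11, and assuming the leading TTG is read as methionine because it acts as the start codon. The helper name `translate_cds` is hypothetical and is not part of any database tooling.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiator codons of translation table 11; an initiator is read as M.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS fragment that begins at the start codon, stopping at a stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    usable = len(dna) - len(dna) % 3    # ignore any trailing partial codon
    residues = []
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residues.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


first_60 = "TTGTCATTAA TTCCGCACGT CCGTGGTAAA CTGGGCAAAT CTATCCCTTT TATACCTTTC"
print(translate_cds(first_60))  # MSLIPHVRGKLGKSIPFIPF
```

The output matches the first two 10-residue blocks of the protein sequence (MSLIPHVRGK LGKSIPFIPF).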