Gene SNSL254_A3910 details

Gene Information

Locus tag: SNSL254_A3910
Symbol:
ID: 6484381
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Newport str. SL254
Kingdom: Bacteria
Replicon accession: NC_011080
Strand:
Start bp: 3793795
End bp: 3794802
Gene Length: 1008 bp
Protein Length: 335 aa
Translation table: 11
GC content: 60%
IMG OID: 642739172
Product: regulatory protein LacI:Periplasmic binding protein/LacI transcriptional regulator
Protein accession: YP_002042883
Protein GI: 194442857
COG category: [K] Transcription
COG ID: [COG1609] Transcriptional regulators
TIGRFAM ID:
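
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal Python sketch, assuming the start and end coordinates are 1-based and inclusive (the record does not state this, but it is consistent with the reported gene length) and that the CDS ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
start_bp = 3793795
end_bp = 3794802
reported_gene_length_bp = 1008
reported_protein_length_aa = 335


def span_length(start: int, end: int) -> int:
    """Length of a feature whose start and end are both inclusive (assumption)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS: number of codons minus one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = expected_protein_length(computed_gene_length)

print(computed_gene_length == reported_gene_length_bp)
print(computed_protein_length == reported_protein_length_aa)
```

Under these assumptions both comparisons hold: 3794802 - 3793795 + 1 = 1008 bp, and 1008 / 3 - 1 = 335 aa.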


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.83437
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 70
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTGTTC AAAATAAAAA ACGCGCAAAG TTGATTGATG TTGCCCGCTA TGCAGGTGTG 
TCGCCAGGGA CAGTATCCAA TGCATTGCAC AACACTCGCT TTGTCGAGCC GCAGACGCGA
CGGCGTATTG AAGAGGCCAT TGCTGCGCTC AACTACACGC CGAATATTCG CGCCCGCCAG
TTGCGAACCG GCAAAACCAA TACCATTGCT TTGCTCTCTT CGGTGCCGCT GGCGATTGCC
TCCGGCGCGT CACGACTGGG ATTTATGATG GAGGTGGCAT TAACGTCCGC GATGATGGCG
CTGGAAAAGC AGCATGCGCT GATTCTGGTG CCGCCGGGGG CAAATCCACT GGATGCCGTC
AGCTTTGACG CGGCGATCCT GATTGAGCCG GCGGAGAACG ATCCGCAGCT CCAGGCGCTG
GCGCAAGCGG GCATTCCCTG CGTCACCATT GGCCGCACGC CGGGGACCGA CACGCCTGTG
CCGTGGGTTG AGCTGCACTC GGCGGCAACA GCACAGCTTC TGCTAACGCA TCTGGAGGCC
TCCGGCGCCA GCAAATGTGC GTTATTTGTC GGTAACACAC GGCGAACATC AGTTCTGGAA
AGCATAGCGG CTTACCAGCG CTGGTGCGCG GGGCGCCAGG CCCCCGTCGT CTACTCTCTC
AATGAAAGCG AGGGTGAAAA TGCCGGCTAC CAGGCCGCGC AGCAGCTATT ACAGGCGCAT
CCCGACGTTG ACGGCGTGCT GGTGCTGATC GATACCTTTG CCAGCGGCGC GGTACGCGCT
TTCCAGGAAC AAGACATCGC CATACCTGAA CAAATGCGGG TGGTTACCCG CTATGACGGT
ATCCGCGCGC GCGAATCGCT GCCGCCGCTG ACGGCAGTGA ATATGCATCT TGATGAGGTG
GCGCGACAGG CAATCACGCT CCTGTTTGCC GTTCTGTCGG GTGAGAAGGT CAGCTACAGC
GACGGGATCA TGCCTGAACT GGTGGTGCGA GCGTCAACCT GCCGGTGA
 
Protein sequence
MAVQNKKRAK LIDVARYAGV SPGTVSNALH NTRFVEPQTR RRIEEAIAAL NYTPNIRARQ 
LRTGKTNTIA LLSSVPLAIA SGASRLGFMM EVALTSAMMA LEKQHALILV PPGANPLDAV
SFDAAILIEP AENDPQLQAL AQAGIPCVTI GRTPGTDTPV PWVELHSAAT AQLLLTHLEA
SGASKCALFV GNTRRTSVLE SIAAYQRWCA GRQAPVVYSL NESEGENAGY QAAQQLLQAH
PDVDGVLVLI DTFASGAVRA FQEQDIAIPE QMRVVTRYDG IRARESLPPL TAVNMHLDEV
ARQAITLLFA VLSGEKVSYS DGIMPELVVR ASTCR
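
The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any downstream use. The sketch below is one hedged way to do that in plain Python: it strips the block spacing, translates DNA up to the first stop codon, and computes a GC fraction. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical. The codon table holds the standard amino-acid assignments, which translation table 11 shares with the standard code; alternative start codons are not handled here.

```python
from itertools import product

# Standard amino-acid assignments, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_block: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq_block.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA string."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence, copied from the record above.
first_line = "ATGGCTGTTC AAAATAAAAA ACGCGCAAAG TTGATTGATG TTGCCCGCTA TGCAGGTGTG"
last_line = "GACGGGATCA TGCCTGAACT GGTGGTGCGA GCGTCAACCT GCCGGTGA"

print(translate(clean(first_line)))  # MAVQNKKRAKLIDVARYAGV
print(translate(clean(last_line)))   # DGIMPELVVRASTCR
```

The two printed fragments match the start and end of the protein sequence listed above. Passing the full cleaned gene sequence to `gc_fraction` would give a value to compare against the reported GC content.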