Gene SNSL254_A3757 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A3757 
Symbol 
ID6482531 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp3620988 
End bp3622265 
Gene Length1278 bp 
Protein Length425 aa 
Translation table11 
GC content58% 
IMG OID642739024 
Producthypothetical protein 
Protein accessionYP_002042735 
Protein GI194446812 
COG category[S] Function unknown 
COG ID[COG3266] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones84 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATGAAT TCAAACCAGA AGACGAGCTG AAACCCGATC CCAGCGATCG TCGTACTGGT 
CGTTCTCGTC AATCTTCAGA ACGCGATAAT GAGCCGCAGA TCAACTTTGA TGACGTTGAT
CTGGATGCCG ACGATCGCCG TCCGACGCGT ACGCGTAAAG CGCGTAGTGA AGAACCTGAA
GTTGAAGAAG AGTACGAATC CGATGAAGAC GATACGGTGG ACGAAGAGCG TGTTGAACGC
CGCCCACGTA AGCGTAAAAA AGCGACCCAT AAGCCAGCCT CTCGTCAGTA CATGATGATG
GGCGTTGGCG TACTGGTGCT GCTGCTGTTG ATTATCGGTA TCGGCTCCGC GCTGAAAGCC
CCCTCAACGT CTTCCAGCGA GCCGTCGGCC TCTGGCGAAA AGAGTATCGA TCTTTCCGGT
AACGCCGCCG ACCAGGCGAA TGCGACCCAG CCTGCGCCGG GCGCCACCTC CGCAGAACAA
ACCGCGGGCA ATACGTCGCA GGATATTTCG TTGCCGCCGA TTTCTTCAAC GCCGACGCAG
GGACAGTCGC CTGTGGTCGC TGACGGTCAG CAGCGCGTGG AAGTGCAGGG CGATCTGAAT
AATGCGCTGA CGCAGAATCC AGAGCAGATG AACAATGTTG CGGTGAACTC TACGTTGCCG
ACAGAGCCTG CAACCGTCGC GCCAGTTCGC AATGGCAGCA CGACGCGTCA GGCGGCGGTT
AGCGAACCTG CCGAGCGTCA TACCACGCGT CCGGAACGTA AACAGGCCGT CATTGAACCT
AAGAAGCCGC AGACCACGGC GAAAACCACC ACTGCGGAAC CGAAGAAACC GGTCGCGCCA
GTGAAACGCA CGGAACCGGC AGCGCCAGCC GCGACGCCGA AAGCGACCAC CACGACGGCT
GCGCCGACAG CGACGGCAAG CGCTGCGCCG GTACAAACCG CGAAGCCAGC GCAAGCCTCG
ACGACGCCTG TCGCAGGCGG CGGGAAAAGC GCCGGCAACG TTGGCGCATT AAAGAGCGCG
CCATCCAGCC ACTACACATT GCAGCTCAGT AGTTCTTCAA ATTACGACAA CCTGAACGGT
TGGGCGAAGA AAGAGAACCT GAAAAATTAT GTGGTATACG AGACGACGCG TAATGGACAA
CCGTGGTATG TGCTGGTAAC GGGGATGTAT GCTTCGAAAG AAGATGCTAA ACGTGCGGTG
TCCACCTTAC CTGCCGATGT GCAGGCGAAA AACCCGTGGG CAAAACCGTT GCATCAGGTT
CAGGCCGATC TGAAATAA
 
Protein sequence
MDEFKPEDEL KPDPSDRRTG RSRQSSERDN EPQINFDDVD LDADDRRPTR TRKARSEEPE 
VEEEYESDED DTVDEERVER RPRKRKKATH KPASRQYMMM GVGVLVLLLL IIGIGSALKA
PSTSSSEPSA SGEKSIDLSG NAADQANATQ PAPGATSAEQ TAGNTSQDIS LPPISSTPTQ
GQSPVVADGQ QRVEVQGDLN NALTQNPEQM NNVAVNSTLP TEPATVAPVR NGSTTRQAAV
SEPAERHTTR PERKQAVIEP KKPQTTAKTT TAEPKKPVAP VKRTEPAAPA ATPKATTTTA
APTATASAAP VQTAKPAQAS TTPVAGGGKS AGNVGALKSA PSSHYTLQLS SSSNYDNLNG
WAKKENLKNY VVYETTRNGQ PWYVLVTGMY ASKEDAKRAV STLPADVQAK NPWAKPLHQV
QADLK