Gene SNSL254_A3957 details

Gene Information

Locus tag: SNSL254_A3957
Symbol:
ID: 6484435
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Newport str. SL254
Kingdom: Bacteria
Replicon accession: NC_011080
Strand:
Start bp: 3838250
End bp: 3840205
Gene Length: 1956 bp
Protein Length: 651 aa
Translation table: 11
GC content: 57%
IMG OID: 642739217
Product: hypothetical protein
Protein accession: YP_002042927
Protein GI: 194444786
COG category: [S] Function unknown
COG ID: [COG3533] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
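
The coordinates and lengths listed above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes that Start bp and End bp are 1-based, inclusive positions and that the protein length excludes the stop codon; neither convention is stated on this page. The function names are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3838250, 3840205)
print(length)                           # 1956
print(expected_protein_length(length))  # 651
```

Both values match the Gene Length and Protein Length fields in the record.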


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.565903
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 65
Fosmid unclonability p-value: 0.86497
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACGTAC TGGAAGTCGA TCTGCATAAA CTGACGGTCA GCGATCCGTT CCTCGGACAG 
TATCAACAAC TGGTTCGCGA TGTGGTTATT CCTTACCAGT GGGATGCGTT AAACGATCGT
ATTCCAGAGG CTGAACCCAG CCATGCCATT GAAAATTTCC GCATTGCCGC AGGACAGCAG
ACGGGCGACT TTTACGGCAT GGTCTTTCAG GACAGCGACG TGGCGAAATG GCTGGAAGCG
GTTGCCTGGT CACTGTGCCA GAAGCCCGAT CCCGCGCTTG AGAAAACCGC CGATGAGGTG
ATTGAACTGG TGGCCGCCGC GCAGTGTGAC GATGGCTATC TCAATACGTA CTTTACGGCA
AAAGCCCCGC AAGAACGCTG GAGCAACCTG GCGGAGTGCC ACGAGCTTTA TTGCGCCGGG
CACCTGATTG AAGCGGGCGT CGCCTTCTTT CAGGCCACCG GCAAGCGTCG GCTGCTAGAC
GTCGTTTGTC GCCTGGCCGA TCATATCGAC AGCACTTTCG GCCCTGGCGA AAATCAGCTG
CACGGCTATC CGGGCCACCC GGAAATTGAG CTGGCGTTGA TGCGTCTGTA TGAGGTAACA
GAGCAGCCGC GCTATATGAC GCTGGCAAGC TACTTTATCG GGCAGCGCGG CGCCCAACCG
CACTTCTACG ACGAAGAGTA CGAAAAACGC GGCCAAACCT CTTACTGGCA TACCTACGGC
CCGGCGTGGA TGGTCAAAGA CAAAGCCTAC AGCCAGGCGC ATCTGCCAAT TTCGCAGCAG
CAGACGGCCA TTGGCCACGC GGTACGTTTT GTCTATCTGA TGACTGGCGT GGCGCATCTC
GCTCGCCTGA GCAACGATGA AGGCAAACGC CAGGACTGCC TGCGCCTATG GAAAAATATG
GCGCAGCGTC AGCTGTATAT CACCGGAGGC ATCGGTTCAC AGAGCAGTGG CGAAGCCTTT
AGCAGCGATT ACGATTTACC GAATGATTCG GTCTATGCGG AAAGTTGCGC TTCAATCGGC
CTGATGATGT TCGCCCGCCG GATGCTGGAA ATGGAAGCCG ATAGCCAGTA CGCCGACGTG
ATGGAGCGCG CGCTGTACAA CACCGTCCTC GGCGGTATGG CGCTGGATGG CAAGCATTTC
TTCTACGTCA ACCCACTGGA AGTGCATCCA AAATCGTTAA ACTTCAACCA TATTTACGAT
CACGTTAAGC CCATCCGCCA GCGCTGGTTT GGCTGCGCCT GCTGCCCGCC GAACATCGCC
CGCGTACTCA CCTCCCTTGG TCACTACATC TACACGCCGC GTGCGGATGC GCTGTACATC
AATATGTACG TGGGTAACAG CATGGAAATA CCGGTTGAAA ATGGCGCGCT CAAACTGCGA
ATCAGCGGGA ACTACCCGTG GCATGAGCAG GTGAAGATTG CCATCGACTC TGTGCAGCCG
GTACGTCACA CGCTGGCGCT ACGTCTGCCG GACTGGTGCC CTGAGGCAAA AGTGACGCTC
AACGGGCTGG AAGTGGAGCA GGATATTCGC AAAGGTTATC TGCATATCCG TCGAACCTGG
CAGGAGGGCG ATACGATAAC CCTGACGCTG CCGATGCCGG TTCGCCGCGT GTATGGCAAT
CCGCTGGCGC GTCACGTCGC CGGTAAGGTC GCCATTCAGC GCGGGCCGCT GGTCTATTGC
CTTGAGCAGG CCGATAACGG CGAAGAACTG CATAATCTGT GGTTACCGAA AGAGAGTGAG
TTCCGGGTCT TTGAGGGCAA AGGGATTTTT GCGCATAAGA TGCTGATTCA GGCTGAAGGC
GAGAAGCAAA GCGCCCCAGA TGCGCAGCAT CAGGCGTTGT GGCACTACGA TAACGCGCCG
TCATCGCGCC AGCCGCAGAC GCTAACGTTC ATTCCGTGGT TTAGCTGGGC CAACCGTGGC
GAGGGCGAAA TGCGGATTTG GGTTAACGAG CGGTAA
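
The GC content field in the record can be recomputed from a sequence laid out like the one above. The following is a minimal sketch, assuming GC content means the percentage of G and C bases over all bases; `gc_content` is a hypothetical helper, not part of any database tool. It strips the spaces and line breaks used in the 10-base grouping before counting.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First row of the gene sequence, with its grouping spaces left in.
first_row = "ATGAACGTAC TGGAAGTCGA TCTGCATAAA CTGACGGTCA GCGATCCGTT CCTCGGACAG"
print(round(gc_content(first_row), 1))
```

Passing the full 1956 bp sequence instead of the first row gives the value to compare against the GC content field.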
 
Protein sequence
MNVLEVDLHK LTVSDPFLGQ YQQLVRDVVI PYQWDALNDR IPEAEPSHAI ENFRIAAGQQ 
TGDFYGMVFQ DSDVAKWLEA VAWSLCQKPD PALEKTADEV IELVAAAQCD DGYLNTYFTA
KAPQERWSNL AECHELYCAG HLIEAGVAFF QATGKRRLLD VVCRLADHID STFGPGENQL
HGYPGHPEIE LALMRLYEVT EQPRYMTLAS YFIGQRGAQP HFYDEEYEKR GQTSYWHTYG
PAWMVKDKAY SQAHLPISQQ QTAIGHAVRF VYLMTGVAHL ARLSNDEGKR QDCLRLWKNM
AQRQLYITGG IGSQSSGEAF SSDYDLPNDS VYAESCASIG LMMFARRMLE MEADSQYADV
MERALYNTVL GGMALDGKHF FYVNPLEVHP KSLNFNHIYD HVKPIRQRWF GCACCPPNIA
RVLTSLGHYI YTPRADALYI NMYVGNSMEI PVENGALKLR ISGNYPWHEQ VKIAIDSVQP
VRHTLALRLP DWCPEAKVTL NGLEVEQDIR KGYLHIRRTW QEGDTITLTL PMPVRRVYGN
PLARHVAGKV AIQRGPLVYC LEQADNGEEL HNLWLPKESE FRVFEGKGIF AHKMLIQAEG
EKQSAPDAQH QALWHYDNAP SSRQPQTLTF IPWFSWANRG EGEMRIWVNE R
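
The protein sequence corresponds to the gene sequence translated with translation table 11. A minimal sketch of that translation follows. It relies on the fact that table 11 uses the same codon-to-amino-acid assignments as the standard genetic code and differs only in the permitted start codons. The names `CODON_TABLE` and `translate` are hypothetical. The sketch does not convert alternative start codons such as GTG or TTG to methionine, which is sufficient here because this gene begins with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (T, C, A, G at each position, third base fastest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]  # raises KeyError on non-ACGT symbols
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGAACGTAC TGGAAGTCGA TCTGCATAAA"))  # MNVLEVDLHK
```

The output matches the first ten residues of the protein sequence, and the final codons of the gene (GAG CGG TAA) translate to the terminal residues E and R followed by a stop.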