Gene SNSL254_A4161 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A4161 
SymbolyieM 
ID6484115 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp4052570 
End bp4054021 
Gene Length1452 bp 
Protein Length483 aa 
Translation table11 
GC content56% 
IMG OID642739417 
Producthypothetical protein 
Protein accessionYP_002043126 
Protein GI194442438 
COG category[R] General function prediction only 
COG ID[COG2425] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones92 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGACGC TGGATACGCT AAATACCATG CTGGCCGTCA GCGAAGAGGG AATGGTCGAA 
GAGATGATCC TCGCGCTACT GGCTTCCCCA CAACTGGTCA TTTTCTTTGA AAAGTTTCCG
CGTTTAAAAA ACGCCGTGAC CGCCGATCTC CCGCGCTGGC GGGAAGCGCT ACGCAGCCGT
CTTAAAGACG CACGCGTTCC GCCGGAACTC ACGGAAGAGG TCATGTGTTA TCAGCAAAGC
CAACTTCTCT CTACCCCACA GTTCATCGTG CAACTGCCGC AAATACTGGC GTTGCTTCAC
CGCCTGCATT CACCGTATGC CGCGCAGGCG AAGCAGTTGA CGGAGAGCAA CAGTACCTTT
ACCCCTGCGC TACACACGCT TTTTTTGCAA CGCTGGCGGT TAAGTCTGGT CGTGCAGGCC
ACCACGTTAA ACCAACAACT ACTGGAAGAA GAGCGCGAGC AGTTGCTGAG TGACGTTCAG
GAACGGATGA CGCTGAGCGG GCAACTGGAA CCGACGCTGG CGGAAAATGA TAATGCCGCA
GGCCGCCTGT GGGATATGAG CGCGGGCCAG CTTAAACGTG GTGATTATCA ACTGATCGTA
AAATACGGCG AATTTCTCGC CGCCCAGCCG GAGCTAATGC AACTGGCGGA ACAACTGGGA
CGTTCGCGGG AAGCCAAATC GGTACCGAAA AAAGACGCGC CGATGGAAAC CTTTCGTACA
CTGGTACGCG AACCCGCTAC GGTGCCGGAG CAGGTTGACG GTATTCAGCA AGGCGATGAT
ATTCTGCGCC TGTTGCCGCC AGAGCTGGCG ACGCTCGGCA TCACCGAGCT GGAATATGAA
TTCTACCGCC GGTTAGTGGA AAAACAGCTC CTCACCTATC GCCTGCATGG CGAAGCGTGG
CGTGAGAAAG TGACCGAACG GCCGGTAGTA CACCAGGATG TCGACGAGCA GCCGCGCGGA
CCGTTTATTG TCTGCGTCGA TACTTCAGGC TCGATGGGAG GATTTAACGA GCAGTGCGCA
AAAGCGTTCT GCCTGGCGTT GATGCGCGTT GCGCTGGCGG ATAACCGCCG CTGCTTTATT
ATGCTGTTTT CCACTGACGT TGTGCGCTAT GAACTCTCCG GCCCGGAAGG TATCGAGCAG
GCCATCCGCT TTTTAAGTCA ACGTTTTCGC GGCGGCACGG ATATCGCCAG CTGTTTTCGC
GCCATTATTG AAAGAATGCA GGGACGGGAA TGGTTTGATG CCGATGCGGT GGTCATTTCG
GATTTTATCG CCCAGCGCTT GCCGGATGAC GTGGTGAGCA AAGTGGGAGA GTTGCAGCGT
CTTCACCAGC ATCGATTCCA TGCGGTGGCG ATGTCGGCGC ACGGCAAACC CGGCATCATG
CGCATTTTCG ATCATATCTG GCGCTTTGAC ACCGGGATGC GAAGCCGCCT GCTGAGACGC
TGGCGGCGCT AA
 
Protein sequence
MLTLDTLNTM LAVSEEGMVE EMILALLASP QLVIFFEKFP RLKNAVTADL PRWREALRSR 
LKDARVPPEL TEEVMCYQQS QLLSTPQFIV QLPQILALLH RLHSPYAAQA KQLTESNSTF
TPALHTLFLQ RWRLSLVVQA TTLNQQLLEE EREQLLSDVQ ERMTLSGQLE PTLAENDNAA
GRLWDMSAGQ LKRGDYQLIV KYGEFLAAQP ELMQLAEQLG RSREAKSVPK KDAPMETFRT
LVREPATVPE QVDGIQQGDD ILRLLPPELA TLGITELEYE FYRRLVEKQL LTYRLHGEAW
REKVTERPVV HQDVDEQPRG PFIVCVDTSG SMGGFNEQCA KAFCLALMRV ALADNRRCFI
MLFSTDVVRY ELSGPEGIEQ AIRFLSQRFR GGTDIASCFR AIIERMQGRE WFDADAVVIS
DFIAQRLPDD VVSKVGELQR LHQHRFHAVA MSAHGKPGIM RIFDHIWRFD TGMRSRLLRR
WRR