Gene SNSL254_pSN254_0112 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_pSN254_0112 
Symbol 
ID4929465 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_009140 
Strand
Start bp99821 
End bp101617 
Gene Length1797 bp 
Protein Length598 aa 
Translation table11 
GC content53% 
IMG OID642572411 
Productvon Willebrand factor type A domain-containing protein 
Protein accessionYP_001101986 
Protein GI134047208 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG4548] Nitric oxide reductase activation protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones55 
Fosmid unclonability p-value0.237342 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCAAAA AACGCACCAT TTACAGCGCT CTACCAATCG TGGCCGCAGC CTATGGTGAA 
AAACTCGGTG TCAAAGTCGC CATCGGTAAC GATGACGCAT ACACCGATGG TAAGACCATC
GTGGTTCCGA ATATCCCCGA CGACTATCCT CACATGGATG CTGTCTGGGG GTATTTGGCC
CATGAAGCAG CCCATGTCCG GTTTACGGAC TTTGGTGTTG AGCGCCGCAG AGGTCTTCAT
GCTGAGTTGT CCAACGTTTT GGAGGACTGC CGCATAGAAC GGGCCATGAT GGAACTCTTC
CCCGGTACGT CGCAGACCCT GAATGAGGTT GCTCGCTATA TGGCTCAAGC TGGTCATTAC
GAGCACGTCA CAGACAAAGA GGCCCCTGCC TCCATTCTGA CAGGGTTTTG TTTGTACTGG
TTGCAAACCA AGGCTGTAGG GCAATCCGTC CTTCAACCCT ATCTCGATTC GGCTACCCCC
GTATTCGAGC GCGTGTTCCC TCAGGGTGTT GTTGTTCGGC TGAACGCTTT ACTGCGTAAG
GCTGTGAACA CTAAGTCAAC CGCAGAGGTG ACATCCTTGG CCGACCAAAT CATCAAGATG
ATCGAGGAGG AAAAGGAGAA AGAAGAGCAA AAGCCCCAGA ATGGTCAGGA TGGTAACAAC
CAGCAGAATG CTGGTGGCAA CCAACCTCAG AACAGTCAGG GCGGAAGTGG TAACGATCAA
AACCAAGGGC CTGATGCCAA TGGTGGTGAT CAGCAAGGTA AAGACCAGAA GCAAGACGAT
GCTAACGGGA AATCTGATCC GAAAGGACAG GGCGACCAAG GCAAGTCGGA TACTGATGGT
GGCAGCAAGG CAGGACAAAG CCAGGCTGGT GGCAATTCTG ACGCGGCGAA ACAAGACGCG
GCCAAAATGC TCCAGCAGGT TCTGAATGCC GGTGCCGGTG ATTTGCGTGG TGACGCGCAT
GATGCACTTA AAGCCGAGCT CAACCGGGTG GCTCAAGATA AGGGGGACAG TAGCTATATG
ACTGTTCGCT CTGCTGTGAA CACCCAGGAC AACCCTGCTG TTGGCAAAAG CCTGGTAGGG
GATGTGAAGA GCACCACTTC AAAGATAAGA ACGCAGCTCT ACGGATTGGT CCAGGCCAGC
CAGCGAGTTG CTCACCGTAA CCAACGATCA GGGAAGCGTG TGGATGCTCG GAAACTACAT
CGTGTAGTGA CGGGTGATAC CCGCGTATTC CTCAAGCCAG AAGCCAAGAA ACGCCCTAAT
ACGGCGGTTC ACATCCTGGT TGATATGAGC TCCTCGATGG CCTACAAGGC CGCCAATGGA
AAGGAGCGTC AAGACATTGC GCGGGAAGCG TCCTTGGCTA TTTCGATGGC TCTGGAAGCA
ATACCCGGCG TAAACCCGGC AGTCACCTTT TTTGGTGGCA ACCGGAACCA GCCAGTGTTC
AGTGTCGTGA AGCATGGAGA TACGGTTCAG AATCGGGCCG GTCGGTTTGG GTTCAAAGCA
ACTGGCGGTA CGCCTATGGC GGAAGCTATG TGGTATGCAG CTTTTGAACT CACCAAGACC
CGTGAAGAGC GAAAAATGTT GATCGTAGTG ACTGACGGGC AGCCTCAAAG CGCCCCGGCA
TGTCGCTCAG TGATTGACCT CTGTGAACGA AGCGATGTTG AGGTGATCGG CATAGGGGTA
GAGACTACCG CAGTGTCAGG ACTGTTCCAA AAGAACATTG TCATTGATGA TGCGGCAGCT
CTGCAACGCA CACTGTTTAA GTTGATGGAG CGGTCATTGA CTGCTTTTGC AGCTTAA
 
Protein sequence
MSKKRTIYSA LPIVAAAYGE KLGVKVAIGN DDAYTDGKTI VVPNIPDDYP HMDAVWGYLA 
HEAAHVRFTD FGVERRRGLH AELSNVLEDC RIERAMMELF PGTSQTLNEV ARYMAQAGHY
EHVTDKEAPA SILTGFCLYW LQTKAVGQSV LQPYLDSATP VFERVFPQGV VVRLNALLRK
AVNTKSTAEV TSLADQIIKM IEEEKEKEEQ KPQNGQDGNN QQNAGGNQPQ NSQGGSGNDQ
NQGPDANGGD QQGKDQKQDD ANGKSDPKGQ GDQGKSDTDG GSKAGQSQAG GNSDAAKQDA
AKMLQQVLNA GAGDLRGDAH DALKAELNRV AQDKGDSSYM TVRSAVNTQD NPAVGKSLVG
DVKSTTSKIR TQLYGLVQAS QRVAHRNQRS GKRVDARKLH RVVTGDTRVF LKPEAKKRPN
TAVHILVDMS SSMAYKAANG KERQDIAREA SLAISMALEA IPGVNPAVTF FGGNRNQPVF
SVVKHGDTVQ NRAGRFGFKA TGGTPMAEAM WYAAFELTKT REERKMLIVV TDGQPQSAPA
CRSVIDLCER SDVEVIGIGV ETTAVSGLFQ KNIVIDDAAA LQRTLFKLME RSLTAFAA