Gene SNSL254_A1808 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A1808 
Symbol 
ID6483495 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp1774225 
End bp1775286 
Gene Length1062 bp 
Protein Length353 aa 
Translation table11 
GC content57% 
IMG OID642737184 
Producthypothetical protein 
Protein accessionYP_002040936 
Protein GI194445309 
COG category[S] Function unknown 
COG ID[COG3768] Predicted membrane protein 
TIGRFAM ID[TIGR01620] conserved hypothetical protein, TIGR01620 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value0.0000894217 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGCGAAC CGTTAAAACC GCGTATTGAT TTTGCAGAAC CGCTAAAGGA GGAACCTACG 
TCGGCCTTCA AAGCGCAGCA AACTTTTAGC GAAGCGGAGT CGCGTACATT TGCGCCTGCA
GCTATCGATG AGCGCCCGGA AGACGAAGGC GTGGCAGAAG CGGCGGTCGA TGCCGCGCTG
CGCCCCAAAC GCAGTCTGTG GCGTAAAATG GTGATGGGAG GGCTGGCGCT GTTTGGCGCG
AGCGTGGTCG GGCAAGGCGT ACAGTGGACA ATGAATGCCT GGCAAACTCA GGACTGGGTC
GCTTTAGGCG GCTGTGCCGC AGGCGCGCTG ATCATTGGCG CTGGCGTGGG ATCGGTGGTC
ACGGAGTGGC GGCGATTATG GCGCTTGCGC CAGCGGGCGC ATGAGCGCGA TGAGGCGCGT
GAACTGTTAC ATAGCCATAG CGTCGGGAAA GGTCGCGCAT TTTGCGAAAA ACTGGCGCAG
CAGGCGGGGA TTGATCAATC ACATCCGGCA TTACAACGTT GGTATGCCGC TATTCACGAA
ACGCAAAACG ACAGGGAAAT CGTCGGTTTG TATGCGAATC TGGTACAGCC GGTACTTGAC
GCGCAGGCGC GACGTGAGAT TAGCCGTTTC GCCGCGGAAT CGACTCTGAT GATCGTCGTC
AGCCCGTTAG CGTTGGTGGA TATGGCGTTT ATTGCCTGGC GTAATTTACG CCTGATTAAC
CGTATCGCAA CGCTGTATGG CATTGAACTT GGTTATTACA GCCGCCTTCG TCTGTTCCGT
CTGGTGTTGC TAAATATCGC GTTCGCGGGG GCCAGTGAGC TGGTACGTGA AGTCGGTATG
GACTGGATGT CTCAGGATCT GGCCGCACGC CTGTCTACGC GCGCGGCGCA GGGGATTGGC
GCAGGCCTCC TTACCGCTCG ACTGGGAATA AAAGCGATGG AGCTATGTCG GCCATTGCCG
TGGATCGACA ACGATAAACC ACGTCTCGGT GATTTTCGTC GTCAGCTTAT CGGTCAGCTA
AAAGAGACCC TGCAAAAGAG TAAGTCGTCG CCGGAGAAAT GA
 
Protein sequence
MSEPLKPRID FAEPLKEEPT SAFKAQQTFS EAESRTFAPA AIDERPEDEG VAEAAVDAAL 
RPKRSLWRKM VMGGLALFGA SVVGQGVQWT MNAWQTQDWV ALGGCAAGAL IIGAGVGSVV
TEWRRLWRLR QRAHERDEAR ELLHSHSVGK GRAFCEKLAQ QAGIDQSHPA LQRWYAAIHE
TQNDREIVGL YANLVQPVLD AQARREISRF AAESTLMIVV SPLALVDMAF IAWRNLRLIN
RIATLYGIEL GYYSRLRLFR LVLLNIAFAG ASELVREVGM DWMSQDLAAR LSTRAAQGIG
AGLLTARLGI KAMELCRPLP WIDNDKPRLG DFRRQLIGQL KETLQKSKSS PEK