Gene SNSL254_A2049 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A2049 
Symbol 
ID6483800 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp1990704 
End bp1992023 
Gene Length1320 bp 
Protein Length439 aa 
Translation table11 
GC content55% 
IMG OID642737405 
Producthypothetical protein 
Protein accessionYP_002041155 
Protein GI194446438 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0739] Membrane proteins related to metalloendopeptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.245619 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones59 
Fosmid unclonability p-value0.249878 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCAACAGA TAGCCCGCTC TGTCGCCCTG GCATTTAATA ATCTGCCCCG ACCCCACCGC 
GTTATGCTGG GGTCACTTAC CGTTCTGACA CTGGCCGTCG CCGTATGGCG GCCCTATGTT
TACCACCCAG AATCCGCACC AACCGTTAAA ACTATTGAAC TGGAGAAAAG CGAGATTCGT
TCCCTCTTAC CGGAGGCCAG CGAACCCATC GATCAGGCCG CGCAGGAAGA TGAAGCTATT
CCTCAGGATG AGCTGGACGA TAAAACCGCA GGCGAAGTCG GCGTCCATGA ATACGTCGTC
TCCACAGGCG ATACGTTAAG CAGCATTCTG AATCAGTACG GCATCGATAT GAGCGATATT
AGCCGACTTG CCGCTTCTGA TAAGGAGCTG CGCCATCTGA AAATTGGCCA ACAGCTTTCC
TGGACACTGA CCGCCGATGG CGATTTACAG CGTCTGACAT GGGAAGTCTC CCGCCGTGAA
ACGCGTACCT ACGATCGCAC TGCCAACGGT TTTAAAATGA GCAGTGAAAT GCAGCAGGGG
GACTGGGTTA ACAGTCTGCT GAAAGGTACG GTAGGGGGTA GCTTTGTCGC CAGCGCGAAA
GAGGCCGGTT TAACCAGCAG CGAAATCAGC GCAGTGATAA AAGCAATGCA GTGGCAGATG
GATTTTCGCA AGCTGAAAAA GGGCGATGAA TTTTCGGTTC TGATGTCGCG CGAGATGCTG
GATGGCAAGC GTGAACAGAG TCAGTTGTTG GGCGTGCGGA TGCGTTCCGA TGGTAAAGAT
TACTACGCCA TTCGCGCCGC TGACGGTAAA TTCTATGACC GTAACGGTGT TGGCCTGGCG
AAAGGCTTTT TACGCTTCCC GACCGCCAAA CAGTTCCGCA TCTCCTCCAA CTTCAATCCG
CGTCGTCTGA ACCCGGTTAC CGGACGCGTT GCGCCGCATC GTGGCGTTGA CTTTGCGATG
CCGCAGGGTA CGCCGGTGCT GTCGGTGGGG GATGGCGAGG TCGTGGTCGC TAAACGTAGC
GGCGCTGCTG GTTACTACAT TGCGATTCGT CATGGACGCA CCTACACCAC ACGTTACATG
CACTTGCGTA AGCTGCTGGT GAAACCGGGG CAAAAAGTGA AACGTGGCGA TCGTATTGCG
CTTTCTGGTA ACACCGGGCG TTCCACAGGG CCGCATCTGC ATTATGAGGT ATGGATCAAC
CAGCAAGCCG TTAACCCTCT AACAGCAAAA TTGCCGCGCA CGGAAGGTCT GACGGGGTCA
GATCGTCGTG AATACCTGGC ACAGGTGAAA GAGGTTCTGC CGCAACTGCG CTTCGATTAA
 
Protein sequence
MQQIARSVAL AFNNLPRPHR VMLGSLTVLT LAVAVWRPYV YHPESAPTVK TIELEKSEIR 
SLLPEASEPI DQAAQEDEAI PQDELDDKTA GEVGVHEYVV STGDTLSSIL NQYGIDMSDI
SRLAASDKEL RHLKIGQQLS WTLTADGDLQ RLTWEVSRRE TRTYDRTANG FKMSSEMQQG
DWVNSLLKGT VGGSFVASAK EAGLTSSEIS AVIKAMQWQM DFRKLKKGDE FSVLMSREML
DGKREQSQLL GVRMRSDGKD YYAIRAADGK FYDRNGVGLA KGFLRFPTAK QFRISSNFNP
RRLNPVTGRV APHRGVDFAM PQGTPVLSVG DGEVVVAKRS GAAGYYIAIR HGRTYTTRYM
HLRKLLVKPG QKVKRGDRIA LSGNTGRSTG PHLHYEVWIN QQAVNPLTAK LPRTEGLTGS
DRREYLAQVK EVLPQLRFD