Gene SNSL254_A3631 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A3631 
SymboltldD 
ID6482241 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp3516391 
End bp3517836 
Gene Length1446 bp 
Protein Length481 aa 
Translation table11 
GC content57% 
IMG OID642738906 
Productprotease TldD 
Protein accessionYP_002042623 
Protein GI194445200 
COG category[R] General function prediction only 
COG ID[COG0312] Predicted Zn-dependent proteases and their inactivated homologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones49 
Fosmid unclonability p-value0.0205399 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTCTGA ACCTGGTAAG TGAACAATTG CTAGCGGCGA ATGGCCTGAA CCATCAGGAT 
CTGTTCGCTA TTTTGGGCCA ACTGGCCGAA CGCCGTCTTG ATTATGGCGA CCTCTATTTT
CAGTCGAGCT ATCACGAATC CTGGGTTTTA GAAGACCGCA TCATTAAAGA TGGTTCATAT
AATATCGACC AGGGCGTTGG CGTTCGCGCC ATTAGCGGCG AAAAAACCGG TTTTGCTTAT
GCTGACCAGA TAAGCCTCCT GGCGCTGGAG CAGAGTGCGC AGGCAGCGCG AACCATTGTA
CGCGAGAACG GCGAAGGCAA GGTAAAAACG CTCGCCGCCG TAGCGCATCA GCCGCTCTAC
ACCACCCTTG ATCCGCTGCA AAGTATGAGC CGCGAAGAGA AGCTGGATAT CCTCAGACGC
GTTGACAAAG TGGCGCGAGA AGCCGATAAA CGCGTGCAGG AAGTTAACGC CAGCCTGACC
GGCGTATATG AATTAATCCT CGTGGCGGCG ACCGACGGGA CGCTGGCGGC GGATGTCCGT
CCACTGGTGC GGTTGTCCGT TAGCGTGCAG GTGGAAGAAG ACGGTAAACG CGAGCGCGGC
GCCAGCGGCG GCGGCGGTCG CTTTGGTTAT GAGTATTTTC TTGCCGATCT CGACGGCGAG
GTGCGCGCCG ACGCGTGGGC GAAAGAAGCG GTACGCATGG CGCTGGTTAA TCTCTCCGCG
GTCGCTGCGC CAGCGGGGAC GTTACCGGTG GTTCTGGGCG CCGGGTGGCC GGGCGTATTG
CTGCACGAAG CGGTCGGTCA CGGGCTGGAA GGTGATTTCA ACCGTCGTGG GACGTCTGTG
TTTAGCGGTC AGATCGGTGA GCAGGTTGCC TCCGCGCTTT GCACCGTAGT GGATGACGGC
ACAATGATGA ACCGTCGCGG CTCCGTTGCT ATCGATGATG AAGGTACGCC AGGCCAGTAC
AACGTATTGA TTGAAAATGG CGTACTGAAA GGATACATGC AGGACAAGCT GAACGCGCGC
CTGATGGGCG CTGCGCCGAC CGGTAACGGG CGTCGCGAAT CTTATGCGCA TCTGCCGATG
CCGCGTATGA CGAATACCTA TATGTTGGCG GGGCAGTCAA CGCCGCAGGA AATTATCGAA
TCCGTTGAGT ACGGCATCTA TGCGCCTAAC TTTGGCGGCG GTCAGGTGGA TATCACCTCC
GGCAAGTTTG TGTTCTCTAC CTCGGAAGCG TATCTGATTG AAAACGGCAA AGTCACGACG
CCGGTGAAGG GCGCGACGTT GATTGGATCA GGCATTGAAA CAATGCAACA GATCTCCATG
GTCGGCAATG ACCTTAAGCT GGATAACGGG GTGGGGGTTT GCGGTAAAGA GGGGCAAAGT
CTGCCGGTAG GCGTAGGCCA GCCGACGCTG AAAGTCGATA ACCTGACGGT TGGCGGCACC
GCATAA
 
Protein sequence
MSLNLVSEQL LAANGLNHQD LFAILGQLAE RRLDYGDLYF QSSYHESWVL EDRIIKDGSY 
NIDQGVGVRA ISGEKTGFAY ADQISLLALE QSAQAARTIV RENGEGKVKT LAAVAHQPLY
TTLDPLQSMS REEKLDILRR VDKVAREADK RVQEVNASLT GVYELILVAA TDGTLAADVR
PLVRLSVSVQ VEEDGKRERG ASGGGGRFGY EYFLADLDGE VRADAWAKEA VRMALVNLSA
VAAPAGTLPV VLGAGWPGVL LHEAVGHGLE GDFNRRGTSV FSGQIGEQVA SALCTVVDDG
TMMNRRGSVA IDDEGTPGQY NVLIENGVLK GYMQDKLNAR LMGAAPTGNG RRESYAHLPM
PRMTNTYMLA GQSTPQEIIE SVEYGIYAPN FGGGQVDITS GKFVFSTSEA YLIENGKVTT
PVKGATLIGS GIETMQQISM VGNDLKLDNG VGVCGKEGQS LPVGVGQPTL KVDNLTVGGT
A