Gene SeHA_C3666 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C3666 
SymboltldD 
ID6488692 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp3548762 
End bp3550207 
Gene Length1446 bp 
Protein Length481 aa 
Translation table11 
GC content57% 
IMG OID642743784 
Productprotease TldD 
Protein accessionYP_002047396 
Protein GI194450951 
COG category[R] General function prediction only 
COG ID[COG0312] Predicted Zn-dependent proteases and their inactivated homologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones74 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTCTGA ACCTGGTAAG TGAACAATTG CTAGCGGCGA ATGGCCTGAA CCATCAGGAT 
CTGTTCGCTA TTTTGGGCCA ACTGGCCGAA CGCCGTCTTG ATTATGGCGA CCTCTATTTT
CAGTCGAGCT ATCACGAATC CTGGGTTTTA GAAGACCGCA TCATTAAAGA TGGTTCATAT
AATATCGACC AGGGCGTTGG CGTTCGCGCC ATTAGCGGCG AAAAAACCGG TTTTGCTTAT
GCTGACCAGA TAAGCCTCCT GGCGCTGGAG CAGAGTGCGC AGGCAGCGCG AACCATTGTA
CGCGATAACG GCGAAGGCAA GGTAAAAACG CTCGCCGCCG TAGCGCATCA GCCGCTCTAC
ACCACCCTTG ATCCACTGCA AAGTATGAGC CGCGAAGAGA AGCTGGATAT CCTCAGACGC
GTTGACAAAG TGGCGCGAGA AGCCGATAAA CGCGTGCAGG AAGTTAACGC CAGCCTGACC
GGCGTATATG AATTAATCCT CGTGGCGGCG ACCGACGGGA CGCTGGCGGC GGATGTCCGT
CCACTGGTGC GGTTGTCCGT TAGCGTGCAG GTGGAAGAAG ACGGTAAACG CGAGCGCGGC
GCCAGCGGCG GCGGCGGTCG CTTTGGTTAT GAGTATTTTC TTGCCGATCT CGACGGCGAG
GTTCGCGCCG ACGCGTGGGC GAAAGAAGCG GTACGCATGG CGCTGGTTAA TCTCTCCGCG
GTCGCTGCGC CAGCGGGGAC GTTACCGGTG GTTCTGGGCG CCGGGTGGCC GGGCGTATTG
CTGCACGAAG CGGTCGGGCA CGGGCTGGAA GGTGATTTTA ACCGTCGTGG GACGTCTGTG
TTTAGCGGTC AGATCGGCGA GCAGGTTGCC TCCGCGCTTT GCACCGTAGT GGATGACGGC
ACAATGATGA ACCGTCGTGG CTCCGTTGCT ATCGATGATG AAGGTACGCC AGGCCAGTAC
AACGTATTGA TTGAAAATGG CGTACTGAAA GGATACATGC AGGACAAGCT GAACGCGCGC
CTGATGGGCG CTGCGCCGAC CGGTAACGGG CGTCGCGAAT CTTATGCGCA TCTGCCGATG
CCGCGTATGA CGAATACCTA TATGTTGGCG GGGCAGTCAA CGCCGCAGGA AATTATCGAA
TCCGTTGAGT ACGGCATCTA TGCGCCTAAC TTTGGCGGCG GTCAGGTGGA TATCACCTCC
GGCAAGTTTG TGTTCTCTAC CTCGGAAGCG TATCTGATTG AAAACGGCAA AGTCACGACG
CCGGTGAAGG GCGCGACGTT AATTGGATCA GGCATTGAAA CGATGCAACA GATCTCCATG
GTCGGCAATG ACCTTAAGCT GGATAACGGG GTGGGGGTTT GCGGTAAAGA GGGGCAAAGT
CTGCCGGTAG GCGTAGGCCA GCCGACGCTG AAAGTCGATA ACCTGACGGT TGGCGGCACC
GCATAA
 
Protein sequence
MSLNLVSEQL LAANGLNHQD LFAILGQLAE RRLDYGDLYF QSSYHESWVL EDRIIKDGSY 
NIDQGVGVRA ISGEKTGFAY ADQISLLALE QSAQAARTIV RDNGEGKVKT LAAVAHQPLY
TTLDPLQSMS REEKLDILRR VDKVAREADK RVQEVNASLT GVYELILVAA TDGTLAADVR
PLVRLSVSVQ VEEDGKRERG ASGGGGRFGY EYFLADLDGE VRADAWAKEA VRMALVNLSA
VAAPAGTLPV VLGAGWPGVL LHEAVGHGLE GDFNRRGTSV FSGQIGEQVA SALCTVVDDG
TMMNRRGSVA IDDEGTPGQY NVLIENGVLK GYMQDKLNAR LMGAAPTGNG RRESYAHLPM
PRMTNTYMLA GQSTPQEIIE SVEYGIYAPN FGGGQVDITS GKFVFSTSEA YLIENGKVTT
PVKGATLIGS GIETMQQISM VGNDLKLDNG VGVCGKEGQS LPVGVGQPTL KVDNLTVGGT
A