Gene SeD_A3629 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A3629 
Symbol 
ID6874394 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp3481400 
End bp3482395 
Gene Length996 bp 
Protein Length331 aa 
Translation table11 
GC content59% 
IMG OID642786612 
Productpeptidase, U32 family 
Protein accessionYP_002217248 
Protein GI198245611 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0826] Collagenase and related proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones85 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGCTGC TCTGCCCTGC CGGAAATCTC CCGGCGCTGA AGGCGGCCAT CGAAAACGGC 
GCTGACGCCG TTTATATCGG GTTAAAAGAC GATACCAATG CCCGTCATTT TGCCGGCCTT
AACTTTACCG AAAAAAAATT GCAGGAAGCG GTGAGTTTTG TTCACCAGCA TCGCCGCAAA
TTACACATCG CCATTAATAC TTTTGCGCAT CCGGACGGCT ATGCCCGCTG GCAGCGCGCC
GTGGATATGG CGGCGCAGCT TGGCGCCGAC GCGTTGATTC TCGCCGACCT CGCTATGCTG
GAATATGCCG CAGAGCGTTA CCCGCATATT GAGCGCCATG TTTCCGTTCA GGCCTCGGCA
ACCAACGAAG AGGCGATTCG CTTTTATCAC CGCAACTTTG ATGTCCACCG TGTAGTACTG
CCGCGTGTAC TGTCGATTCA CCAGGTAAAA CAACTGGCCC GCGTCACGCC GGTGCCGCTG
GAGGTATTCG CGTTTGGCAG CCTGTGCATT ATGGCGGAAG GTCGCTGCTA TCTTTCTTCC
TACCTGACGG GGGAGTCGCC CAATACCGTC GGCGCCTGCT CTCCCGCCCG CTTTGTCCGT
TGGCAGCAAA CGCCGCAGGG GCTGGAATCG CGCCTGAATG ATGTCCTGAT TGACCGTTAC
CAGGATGGCG AAAACGCAGG CTACCCAACG CTGTGTAAAG GCCGCTATTT AGTGGACGGC
GAGCGCTATC ACGCGCTGGA GGAGCCAACC AGCCTGAACA CGCTGGAACT GCTGCCGGAG
CTCATGGCGG CGAACATCGC TTCGGTGAAG ATCGAAGGCC GCCAGCGCAG CCCGGCCTAC
GTCAGCCAGG TGGCGAAAGT GTGGCGCCAG GCGATCGATC GCTGCAAAGC CGCCCCGCAA
AACTTCGTTC CACAGCGCGA CTGGATGGAG ACGCTCGGCG CGATGTCCGA AGGCACCCAA
ACCACGCTTG GCGCATATCA CCGTAAATGG CAGTGA
 
Protein sequence
MELLCPAGNL PALKAAIENG ADAVYIGLKD DTNARHFAGL NFTEKKLQEA VSFVHQHRRK 
LHIAINTFAH PDGYARWQRA VDMAAQLGAD ALILADLAML EYAAERYPHI ERHVSVQASA
TNEEAIRFYH RNFDVHRVVL PRVLSIHQVK QLARVTPVPL EVFAFGSLCI MAEGRCYLSS
YLTGESPNTV GACSPARFVR WQQTPQGLES RLNDVLIDRY QDGENAGYPT LCKGRYLVDG
ERYHALEEPT SLNTLELLPE LMAANIASVK IEGRQRSPAY VSQVAKVWRQ AIDRCKAAPQ
NFVPQRDWME TLGAMSEGTQ TTLGAYHRKW Q