Gene SeD_A2481 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A2481 
Symbol 
ID6871151 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp2361307 
End bp2362668 
Gene Length1362 bp 
Protein Length453 aa 
Translation table11 
GC content54% 
IMG OID642785564 
Productpeptidase, U32 family 
Protein accessionYP_002216222 
Protein GI198242901 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0826] Collagenase and related proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.433184 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones72 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTAAAC CAGAACTTCT TTCGCCGGCG GGAACGCTGA AAAATATGCG TTACGCTTTC 
GCTTACGGTG CCGATGCCGT CTATGCGGGC CAACCACGCT ACTCTTTACG CGTGCGTAAT
AACGAATTCA ATCACGAAAA TTTGCAGCTT GGCATCAACG AAGCCCACGC GCTCGGAAAA
AAATTCTACG TGGTGGTGAA CATCGCCCCG CATAACGCCA AGCTCAAAAC CTTTATCCGT
GACCTGAAAC CCGTCGTCGA GATGGGCCCG GATGCGCTGA TCATGTCCGA TCCAGGGTTG
ATTATGCTGG TACGCGAGCA CTTCCCGGCA ATGCCGATTC ACCTGTCGGT ACAGGCTAAC
GCCGTAAACT GGGCGACGGT AAAATTCTGG CAGCAGATGG GGCTGACCCG CGTGATTCTC
TCCCGCGAGC TGTCACTGGA AGAGATTGAG GAAATTCGCC AGCAGGTGCC GGATATGGAA
ATAGAAATTT TCGTCCACGG CGCGCTATGC ATGGCCTATT CCGGTCGCTG CCTGCTTTCC
GGCTACATCA ATAAACGCGA TCCGAATCAG GGCACCTGCA CCAATGCCTG CCGTTGGGAA
TATAACGTGC AGGAAGGAAA AGAAGACGTT GTCGGCAACA TCGTGCATAA GCATGAACCG
ATTCCGGTAC AGAACGTTGA GCCGACGCTC GGTATCGGCG CGCCGACGGA TAAAGTGTTT
ATGATAGAAG AGGCCCAAAG ACCGGGCGAA TACATGACCG CGTTCGAAGA CGAGCATGGC
ACCTATATCA TGAACTCAAA AGATTTGCGC GCTATCGCCC ACGTGGAGCG CCTGACGAAA
ATGGGCGTCC ACTCGCTGAA AATCGAAGGC CGCACCAAAT CCTTTTATTA CTGCGCCCGT
ACCGCGCAGG TCTACCGTAA GGCCATCGAC GACGCCGCCG CGGGTAAACC TTTCGACCCT
ACGCTGCTGG AAACGCTGGA AGGTCTGGCT CATCGCGGCT ATACCGAAGG TTTCCTGCGT
CGCCATACGC ACGACGATTA CCAGAATTAC GAGTACGGGT ACTCCGTTTC CGAACGCCAG
CAATTTGTCG GCGAGTTCAC CGGTGAGCGT AAAGGCCAAC TGGCGGCCGT GGCGGTGAAG
AATAAATTCT CCGTTGGCGA TAGTCTGGAG CTGATGACAC CGCAGGGAAA TATCAATTTC
ACCCTGGAAC AGATGGAGAA CGCCAAAGGA GACGCTATGC CGGTGGCACC CGGCGATGGC
TATACCGTCT GGATGCCCGT CCCGCAGGAC GTTACGCTGG ATTACGCACT ATTGATGCGT
AATTTCTCAG GCGAATCAAC GCGTAACCCC CATGGTAAGT AG
 
Protein sequence
MFKPELLSPA GTLKNMRYAF AYGADAVYAG QPRYSLRVRN NEFNHENLQL GINEAHALGK 
KFYVVVNIAP HNAKLKTFIR DLKPVVEMGP DALIMSDPGL IMLVREHFPA MPIHLSVQAN
AVNWATVKFW QQMGLTRVIL SRELSLEEIE EIRQQVPDME IEIFVHGALC MAYSGRCLLS
GYINKRDPNQ GTCTNACRWE YNVQEGKEDV VGNIVHKHEP IPVQNVEPTL GIGAPTDKVF
MIEEAQRPGE YMTAFEDEHG TYIMNSKDLR AIAHVERLTK MGVHSLKIEG RTKSFYYCAR
TAQVYRKAID DAAAGKPFDP TLLETLEGLA HRGYTEGFLR RHTHDDYQNY EYGYSVSERQ
QFVGEFTGER KGQLAAVAVK NKFSVGDSLE LMTPQGNINF TLEQMENAKG DAMPVAPGDG
YTVWMPVPQD VTLDYALLMR NFSGESTRNP HGK