Gene SNSL254_A2321 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A2321 
Symbol 
ID6482158 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp2241094 
End bp2242455 
Gene Length1362 bp 
Protein Length453 aa 
Translation table11 
GC content54% 
IMG OID642737664 
Productpeptidase, U32 family 
Protein accessionYP_002041406 
Protein GI194445636 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0826] Collagenase and related proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones72 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTAAAC CAGAACTCCT TTCGCCGGCG GGAACGCTGA AAAATATGCG TTACGCTTTC 
GCTTACGGTG CCGATGCCGT CTATGCGGGC CAACCACGCT ACTCTTTACG CGTGCGTAAT
AACGAATTCA ATCACGAAAA TTTGCAGCTT GGCATCAACG AAGCCCACGC GCTCGGAAAA
AAATTCTACG TGGTGGTGAA CATCGCCCCG CATAACGCCA AGCTCAAAAC CTTTATCCGT
GACCTGAAAC CCGTCGTCGA GATGGGCCCG GATGCGCTGA TCATGTCCGA TCCAGGGTTG
ATTATGCTGG TACGCGAGCA CTTCCCGACA ATGCCGATTC ACCTGTCGGT ACAGGCTAAC
GCCGTAAACT GGGCGACGGT AAAATTCTGG CAGCAGATGG GGCTGACCCG TGTGATTCTC
TCCCGCGAAC TGTCGCTGGA AGAGATTGAG GAAATTCGCC AGCAGGTGCC GGATATGGAA
ATAGAAATTT TCGTCCACGG CGCGCTATGC ATGGCCTATT CCGGCCGCTG CCTGCTTTCC
GGCTACATCA ATAAACGCGA TCCGAATCAG GGCACCTGCA CCAATGCCTG CCGTTGGGAA
TATAACGTGC AGGAAGGAAA AGAAGACGTT GTCGGCAACA TCGTGCATAA GCACGAACCG
ATTCCGGTAC AGAACGTTGA GCCGACGCTC GGTATCGGCG CGCCGACGGA TAAAGTGTTT
ATGATAGAAG AGGCCCAAAG ACCGGGCGAA TACATGACCG CGTTCGAAGA CGAGCATGGC
ACCTATATCA TGAACTCAAA AGATTTGCGC GCTATCGCCC ACGTGGAGCG CCTGACGAAA
ATGGGCGTCC ACTCGCTGAA AATCGAAGGC CGCACCAAAT CCTTTTATTA CTGCGCCCGT
ACCGCGCAGG TCTACCGTAA GGCCATCGAC GACGCCGCCG CGGGTAAACC CTTCGACCCT
ACGCTGCTGG AAACGTTGGA AGGTCTGGCT CATCGCGGCT ATACCGAAGG TTTCCTGCGT
CGCCATACGC ACGACGATTA CCAGAATTAC GAGTACGGGT ACTCCGTTTC CGAACGCCAG
CAATTTGTCG GCGAGTTCAC CGGCGAGCGT AAAGGCCAAC TGGCGGCCGT GACGGTGAAA
AATAAATTCT CCGTTGGCGA TAGTCTGGAG CTGATGACAC CGCAGGGAAA TATCCATTTC
ACCCTGGAAC AGATGGAGAA CGCCAAAGGC GACGCTATGC CGGTGGCACC TGGCGATGGC
TATACCGTCT GGATGCCCGT CCCGCAGGAC GTTACGCTGG ATTACGCACT ATTGATGCGT
AATTTCTCAG GCGAATCAAC GCGTAACCCC TATGCTAAGT AG
 
Protein sequence
MFKPELLSPA GTLKNMRYAF AYGADAVYAG QPRYSLRVRN NEFNHENLQL GINEAHALGK 
KFYVVVNIAP HNAKLKTFIR DLKPVVEMGP DALIMSDPGL IMLVREHFPT MPIHLSVQAN
AVNWATVKFW QQMGLTRVIL SRELSLEEIE EIRQQVPDME IEIFVHGALC MAYSGRCLLS
GYINKRDPNQ GTCTNACRWE YNVQEGKEDV VGNIVHKHEP IPVQNVEPTL GIGAPTDKVF
MIEEAQRPGE YMTAFEDEHG TYIMNSKDLR AIAHVERLTK MGVHSLKIEG RTKSFYYCAR
TAQVYRKAID DAAAGKPFDP TLLETLEGLA HRGYTEGFLR RHTHDDYQNY EYGYSVSERQ
QFVGEFTGER KGQLAAVTVK NKFSVGDSLE LMTPQGNIHF TLEQMENAKG DAMPVAPGDG
YTVWMPVPQD VTLDYALLMR NFSGESTRNP YAK