Gene SNSL254_A3885 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A3885 
Symbol 
ID6486114 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp3763116 
End bp3764603 
Gene Length1488 bp 
Protein Length495 aa 
Translation table11 
GC content54% 
IMG OID642739149 
Producthypothetical protein 
Protein accessionYP_002042860 
Protein GI194444038 
COG category[R] General function prediction only 
COG ID[COG0612] Predicted Zn-dependent peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones80 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGGGCA CAAAAATTCG ACTCTTAGCG GGCAGTCTGT TGATGTTGGC CTCTGCCGGC 
TATGTGCAGG CAGATGCGCT CCAGCCCGAT CCGGCATGGC AACAGGGGAC GCTGGCTAAT
GGGTTACAGT GGCAAGTGTT GGCTACGCCT CAGCGCCCCA GCGATCGTAT TGAAGTTCGT
CTCCAGGTTA ATACCGGTTC GCTCACCGAA AGTACGCAAC AGAGCGGGTT CAGCCATGCG
ATTCCCCGTA TCGCGCTGAC GCAAAGCGGT GGTCTGGATG CCGCACAGGC ACGTTCTTTA
TGGCAGCAAG GGTTTGATCC TAAACGTCCC ATGCCGCCCG TTATTGTTTC TTATGATTCC
ACGCTCTATA ACCTCAGTTT ACCCAATAAC CGTAACGATC TGCTGAAAGA AGCGCTGACC
TATCTGGCTA ACGTCTCCGG TAAATTAACC ATTACGCCAG AGACGGTGAA TCATGCGTTA
AGCAGCGAAG ATATGGTTGC GACGTGGCCA GCAGATACTA AAGAGGGCTG GTGGCGTTAT
CGGCTGAAAG GGTCGGCGTT ATTGGGGCAC GATCCCGCGG AACCGTTAAA GCAGCCGGTA
GACGCAGCCA AAATTCAGGC TTTCTATGAA AAATGGTACA CCCCGGATGC CATGACGCTG
ATTGTTGTCG GCAACATTGA TGCGCGCTCC GTCGCCGAGC AGATCAATAA AACGTTCGGT
ACGCTGAAAG GTAAACGCGA AACGCCCGCC CCGGTGCCGA CGCTTTCGCC GCTGCGGGCG
GAATCAGTGA GCATTATGAC CGATGCGGTG CGCCAGGATC GTCTCTCCAT TATGTGGGAT
ACGCCGTGGC AACCGATTCG CGAGTCGGCA GCGCTGTTGC GCTACTGGCA GGCGGATCTG
GCGCGCGAAG CGCTGTTCTG GCATATCCAG CAAGAGCTTA CTAAAAATAA CGCGAAAGAT
ATTGGTCTGG GGTTTGACTG CCGGGTTCTG TTCCTGCGCG CGCAGTGCGC CATCAACATT
GAATCACCTA ATGATAAGCT CAATACCAAT TTGAGCCTGG TGGCGAATGA ACTGGCGAAA
GTACGCGATA AAGGTTTGTC GGAAGAGGAG TTTACTGCGC TGGTGGCGCA GAAAAATCTC
GAATTGCAAA AGCTGTTCGC GACCTACGCG CGTACCGATA CTGACATTTT GACTGGACAG
CGTATGCGCT CGCTGCAGAA TCAAGTGGTG GATATCGCGC CGGAGCAGTA TCAGAAGCTG
CGCCAGAATT TCCTCAACAG CCTGACCGTC GATATGCTCA ATCAGAATCT ACGTCAGCAG
CTATCGCAGG AGATGGCATT AATTTTGCTG CAACCGCAAG GCGAGCCGGA ATTTAATATG
AAGGCGTTAA AGGCGACGTG GGATGAAATC ATGGTCCCGA CAACTGCCGC CGCTGTTGAA
GCAGATGAGG CGCATCCGGA AGTGACGGAG ACACCGGCGG CACAGTAA
 
Protein sequence
MQGTKIRLLA GSLLMLASAG YVQADALQPD PAWQQGTLAN GLQWQVLATP QRPSDRIEVR 
LQVNTGSLTE STQQSGFSHA IPRIALTQSG GLDAAQARSL WQQGFDPKRP MPPVIVSYDS
TLYNLSLPNN RNDLLKEALT YLANVSGKLT ITPETVNHAL SSEDMVATWP ADTKEGWWRY
RLKGSALLGH DPAEPLKQPV DAAKIQAFYE KWYTPDAMTL IVVGNIDARS VAEQINKTFG
TLKGKRETPA PVPTLSPLRA ESVSIMTDAV RQDRLSIMWD TPWQPIRESA ALLRYWQADL
AREALFWHIQ QELTKNNAKD IGLGFDCRVL FLRAQCAINI ESPNDKLNTN LSLVANELAK
VRDKGLSEEE FTALVAQKNL ELQKLFATYA RTDTDILTGQ RMRSLQNQVV DIAPEQYQKL
RQNFLNSLTV DMLNQNLRQQ LSQEMALILL QPQGEPEFNM KALKATWDEI MVPTTAAAVE
ADEAHPEVTE TPAAQ