Gene SNSL254_A0438 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A0438 
Symbol 
ID6482867 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp450643 
End bp453783 
Gene Length3141 bp 
Protein Length1046 aa 
Translation table11 
GC content58% 
IMG OID642735861 
Productexonuclease subunit SbcC 
Protein accessionYP_002039635 
Protein GI194446724 
COG category[L] Replication, recombination and repair 
COG ID[COG0419] ATPase involved in DNA repair 
TIGRFAM ID[TIGR00618] exonuclease SbcC 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones95 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATTC TCAGTTTGCG TCTGAAAAAC CTGAACTCGC TGAAAGGAGA ATGGAAGGTC 
GATTTTACCG CCGAACCGTT TGCCAGTAAC GGGTTATTCG CTATTACCGG CCCGACCGGC
GCGGGAAAAA CCACCTTACT TGACGCCATC TGCCTGGCGC TGTACCACGA AACGCCGCGC
CTGAATACGG TATCGCAGTC ACAAAACGAT TTGATGACGC GCGATACGGC GGAATGTCTG
GCCGAAGTGG AGTTCGAGGT AAAAGGCGAA GCCTGGCGCG CGTTCTGGAG CCAGAACCGA
GCCCGAAATC AGCCTGACGG CAATCTCCAG GCCCCACGCG TGGAGCTGGC GCGGTGTTCA
GACGGAAAGA TCTTTGCCGA TAAGGTAAAG GATAAACTGG AGATGATCGC CACGTTGACC
GGGCTGGACT ACGGGCGCTT CACCCGTTCG ATGTTGTTAT CACAGGGGCA GTTCGCCGCC
TTTCTTAACG CCAAAGCGAA AGAGCGCGCA GAGCTTCTGG AGGAGCTCAC CGGCACTGAG
ATTTACGGGC AGATCTCCGC ACAGGTATTT GAAAAACATA AATCAGCCCG TCTCGAACTG
GAAAAGTTAC AAGCGCAGGC CAGCGGCGTC GCCCTATTGG CGGACGAACA GTTACAGCAA
CTGGAAGCTA GTTTGCAGGC GCTTACTGAC GAAGAGAAAC GCCTGCTGGC CGACCAGCAG
GTACAGCAGC AGCACCTTCA CTGGCTGACC CGAAAAAACG AGCTCCACAC TGAATTACAC
GCCCGGCAGC AGGCGCTGTA CGCGGCGCAA GAGGCGCGAG AAAAAGCGCA GCCGCAGCTC
GCGGCGCTGA CTCTGGCGCA ACCTGCTCGC CAGTTGCGTC CGCATTGGGA GCGCATTCAG
GAACAAACCC GCGCCGTTGA GCGTGTTCGC CAGCACAGTG ATGAAGTGAA TGCTCGCTTA
CAAAGCGCGT ATCGCCTGCG CCAGCGTATC CGCGCCTGCG CGCATCGACA ATTCACACAA
CTGAACGCCA CAGGGCAGCG GTTGAAGACA TGGCTGGCGG AACACGACAG CATCCGCGTC
TGGCGCAGCG AACTGGCGGG GTGGCGAGCA TTATTAACAC AACAATCCCA CGACCGGGCG
CAACTGAGCC AATGGCAACA GCAGTTGCTG AGCGATACGC GTCAGCGTGA TGCGCTACCG
CCGCTCACGC TCGACCTCAC GCCACAGGCG CTGGCGGAAG CCAGGGCATT GCATACCCGA
CAGCGTCCGC TACGCCATCG TCTCGCCGCG CTTCAGGGGC AAATTCTCCC GAAACAAAAG
CGTCAGGCGC AGCTACAGGC AGCTATCGCG CGTCATCATC AGGAGCAGGC GCAATACACC
CAACGTCTGG CGGATAAGCG CCTGAGTTAT AAGACTAAAG CGCAGGAGCT TGCGGACGTT
CGCACTATTT GCGAGCAGGA GGCGCGCATC AAAGATCTGG AAAGCCAGCG GGCGCACTTA
CAGTCCGGCC AGCCGTGTCC GCTTTGCGGC TCGACGACGC ACCCCGCCAT CGCCGCCTAT
CAGGCGCTGG AACTGAGCGC GAATCAAACC CGCCGCGACG CGCTGGAAAA AGAGGTCAAA
ACGCTGGCGG AAGAAGGCGC GGCGCTGCGA GGACAGTTAG ATGCCTTAAC GCAACAATTA
CAGCGGGATG AAAGCGAGGC GCAGTCGTTG TTGCAGGAGG AGCAAGCGCT TACTGAGGAA
TGGCAAACGC TGTGCGCTAC GCTGGGCGTT CAGCTCCAGC CGCAAGAAGA CCTAGCGGGC
TGGCTGACCG CCGCGGAAGA GCATGAGCAA CAGTTGGATC AGCTCAGCCA GCGTCATGCT
CTACAAACGC AAATCGCGGC GCATACTGAA CAGGTAGCGC GCTTTACCGC GCAGATTGCG
CAGCGTCAGG CGTCGCTGAC GGCTGATTTG GCGCAGTATA CGCTTTCCCT TCCTGCGCCA
GAAGACGAGG CTTCGTGGCT GAATGAGCGC GCCGACGAAG CGAAAATATG GCAACAGCGT
CAGACAGAGT TCGCGGATTT GCAAACGCAG ATCGACAGGC TTGCCCCGTT GCTGGAGACG
CTACCGCAAA CGGATACCGC GGATTCCGAC GACGACGTGC CGCTGGATAA CTGGCGGCAG
GCGCATGATG AGTGCGTGTC ATTACAAAGC CAGTTGCAAA CCCTGCAGGA ACAAACGACG
CAGGAGCAGC AGCGCGCCGC CGAGGCGATA GCGCACTTTG ATGCGGCGTT AAAAAATAGC
CCGTTTGACA GCCAGGCAAC GTTCCTGGCG GCGTTGCTGG ATGAAGAGAC CGTGACCCGT
CTGGAAAAAC AACAACAAAC GCTGGAAAGT CAGCTACAAC AAGCGAAGGC GTTAAGCGCG
CAGTCCGCGC AGGCGCTGGC TGACCATCAG CAACAGCCGC CCGCCGGTCT GGACCCAACG
TGTACGGCGG AGCAGCTCGC GCAGCGGCTG ACGCAGTTGG CGCAACAACT GCGCGAAAAC
ACGACACGCC AGGGGGAAAT CCGCCAGCAA ATTAAACAGG ATGCAGATAA TCGGCAGCGC
CAACGCGCGC TGATGGCGGA AATGAAGCAA GCCTCTCAGC AAGTGGAAGA CTGGGGCTAT
CTCAATGCGC TGATCGGCTC TAAAGAAGGT GATAAGTTCC GCAAATTCGC CCAGGGACTG
ACGCTGGATA ATCTGGTCTG GCTGGCGAAT CATCAGCTCA CCCGCCTGCA TGGCCGTTAT
TTATTGCAGC GCAAAGCCAG CGACGCGCTG GAACTGGAGG TTGTCGATAC CTGGCAGGCC
GACGCCGTGC GCGATACGCG AACGTTATCA GGCGGCGAAA GTTTCCTGGT CAGCCTGGCG
CTGGCGCTGG CGTTATCAGA TTTGGTCAGC CATAAAACGC GCATTGATTC TTTGTTCCTG
GACGAAGGCT TCGGCACGCT TGATAGCGAA ACGCTGGACG CCGCGCTGGA TGCGCTCGAC
GCGCTGAATG CCAGCGGGAA GACCATCGGT GTGATTAGCC ACGTGGAAGC CATGAAAGAA
CGTATCCCTG TGCAGATAAA AGTGAAAAAA ATCAATGGGC TTGGTTATAG CAAACTGGAC
AAAACGTTTG CCGTGGAGTA A
 
Protein sequence
MKILSLRLKN LNSLKGEWKV DFTAEPFASN GLFAITGPTG AGKTTLLDAI CLALYHETPR 
LNTVSQSQND LMTRDTAECL AEVEFEVKGE AWRAFWSQNR ARNQPDGNLQ APRVELARCS
DGKIFADKVK DKLEMIATLT GLDYGRFTRS MLLSQGQFAA FLNAKAKERA ELLEELTGTE
IYGQISAQVF EKHKSARLEL EKLQAQASGV ALLADEQLQQ LEASLQALTD EEKRLLADQQ
VQQQHLHWLT RKNELHTELH ARQQALYAAQ EAREKAQPQL AALTLAQPAR QLRPHWERIQ
EQTRAVERVR QHSDEVNARL QSAYRLRQRI RACAHRQFTQ LNATGQRLKT WLAEHDSIRV
WRSELAGWRA LLTQQSHDRA QLSQWQQQLL SDTRQRDALP PLTLDLTPQA LAEARALHTR
QRPLRHRLAA LQGQILPKQK RQAQLQAAIA RHHQEQAQYT QRLADKRLSY KTKAQELADV
RTICEQEARI KDLESQRAHL QSGQPCPLCG STTHPAIAAY QALELSANQT RRDALEKEVK
TLAEEGAALR GQLDALTQQL QRDESEAQSL LQEEQALTEE WQTLCATLGV QLQPQEDLAG
WLTAAEEHEQ QLDQLSQRHA LQTQIAAHTE QVARFTAQIA QRQASLTADL AQYTLSLPAP
EDEASWLNER ADEAKIWQQR QTEFADLQTQ IDRLAPLLET LPQTDTADSD DDVPLDNWRQ
AHDECVSLQS QLQTLQEQTT QEQQRAAEAI AHFDAALKNS PFDSQATFLA ALLDEETVTR
LEKQQQTLES QLQQAKALSA QSAQALADHQ QQPPAGLDPT CTAEQLAQRL TQLAQQLREN
TTRQGEIRQQ IKQDADNRQR QRALMAEMKQ ASQQVEDWGY LNALIGSKEG DKFRKFAQGL
TLDNLVWLAN HQLTRLHGRY LLQRKASDAL ELEVVDTWQA DAVRDTRTLS GGESFLVSLA
LALALSDLVS HKTRIDSLFL DEGFGTLDSE TLDAALDALD ALNASGKTIG VISHVEAMKE
RIPVQIKVKK INGLGYSKLD KTFAVE