Gene SeD_A0433 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A0433 
Symbol 
ID6871173 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp449566 
End bp452706 
Gene Length3141 bp 
Protein Length1046 aa 
Translation table11 
GC content58% 
IMG OID642783662 
Productexonuclease subunit SbcC 
Protein accessionYP_002214349 
Protein GI198245472 
COG category[L] Replication, recombination and repair 
COG ID[COG0419] ATPase involved in DNA repair 
TIGRFAM ID[TIGR00618] exonuclease SbcC 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.2903 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones72 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATTC TCAGTTTGCG TCTGAAAAAC CTGAACTCGC TGAAAGGAGA ATGGAAGGTC 
GATTTTACCG CCGAACCGTT TGCCAGTAAC GGGTTATTCG CCATTACCGG CCCGACCGGC
GCGGGAAAAA CCACCTTGCT TGACGCCATC TGCCTGGCGC TGTACCACGA AACGCCGCGC
CTGAATACGG TATCGCAGTC GCAAAACGAT TTGATGACGC GCGATACGGC GGAATGTCTG
GCCGAAGTGG AGTTTGAGGT AAAAGGCGAA GCCTGGCGCG CGTTCTGGAG CCAGAACCGA
GCCCGAAATC AGCCTGACGG CAATCTCCAG GCCCCACGCG TAGAGCTGGC GCGGTGTTCA
GACGGAAAGA TCTTTGCCGA TAAGGTAAAG GATAAACTGG AGATGATCGC CACGTTGACC
GGGCTGGACT ACGGGCGCTT CACCCGTTCG ATGTTGTTAT CACAGGGGCA GTTCGCCGCC
TTTCTTAACG CCAAAGCGAA AGAGCGCGCA GAGCTTCTGG AGGAGCTCAC CGGCACTGAG
ATTTACGGGC AGATCTCCGC ACAGGTATTT GAAAAACATA AATCAGCCCG TCTCGAACTG
GAAAAGTTAC AAGCGCAGGC CAGCGGCGTC GCCCTATTGG CGGACGAACA GTTACAGCAA
CTGGAAGCCA GTTTGCAGGC GCTTACTGAC GAAGAGAAAC GCCTGCTGGC CGACCAGCAG
GTACAGCAGC AGCACCTTCA CTGGCTGACC CGAAAAAACG AGCTCCACAC TGAATTACAC
GCCCGGCAGC AGGCGCTGTA CGCAGCGCAA GAGGCGCGAG AAAAAGCGCA GCCGCAGCTC
GCGGCGCTGA CTCTGGCGCA ACCTGCTCGC CAGTTGCGTC CGCATTGGGA GCGCATTCAG
GAACAAACCC GCGCCGTTGA GCGTGTTCGC CAGCACAGTG ATGAAGTGAA TGCTCGCTTA
CAAAGCGCGT ATCGCCTGCG CCAGCGTATC CGCGCCTGCG CGCATCGACA ATTCACGCAA
CTGAACGCCA CAGGGCAGCG GTTGAAGACA TGGCTGGCGG AACACGACGG CATCCGCGTC
TGGCGCAGCG AACTGGCGGG GTGGCGAGCA TTATTAACAC AACAATCCCA CGATCGGGCG
CAACTGAGCC AATGGCAACA GCAGTTGCTG AGCGATACGC GTCAGCGTGA TGCGCTACCG
CCGCTCACGC TCGACCTCAC GCCACAGGCG CTGGCGGAAG CCAGAGCATT GCATACCCGG
CAGCGTCCGC TACGCCATCG TCTCGCCGCG CTTCAGGGGC AAATTCTCCC GAAACAAAAG
CGTCAGGCGC AGCTACAGGC AGCTATCGCG CGTCATCATC AGGAGCAGGC GCAATACACC
CAACGTCTGG CGGATAAGCG TTTGAGTTAT AAGACTAAAG CGCAGGAGCT TGCGGACGTT
CGCACTATTT GCGAGCAGGA GGCGCGCATC AAAGATCTGG AAAGCCAGCG GGCGCACTTA
CAGTCCGGCC AGCCGTGTCC GCTTTGCGGC TCGACGACGC ACCCCGCCAT CGCCGCCTAT
CAGGCGCTGG AACTGAGCGC GAATCAAACC CGCCGCGACG CGCTGGAAAA AGAGGTCAAA
ACGCTGGCGG AAGAAGGCGC GGCGCTGCGA GGACAGTTAG ATGCCTTAAC GCAACAATTA
CAGCGGGATG AAAGCGAGGC GCAGTCGTTG TTGCAGGAGG AGCAAGCGCT TACTGAGGAA
TGGCAAACGC TGTGCGCTAC GCTGGGCGTT CAGCTCCAGC CGCAAGAAGA CCTCGCGGGC
TGGCTGACCG CCGCGGAAGA GCATGAGCAA CAGTTGGATC AGCTCAGCCA GCGTCATGCT
CTACAAACGC AAATCGCGGC GCATACTGAA CAGGTAGCGC GCTTTACCGC GCAGATTGCG
CAGCGTCAGG CGTCGCTGAC GGCTGATTTG GCGCAGTATA CGCTTTCCCT TCCTGCGCCA
GAAGACGAGG CTTCGTGGCT GAATGAGCGC GCCGACGAAG CGAAAATATG GCAACAGCGT
CAGACAGAGT TCGCGGATTT GCAAACGCAG ATCGACAGGC TTGCCCCGTT GCTGGAGACG
CTACCGCAAA CGGATACCGC GGATTCCGAC GACGACGTGC CGCTGGATAA CTGGCGGCAG
GCGCATGATG AGTGCGTGTC ATTACAAAGC CAGTTGCAAA CCCTGCAGGA ACAGACGACG
CAGGAGCAGC AGCGCGCCGC CGAGGCGATA GCGCACTTTG ATGCGGCGTT AAAAAATAGC
CCGTTTGACA GCCAGGCAAC GTTCCTGGCG GCGTTGCTGG ATGAAGAGAC CGTGACCCGT
CTGGAAAAAC AACAACAAAC GCTGGAAAGT CAGCTACAAC AAGCGAAGGC GTTAAGCGCG
CAGTCCGCGC AGGCGCTGGC TGACCATCAG CAACAGCCGC CCGCTGGTCT GGACCCAACG
TGTACGGCGG AGCAGCTCGC GCAGCGGCTG ACGCAGTTGG CGCAACAACT GCGCGAAAAC
ACGACACGCC AGGGGGAAAT CCGCCAGCAA ATTAAACAGG ATGCAGATAA TCGGCAGCGC
CAACGCGCGC TGATGGCGGA AATGAAGCAA GCCTCTCAGC AAGTGGAAGA CTGGGGCTAT
CTCAATGCGC TGATCGGCTC TAAAGAAGGT GATAAGTTCC GCAAATTCGC CCAGGGACTG
ACGCTGGATA ATCTGGTCTG GCTGGCGAAT CATCAGCTCA CCCGCCTGCA TGGCCGCTAT
TTATTGCAGC GCAAAGCCAG CGACGCGCTG GAACTGGAGG TTGTCGATAC CTGGCAGGCC
GACGCCGTGC GCGATACGCG AACGTTATCA GGCGGCGAAA GCTTCCTGGT CAGCCTGGCG
CTGGCGCTGG CGTTATCAGA TTTGGTCAGC CATAAAACGC GCATTGATTC TTTGTTCCTC
GACGAAGGCT TCGGCACGCT TGATAGCGAA ACGCTGGACG CCGCGCTGGA TGCGCTCGAC
GCGCTGAATG CCAGCGGGAA GACCATCGGT GTGATTAGCC ACGTGGAAGC CATGAAAGAA
CGTATCCCTG TGCAGATAAA AGTGAAAAAA ATCAATGGGC TTGGTTATAG CAAACTGGAC
AAAGCGTTTG CTGTGGAGTA A
 
Protein sequence
MKILSLRLKN LNSLKGEWKV DFTAEPFASN GLFAITGPTG AGKTTLLDAI CLALYHETPR 
LNTVSQSQND LMTRDTAECL AEVEFEVKGE AWRAFWSQNR ARNQPDGNLQ APRVELARCS
DGKIFADKVK DKLEMIATLT GLDYGRFTRS MLLSQGQFAA FLNAKAKERA ELLEELTGTE
IYGQISAQVF EKHKSARLEL EKLQAQASGV ALLADEQLQQ LEASLQALTD EEKRLLADQQ
VQQQHLHWLT RKNELHTELH ARQQALYAAQ EAREKAQPQL AALTLAQPAR QLRPHWERIQ
EQTRAVERVR QHSDEVNARL QSAYRLRQRI RACAHRQFTQ LNATGQRLKT WLAEHDGIRV
WRSELAGWRA LLTQQSHDRA QLSQWQQQLL SDTRQRDALP PLTLDLTPQA LAEARALHTR
QRPLRHRLAA LQGQILPKQK RQAQLQAAIA RHHQEQAQYT QRLADKRLSY KTKAQELADV
RTICEQEARI KDLESQRAHL QSGQPCPLCG STTHPAIAAY QALELSANQT RRDALEKEVK
TLAEEGAALR GQLDALTQQL QRDESEAQSL LQEEQALTEE WQTLCATLGV QLQPQEDLAG
WLTAAEEHEQ QLDQLSQRHA LQTQIAAHTE QVARFTAQIA QRQASLTADL AQYTLSLPAP
EDEASWLNER ADEAKIWQQR QTEFADLQTQ IDRLAPLLET LPQTDTADSD DDVPLDNWRQ
AHDECVSLQS QLQTLQEQTT QEQQRAAEAI AHFDAALKNS PFDSQATFLA ALLDEETVTR
LEKQQQTLES QLQQAKALSA QSAQALADHQ QQPPAGLDPT CTAEQLAQRL TQLAQQLREN
TTRQGEIRQQ IKQDADNRQR QRALMAEMKQ ASQQVEDWGY LNALIGSKEG DKFRKFAQGL
TLDNLVWLAN HQLTRLHGRY LLQRKASDAL ELEVVDTWQA DAVRDTRTLS GGESFLVSLA
LALALSDLVS HKTRIDSLFL DEGFGTLDSE TLDAALDALD ALNASGKTIG VISHVEAMKE
RIPVQIKVKK INGLGYSKLD KAFAVE