Gene SeD_A2404 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A2404 
SymbolsbcB 
ID6873617 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp2274706 
End bp2276136 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content54% 
IMG OID642785495 
Productexonuclease I 
Protein accessionYP_002216153 
Protein GI198243454 
COG category[L] Replication, recombination and repair 
COG ID[COG2925] Exonuclease I 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones55 
Fosmid unclonability p-value0.191485 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACAGTTA AAAACGACAG CATACAGAGC ACATTCCTCT TCCACGATTA CGAAACCTTC 
GGTACGCATC CGGCCCTCGA CAGACCTGCG CAATTCGCCG CGCTCCGTAC GGATAACGAC
TTCAACGTTA TTGGCGAGCC GGAGGTGTTT TATTGCAAAC CCGCCGATGA TTATCTACCG
CAGCCCGGCG CGGTGCTGAT TACCGGCATC ACGCCGCAGG AAGCGCGTGA GAAAGGAGAA
AACGAAGCCG CTTTCGCCAG ACGCATTCAT GCGCTGTTTA CCGTTCCTAA AACCTGCGTT
GTGGGCTACA ACAATGTGCG CTTTGATGAT GAAGTCACGC GCAATATTTT TTATCGCAAC
TTTTACGATC CCTATGCCTG GAGCTGGCAG CATGATAATT CACGTTGGGA TCTATTGGAT
GTCATGCGCG CCTGTTATGC GCTGCGCCCG GAGGGAATTA ACTGGCCGGA AAACGACGAC
GGCCTGCCCA GCTTTCGTCT GGAACATTTA ACCCAGGCGA ACGGGATCGA ACACAGCAAC
GCGCATGATG CGATGGCGGA TGTCTACGCC ACTATTGCGA TGGCGCAACT GGTGAAAACA
CGCCAGCCGC GACTGTTTGA TTATCTTTAT AGCCACCGCA GTAAACATAA ACTGGCGGCG
CTGATTGACG TTCCGCAGAT GAAGCCGCTG GTGCATGTCT CCGGCATGTT TGGCGCGTGG
CGCGGTAATA CAAGCTGGGT CGCGCCGCTG GCGTGGCATC CTGAAAACCG TAACGCGGTG
ATCATGGTCG ATTTAGCAGG CGATATTTCT CCTCTTCTTG AGCTGGACAG CGACACCCTT
CGCGAGCGGC TTTATACGGC CAAAGCCGAT CTTGGCGATC ACGTCGCAGT GCCGGTAAAG
CTGGTGCATA TCAATAAATG TCCGGTACTG GCGCAGGCGA ATACCTTGCG CCCGGAGGAT
GCCGACCGGC TGGGAATTAA CCGCCAGCAC TGCCTGGATA ACCTGAAAGT GTTGCGTGAA
AACCCGCAGG TCCGCGACAA AGTGGTGGCG ATTTTTGCCG AAGCCGAACC TTTTGCCGCC
TCGGATAACG TTGATGCCCA GCTCTATGAT GGTTTTTTCA GCGATGCCGA TCGCGCAGCC
ATGAAAATCG TACTCGAAAC CGAGCCGCGT AACCTGCCCG CGCTGGATAT TACCTTTGTC
GATAAGCGCA TTGAGAAGCT GCTGTTTAAT TACCGCGCAC GCAATTTTCC CGGTACGCTG
GATGACGCAG AGCAGCAGCG CTGGCTAGAG CATCGCCGTC AGGTGCTGAC GCCGGAGTTT
TTACAACAAT ATGCCAATGA ATTGCAGATG CTTTCTCAGC AGTATGCGGA AGATAAAACG
AAGCTGGGGT TGCTGAAATC ACTGTGGCAG TACGCAACTG AGATTGTGTA A
 
Protein sequence
MTVKNDSIQS TFLFHDYETF GTHPALDRPA QFAALRTDND FNVIGEPEVF YCKPADDYLP 
QPGAVLITGI TPQEAREKGE NEAAFARRIH ALFTVPKTCV VGYNNVRFDD EVTRNIFYRN
FYDPYAWSWQ HDNSRWDLLD VMRACYALRP EGINWPENDD GLPSFRLEHL TQANGIEHSN
AHDAMADVYA TIAMAQLVKT RQPRLFDYLY SHRSKHKLAA LIDVPQMKPL VHVSGMFGAW
RGNTSWVAPL AWHPENRNAV IMVDLAGDIS PLLELDSDTL RERLYTAKAD LGDHVAVPVK
LVHINKCPVL AQANTLRPED ADRLGINRQH CLDNLKVLRE NPQVRDKVVA IFAEAEPFAA
SDNVDAQLYD GFFSDADRAA MKIVLETEPR NLPALDITFV DKRIEKLLFN YRARNFPGTL
DDAEQQRWLE HRRQVLTPEF LQQYANELQM LSQQYAEDKT KLGLLKSLWQ YATEIV