Gene SNSL254_A2244 details


Gene Information

Locus tag: SNSL254_A2244
Symbol: sbcB
ID: 6484322
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Newport str. SL254
Kingdom: Bacteria
Replicon accession: NC_011080
Strand:
Start bp: 2152964
End bp: 2154394
Gene Length: 1431 bp
Protein Length: 476 aa
Translation table: 11
GC content: 54%
IMG OID: 642737592
Product: exonuclease I
Protein accession: YP_002041334
Protein GI: 194443671
COG category: [L] Replication, recombination and repair
COG ID: [COG2925] Exonuclease I
TIGRFAM ID:
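
The length fields in this record agree with the coordinates if Start bp and End bp are read as 1-based inclusive positions. The record does not state that convention, so it is an assumption here. A minimal sketch of the arithmetic, with variable names chosen only for illustration:

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2152964
end_bp = 2154394
gene_length_bp = 1431
protein_length_aa = 476

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A complete CDS is a whole number of codons; the last codon is a stop
# codon and contributes no residue to the protein.
codons, remainder = divmod(span, 3)
expected_protein_length = codons - 1

print(span, codons, remainder, expected_protein_length)
```

Under that assumption the span equals the listed gene length, divides evenly into codons, and leaves one codon for the stop.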


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 53
Fosmid unclonability p-value: 0.0792988
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACAGTTA AAAACGACAG CATACAGAGC ACATTCCTCT TCCACGATTA CGAAACCTTC 
GGTACGCATC CTGCCCTCGA CAGACCTGCG CAATTTGCCG CGCTCCGTAC GGATAACGAC
TTCAACGTTA TTGGCGAGCC GGAGGTGTTT TATTGCAAAC CCGCTGATGA TTATCTACCG
CAGCCCGGCG CAGTGCTGAT TACCGGCATC ACGCCGCAGG AAGCGCGTGA GAAAGGAGAA
AACGAAGCCG CTTTCGCCAG ACGCATTCAT GCGCTGTTTA CCGTTCCTAA AACCTGCGTT
GTGGGCTACA ACAATGTGCG TTTTGATGAT GAAGTTACGC GCAATATTTT TTATCGCAAC
TTTTACGATC CCTACGCCTG GAGCTGGCAG CATGATAATT CACGTTGGGA TCTATTGGAT
GTCATGCGCG CCTGTTATGC GCTGCGCCCG GAGGGAATTA ACTGGCCGGA AAACGACGAC
GGCCTGCCCA GCTTTCGTCT GGAACATTTA ACCCAGGCGA ACGGGATCGA ACACAGCAAC
GCGCATGATG CGATGGCGGA TGTCTACGCC ACTATTGCGA TGGCGCAACT GGTGAAAACA
CGCCAGCCGC GACTGTTTGA TTATCTTTAT AGCCACCGCA GTAAACATAA ACTGGCGGCG
CTGATTGACG TCCCGCAAAT GAAGCCGCTG GTGCATGTCT CCGGCATGTT TGGCGCGTGG
CGCGGTAATA CAAGCTGGGT CGCGCCGCTG GCGTGGCATC CAGAAAACCG TAACGCGGTG
ATCATGGTCG ATTTAGCAGG CGATATTTCT CCTCTTCTTG AGCTGGACAG CGACACCCTT
CGCGAGCGGC TTTATACGGC CAAAGCCGAT CTTGGCGATC ACGTCGCAGT GCCGGTAAAG
CTGGTGCATA TCAATAAATG TCCGGTACTG GCGCAGGCGA ATACCTTGCG CCCGGAGGAT
GCCGACCGGC TGGGAATTAA CCGCCAGCAC TGCCTGGATA ACCTGAAAGT GCTGCGTGAA
AACCCGCAGG TCCGCGACAA AGTGGTGGCG ATTTTCGCCG AAGCCGAACC TTTTGCCGCC
TCGGATAACG TTGATGCCCA GCTCTATGAT GGTTTTTTCA GCGATGCCGA TCGCGCAGCC
ATGAAAATCG TACTCGAAAC CGAGCCGCGT AACCTGCCCG CGCTGGATAT TACCTTTGTC
GATAAGCGCA TTGAGAAGCT GCTGTTTAAT TACCGCGCAC GCAATTTTCC CGGTACGCTG
GATGACGCAG AGCAGCAGCG CTGGCTGGAG CATCGCCGTC AGGTGCTGAC GCCGGAGTTT
TTACAACAGT ATGCCAATGA ATTGCAGATG CTTTCTCAGC AGTATGCGGA AGATAAAACG
AAGCTGGGGT TGCTGAAATC ACTGTGGCAG TACGCAACCG AGATTGTGTA A
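
The GC content field can be checked against the gene sequence above. The following is a minimal sketch, not the database's own method. The function name `gc_content` is hypothetical, and the record does not say how the reported percentage was rounded.

```python
def gc_content(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string.

    Whitespace (the 10-base grouping and line breaks used in this
    record) is ignored; case is normalised to upper.
    """
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First 10-base block of the gene sequence: 3 G + 1 C out of 10 bases.
first_block = "GTGACAGTTA"
print(gc_content(first_block))
```

Passing the full 1431 bp sequence to the same function should give a value that can be compared with the GC content listed under Gene Information.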
 
Protein sequence
MTVKNDSIQS TFLFHDYETF GTHPALDRPA QFAALRTDND FNVIGEPEVF YCKPADDYLP 
QPGAVLITGI TPQEAREKGE NEAAFARRIH ALFTVPKTCV VGYNNVRFDD EVTRNIFYRN
FYDPYAWSWQ HDNSRWDLLD VMRACYALRP EGINWPENDD GLPSFRLEHL TQANGIEHSN
AHDAMADVYA TIAMAQLVKT RQPRLFDYLY SHRSKHKLAA LIDVPQMKPL VHVSGMFGAW
RGNTSWVAPL AWHPENRNAV IMVDLAGDIS PLLELDSDTL RERLYTAKAD LGDHVAVPVK
LVHINKCPVL AQANTLRPED ADRLGINRQH CLDNLKVLRE NPQVRDKVVA IFAEAEPFAA
SDNVDAQLYD GFFSDADRAA MKIVLETEPR NLPALDITFV DKRIEKLLFN YRARNFPGTL
DDAEQQRWLE HRRQVLTPEF LQQYANELQM LSQQYAEDKT KLGLLKSLWQ YATEIV
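
The protein sequence can be reproduced from the gene sequence with translation table 11, as listed under Gene Information. The sketch below is illustrative and uses only the standard library. The names `translate_cds`, `CODON_TABLE` and `START_CODONS` are hypothetical. The start-codon set is taken from the NCBI definition of table 11 as best recalled, so treat it as an assumption and check it against the NCBI genetic code listing before relying on it.

```python
from itertools import product

# Amino acids for the 64 codons in TCAG order (first base varies slowest).
# Table 11 uses the same codon-to-residue mapping as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: alternative initiation codons of table 11. At the first
# position of a CDS any of these is translated as methionine.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon reads as Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break  # stop codon ends the protein
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence in this record (60 bases, 20 codons).
first_line = "GTGACAGTTA AAAACGACAG CATACAGAGC ACATTCCTCT TCCACGATTA CGAAACCTTC"
print(translate_cds(first_line))  # MTVKNDSIQSTFLFHDYETF
```

The gene starts with GTG rather than ATG, which is why the initiator codon is handled separately: read as an internal codon, GTG would give valine, while the protein sequence in this record begins with M.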