Gene SNSL254_A0439 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A0439 
Symbol 
ID6486808 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp453780 
End bp454982 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content56% 
IMG OID642735862 
Productexonuclease subunit SbcD 
Protein accessionYP_002039636 
Protein GI194444758 
COG category[L] Replication, recombination and repair 
COG ID[COG0420] DNA repair exonuclease 
TIGRFAM ID[TIGR00619] exonuclease SbcD 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.196846 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones104 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCATCC TCCACACCTC TGACTGGCAT CTGGGACAAA ACTTCTACAG TAAAAGCCGC 
GCCGCGGAGC ATCAGGCTTT TCTGGACTGG CTGCTGGAGA CCGCGCAGGC CCATCAGGTG
GATGCCATTA TTGTCGCTGG CGATATTTTT GATACCGGTT CGCCGCCAAG CTATGCCCGA
GAACTTTATA ACCGTTTCGT CGTTAATTTA CAGCAAACGG GTTGTCATCT GGTGGTGCTG
GCCGGTAATC ATGATTCCGT CGCCACGCTA AACGAGTCGC GCGACATTCT GGCGTTTCTC
AATACAACCG TGATCGCCAG CGCGGGCTAT GCGCCGCGGC TACTTCATCG TCGCGACGGT
TCTCCGGGCG CCGTACTGTG CCCCATTCCC TTTTTGCGCC CGCGCGACAT TATTACCAGT
CAGGCGGGGT TATCCGGCAG CGAGAAACAG CAGCAACTTC TTCATGCGAT TGCCGATTAT
TATCAACAGC AGTATCAGGA AGCGTGCCAG CTACGCGGCG AACGAAAGCT GCCGGTTATC
GCGACGGGAC ATTTAACCAC CGTCGGCGCC AGCAAAAGCG ATGCGGTTCG CGACATTTAT
ATCGGTACGC TGGATGCCTT TCCGGCGCAG CATTTCCCCC CCGCAGATTA TATCGCATTA
GGACACATTC ACCGCGCGCA ATGTGTCGGC GGCACGGAGC ATATCCGCTA TTGCGGCTCG
CCCATCGCCC TCAGCTTTGA TGAGTGCGGC AAAAGCAAAT GCGTGCATCT GGTGACCTTC
GACCAGGGGA AATGGCAAAG CACCGAAAGT CTGGCTGTCC CCGTGACTCA ACCGTTGGCG
GTTTTAAAAG GCGACCTGGC ATCAATTACC GAACAGCTTG AGCAGTGGCG CGGCGTTGAG
CAATCGCCCC CCGTCTGGCT GGATATTGAA ATCACAACCG ATGACTATCT GCACGATATC
CAACGCAGAA TACAGACATT AACGGAGTCA CTCCCCGTAG AGGTATTACT GGTGCGCCGT
AGCCGCGAAC AGCGCGAGCG CTCGCTGGCG AACGAGCGGC GGGAAACATT AAGCGAGCTT
AGCGTGGAAG AGGTTTTTGC GCGGCGTCTG GCGCTGGAAG CGTTAGATAC CCCGCAGCGC
GAGCGCCTGA ATCAGCTCTT TTCCAGCACG CTCTACGCGT TGAATGAGGA GCATGAGGCA
TGA
 
Protein sequence
MRILHTSDWH LGQNFYSKSR AAEHQAFLDW LLETAQAHQV DAIIVAGDIF DTGSPPSYAR 
ELYNRFVVNL QQTGCHLVVL AGNHDSVATL NESRDILAFL NTTVIASAGY APRLLHRRDG
SPGAVLCPIP FLRPRDIITS QAGLSGSEKQ QQLLHAIADY YQQQYQEACQ LRGERKLPVI
ATGHLTTVGA SKSDAVRDIY IGTLDAFPAQ HFPPADYIAL GHIHRAQCVG GTEHIRYCGS
PIALSFDECG KSKCVHLVTF DQGKWQSTES LAVPVTQPLA VLKGDLASIT EQLEQWRGVE
QSPPVWLDIE ITTDDYLHDI QRRIQTLTES LPVEVLLVRR SREQRERSLA NERRETLSEL
SVEEVFARRL ALEALDTPQR ERLNQLFSST LYALNEEHEA