Gene SNSL254_A0037 details

Gene Information

Locus tag: SNSL254_A0037
Symbol:
ID: 6486426
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Newport str. SL254
Kingdom: Bacteria
Replicon accession: NC_011080
Strand:
Start bp: 37096
End bp: 38658
Gene Length: 1563 bp
Protein Length: 520 aa
Translation table: 11
GC content: 47%
IMG OID: 642735481
Product: 5'-Nucleotidase domain protein
Protein accession: YP_002039263
Protein GI: 194444661
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0737] 5'-nucleotidase/2',3'-cyclic phosphodiesterase and related esterases
TIGRFAM ID:
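
The coordinate and length fields above can be cross-checked against each other. The sketch below is illustrative only: it assumes the start and end coordinates are 1-based and inclusive, and that the gene length includes the terminal stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    length = gene_length(37096, 38658)
    print(length)                  # 1563
    print(protein_length(length))  # 520
```

Under these assumptions the values agree with the Gene Length and Protein Length fields of this record.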


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.361552
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 56
Fosmid unclonability p-value: 0.180073
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACAAAA AGTTTTCGAT ATCCCTACTG TCGCTGTGCA TTGGTTTGTC TTCAGCCATT 
TCCTTTTCAG CCGATGCGCG TGACATCACA ATTTATTATA CAAACGATTT ACATGCCCAT
GTAACCCCAG AAATTATCCC CTATGTATCC AAGACACGTC CGGTAGGCGG CTTTGCGCCC
ATCTCGAAAA TTGTCAAAGA TGCAAAAGCG AAAGAGAAAG ATGTCTTTTT CTTTGATGCT
GGCGACTATT TCACCGGACC TTTTATCAGT ACGCTGACCA AAGGCGAGGC TATTATTGAT
ATTTTAAATA CCATGCCTTA CGACGCCGTC TCTGTCGGTA ACCATGAATT TGACCATGGC
CATGAGAATC TGGTTAAACA ACTCAGCAAA TTGCAATTCC CGGTATTGTT GGATAATGTT
TTTTACAGCG GCACAGATAC GCCATTAATT AAAGAACCGT ATACCATCGT GGAAAAAGAT
GGATTCAAGA TCGGCGTCAT CGGTATGCAC GGCGTTTCCG CATTCTATGA AGCGATTGCC
GCAGGCGTGC GTGAAGGCGT TGACTGCCGC GATCCGATTC CTTATGTGAA AAAACAGCTG
GAAGAGTTAA AAGGGAAAGT TGACCTGACC GTGCTGCTCG CCCACGAAGG CGTACCGGGT
ATGCAGTCCA GCGCAGGCGA GGCTGATGTC GCGCGCGCGC TGAAAACCGA CGTTGATATG
GCGAAATCGC TGGAAGGCTA TGGACTTAAC GTCCTGATTA CCGGCCATGC GCATAAAGGT
ACGCCAGAAC CGATTAAAGT GGGCGATACC CTTGTCGTTT CCACGGATGC GTACACCATC
GAATTAGGTA AACTGGTGCT TGACTGGAAC CCGGAAACCA AAAAAGTGGA CAGCTACAAT
GGTAAGTTGA TCACCATGTA TGCGGATACT TATAAGCCAG ATCCGGTCAC GCAGGCCAAA
ATTGACGAAT GGGATAACAA GGTTAAGAAA ATTACCGATG AGGTGGTCGC GCACTCTCCG
GAAGTGCTGA CCCGTTCTTA CGGTGAATCC GCGCCAACCG GCAACTTAAT CACCGATGCC
CTGATGGCTA CCGTTCCTGG CGCCGACGCT TCCTTCTATA ATGCTGGCGG CATCCGTACC
GAATTGCCTA AAGGTAATAT CACCTATGGT GATGTGCTGA GTATGTATCC GTTCACCAAC
GATGTCATGA GCATGGAAAT CAGCGGTAAG GACCTGAAAT CCATCATGTC ACACGCTGCC
GATCTGAAAA ACGGTATGCT GCACGTATCT AAAACCGTCC AGTTTAAATA TGACAGCACC
AAACCGCTGG GCCAGCGTAT TGTTGAATTT GATATCAAAG GCAAACCGGT AGAAGACAAT
AAACTCTATA CCGTCGCGCT GGACTCCTTT ATCGGTAAAG GTGGTGGCGG ATTTACCTTC
ACTAAAGGTA AAAATATCAA ATATATAGGG ATACAAACCG CACCGGCGTT GGTTAACTAT
ATGAAGCAGG TTAACAATAT TCAACCTGAC CACACCATGC GCGTGGATGA TATTAGCAAA
TAA
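
The gene sequence above is laid out in space-separated blocks of ten bases. As a minimal sketch, the helpers below strip that layout and compute the G+C fraction, which could then be compared with the GC content field of the record. The function names are hypothetical, and the calculation assumes the sequence contains only the unambiguous bases A, C, G and T.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 to 1.0)."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


if __name__ == "__main__":
    # First two ten-base blocks of the gene sequence in this record.
    print(gc_fraction("ATGAACAAAA AGTTTTCGAT"))  # 0.25
```

Pasting the full gene sequence into `gc_fraction` gives the whole-gene value; only the first twenty bases are used here to keep the example short.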
 
Protein sequence
MNKKFSISLL SLCIGLSSAI SFSADARDIT IYYTNDLHAH VTPEIIPYVS KTRPVGGFAP 
ISKIVKDAKA KEKDVFFFDA GDYFTGPFIS TLTKGEAIID ILNTMPYDAV SVGNHEFDHG
HENLVKQLSK LQFPVLLDNV FYSGTDTPLI KEPYTIVEKD GFKIGVIGMH GVSAFYEAIA
AGVREGVDCR DPIPYVKKQL EELKGKVDLT VLLAHEGVPG MQSSAGEADV ARALKTDVDM
AKSLEGYGLN VLITGHAHKG TPEPIKVGDT LVVSTDAYTI ELGKLVLDWN PETKKVDSYN
GKLITMYADT YKPDPVTQAK IDEWDNKVKK ITDEVVAHSP EVLTRSYGES APTGNLITDA
LMATVPGADA SFYNAGGIRT ELPKGNITYG DVLSMYPFTN DVMSMEISGK DLKSIMSHAA
DLKNGMLHVS KTVQFKYDST KPLGQRIVEF DIKGKPVEDN KLYTVALDSF IGKGGGGFTF
TKGKNIKYIG IQTAPALVNY MKQVNNIQPD HTMRVDDISK
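
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon table from the amino acid string of NCBI translation table 11, which assigns the same amino acids as the standard code and differs from it only in the permitted start codons. The function name is hypothetical, and the code assumes an in-frame sequence made up only of A, C, G and T.

```python
BASES = "TCAG"
# Amino acids for codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(text: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        residues.append(amino_acid)
    return "".join(residues)


if __name__ == "__main__":
    # First three ten-base blocks of the gene sequence in this record.
    print(translate("ATGAACAAAA AGTTTTCGAT ATCCCTACTG"))  # MNKKFSISLL
```

The output matches the first ten residues of the protein sequence listed above.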