Gene SNSL254_A1046 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A1046 
Symbol 
ID6483463 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp1061742 
End bp1064627 
Gene Length2886 bp 
Protein Length961 aa 
Translation table11 
GC content53% 
IMG OID642736452 
Productgifsy-1 prophage RecE 
Protein accessionYP_002040211 
Protein GI194442376 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.74306 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.00000020097 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGAGTGGAA CTAATCCTGT ATTTTTAGTC CGCAAAGCAA AGAAATCATC AGGCCAGAAA 
GACGCTGTAC TCTGGTGCAG TGATGATTTT GAAGCGGCAA ATGCAACACT GGATTATCTT
CTGATTAAAT CCGGTGCGAA GCTGAAAGAT TATTTCAAAG CTGTCGCTAC TAATTTCCCT
GTCGTTAACG AGCTGCCGCC GGAAGGCGAA CTGAGCCTCA CTTTCTGCGA TTACTATCAA
CTCGCTAAAG ACAATATGAC CTGGACGCAA ATCCCCGGCG TCACCCTGCC ATCATCTGAA
GCCGCCGCCG CGGCGCGCCA GCATATCGTC GATGGTGTTG ATACCGAAAC AGGCGAAGTG
CTGGAAGACC ACACCGAAAA TTTTGGTAAC GAAAGCAACA GCCCTGCCCA GGCAACAGCC
CCAGCCCCCG AGCTGACTGT TGTCGCAACT ATGCCTCTCC GTCACCGCGT TCTTGCTCAG
TACATAGGTG AAGGTGAGTA TCTTTATCAC GTCGACGCCT CCCAGAAAAA AGAAATTCTG
CGTCTCGAAA TGGACACCGA TAATTCATAT GTCCAGAACC TGCTGCTTGC CGCCGAGAAT
GTTGAAGCGT TCAAGAAAGC CATTGAACAT GACATTCACA AAATAGTGAA TGCCGTTAAA
AAAGTATTCC CTGTCGATGG AAAAACTCCT GAACTGGCGA CTGTTATCCA GTTCCTTAAA
ACATGGTTCG AGACGGAGCA TATCGATCGC GGTTTGCTCG TTAAGGAGTG GGCGAAAGGC
AACCGTGTAT CGGCTATTCA ACGCACTGAA AGCGGCGCCA ACGCTGGCGG TGGCAATAAG
ACTGACCGTA ACCCTGATTA CGAACACACT CTCGATACTC TGGACGTAGA GATTGCAATG
GCCACTTTGC CTATGGACTT TAATATCTAT GAGCTACCTG GCAGCGTTTA CCGTCGCGCA
AAAGAAATCG TAAAGAAAAA GGAAAGTCCG TTCAAAGAAT GGTCCGCAGC ACTTCGCGCA
ACGCCCGGTA TCCTGGATTA TTCCCGCGCC GCTATTTTCG CGCTGATCCG AAGCGCACAC
CCTGAGTTTT ATCACTACCC CGGACGCCTT CAGGGGTATA TCAACGCCAA CTTAACGGAG
ACTGATCACG AGAACCCCAC CGAGGAAGCT CTCACGGCTG CCCGACACAC TCCGGAAAAA
GACGCGGTAG AAGAAGCCAA CCGACAGCTT GCCGCCGCGC GCGGTGAATA TGTGGAAGGC
ATCAGCGACC CGAACGACCC AAAATGGGTG AAAACCGGGA CAAGCCAGCC GACCACCGAA
CCTGAACTGG TTAAAAATGT TGGCAACGGT ATTTTCGACG TGTCCGCTTT AATGCAGAAC
TCATCAACTC ATGGCACAGA AACGAATCCG GAGACCACCA GCAATGTGCA GGTTCAAAAA
GCTGACAGTG ATGAAAAACA GGCTGGTGAT GCGGTGCAGG CAGGCGAAGA CGATCTGGGT
ACTGGTAAAG AAGCAGTTAC CGTAGAGAAC CAGAATCAGG CTGAGGCGCA ACAAAACGTA
CCGGAATCGC AACAAGAAGA GCCAGAAGCA GCCTGGCCGG AATACTTCGA GCCGGGCCGC
TATGAAGGTG TACCAAACGA AGTTTACCAC GCCGCCAATG GGATCAGCTC AACTCAGGTG
AAAGATGCTC GCGTGTCGCT GATGTACTTT AACGCGCGTC ACGTAGAGAA AACTATCGTC
AAAGAGCGCT CTCCAGTGCT TGATATGGGC AACCTGGTAC ATGCTCTGGC TCTACAGCCG
GAAAACCTCG AAGCGGAGTT CAGCGTAGAG CCGGAGATCC CTGAGGGTGC TTTCACCACC
ACCGCCACCC TGCGCGAGTT CATCGACGCG CACAACGCCA GCCTGCCAGC GCTGCTGAGT
GCTGACGATA TCAAAGCGCT GCTGGAAGAG TACAACGCCA CCCTGCCGTC GCAGATGCCG
CTTGGAGCTT CGGTAGATGA AACCTATGCA TCGTATGAGC AGCTTCCCGA AGAATTCCAG
CGCATTGAAA ACGGCACCAA ACATACAGCC ACGGCGATGA AAGCCTGCAT CAAAGAGTAC
AACGTCACCC TGCCCGCGCC GGTTAAAACC AGCGGCAGCC GTGACGCGCT GCTGGAGCAA
CTGGCAATAA TCAACCCTGA CCTGGTCGCT CAGGAAGCGC AAAAATCGTC GCCGTTGAAA
GTCTCTGGCA CGAAGGCCGA TCTGATTCAG GCCGTGAAAT CAGTCAACCC GGCAGTGGTA
TTCGCCGACG AATTGCTGGA TGCGTGGCGG GAGAACACCG AAGGGAAAGT GCTGGTCACC
CGCCAACAGC TCAGCACCGC GCTGAACATT CAGAAAGCCC TGCTGGAGCA CCCGACCGCC
GGCAAATTGC TGACTCACCC AAGCCGCGCT GTCGAGGTTA GCTATTTTGG GATTGATGAG
GAAACCGGGT TGGAAGTTCG GGTACGCCCT GACCTTGAGC TCGATATGGG CGGCCTGCGC
ATTGGCGCCG ACCTGAAAAC TATCAGCATG TGGAACATCA AGCAGGAAGG CCTGCGTGCG
AAGTTGCACC GGGAAATCAT CGATCGGGAC TATCACCTGA GCGCGGCCAT GTACTGCGAA
ACTGCGGCGC TGGACCAGTT TTTCTGGATT TTCGTCAACA AAGACGAGAA CTACCACTGG
GTCGCCATCA TTGAGGCGTC TACCGAGTTG CTGGAACTTG GCATGCTGGA ATACCGCAAA
ACAATGCGAG AGATAGCAAA CGGCTTCGAC ACTGGTGAAT GGTCAGCGCC TATCACAGAA
GACTACACCG ACGAACTGAA CGATTTTGAT GTGCGCCGCC TTGAAGCGTT GCGCGTACAG
GCATAA
 
Protein sequence
MSGTNPVFLV RKAKKSSGQK DAVLWCSDDF EAANATLDYL LIKSGAKLKD YFKAVATNFP 
VVNELPPEGE LSLTFCDYYQ LAKDNMTWTQ IPGVTLPSSE AAAAARQHIV DGVDTETGEV
LEDHTENFGN ESNSPAQATA PAPELTVVAT MPLRHRVLAQ YIGEGEYLYH VDASQKKEIL
RLEMDTDNSY VQNLLLAAEN VEAFKKAIEH DIHKIVNAVK KVFPVDGKTP ELATVIQFLK
TWFETEHIDR GLLVKEWAKG NRVSAIQRTE SGANAGGGNK TDRNPDYEHT LDTLDVEIAM
ATLPMDFNIY ELPGSVYRRA KEIVKKKESP FKEWSAALRA TPGILDYSRA AIFALIRSAH
PEFYHYPGRL QGYINANLTE TDHENPTEEA LTAARHTPEK DAVEEANRQL AAARGEYVEG
ISDPNDPKWV KTGTSQPTTE PELVKNVGNG IFDVSALMQN SSTHGTETNP ETTSNVQVQK
ADSDEKQAGD AVQAGEDDLG TGKEAVTVEN QNQAEAQQNV PESQQEEPEA AWPEYFEPGR
YEGVPNEVYH AANGISSTQV KDARVSLMYF NARHVEKTIV KERSPVLDMG NLVHALALQP
ENLEAEFSVE PEIPEGAFTT TATLREFIDA HNASLPALLS ADDIKALLEE YNATLPSQMP
LGASVDETYA SYEQLPEEFQ RIENGTKHTA TAMKACIKEY NVTLPAPVKT SGSRDALLEQ
LAIINPDLVA QEAQKSSPLK VSGTKADLIQ AVKSVNPAVV FADELLDAWR ENTEGKVLVT
RQQLSTALNI QKALLEHPTA GKLLTHPSRA VEVSYFGIDE ETGLEVRVRP DLELDMGGLR
IGADLKTISM WNIKQEGLRA KLHREIIDRD YHLSAAMYCE TAALDQFFWI FVNKDENYHW
VAIIEASTEL LELGMLEYRK TMREIANGFD TGEWSAPITE DYTDELNDFD VRRLEALRVQ
A