Gene SeD_A1078 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A1078 
Symbol 
ID6872575 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp1081217 
End bp1084405 
Gene Length3189 bp 
Protein Length1062 aa 
Translation table11 
GC content53% 
IMG OID642784263 
Productgifsy-1 prophage RecE 
Protein accessionYP_002214937 
Protein GI198242606 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.673362 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.00000173114 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGGAATTTT TCTATGTAGT AAAAGCTACG CAGAAATCCG GAAAGCAAGA TGCGACGGTC 
TGGTTCACTG CAAAATCAGA AGCGCGCGCC AACCTTATGC TGGATGTCGT TCTGGAAGAT
GCTGAAATTG AAACCGGCCG CGGTAAGGAT TATGCAAGGC CGATCCGCAC CAATTTTCCG
GTAGTCAACG AGCTGCCGCC GGAAGGTGAA ATAAGTTTTA CCTTCACTAA TTATTATCGC
CTCGGTGAAG ATGGCATGAC TTGGGAACAA ATCCCCGGCG TCACCCTGCC ATCATCTGAA
GCCGCCGCCG TGGCCCGCCA GCACATCGTT GACGGTGTTG ATACCGAAAC AGGCGAGGTA
TTGGAAGACC ACTCCGAAAA TTTTGAGAAC AAAGGCAACA GCCCTACCCC AGCCCCCGAG
CTGACTGTTG TCGCAACTAT GCCCCTCCGT CACCGCGTTC TTGCTCAGCA CATAGGTGAA
GGTGAGTATC TTTATCACGT CGACGCCTCC CAGAAAAAAG AAATTCTACG TCTCGAAATG
GACACCGATA ATTCATATGT CCAGAACCTG CTGCTTGCCG CCGAGAATGT TGAAGCGTTC
AAGAAAGCTA TCGAGCACGA TATTCACAAA GCAGTGAATG CGCATAAACA GGTATTTCCT
GTCGATGGAA AAGTGCCTGA GTTATGCACC ACTATTAAGT TTTTTAAGGA ATGGTTCAGT
GCTGAACACA TTAACCGCGG CCTGCTGGTT AAGGAATGGG CTGAACGCCT GAAGAATAAA
CCAGCACCCG TTAAAAAATC CGGGCCACAT AAAGTAATTG TCGCCGACGT AAATAAGCCA
GAACGTCCAC GCCGTAGCGA AAAACCGACA CACAGAACGA TTAACTATGA GCTCGCCTGT
GGTTTCTGTG AGGAGCTGGA TCTGAATAAC CTGCGTCCTG CAATGGATTT TGCAAAACGT
ATCATCGCCG AAGACCGGGA AGACTGGAAG CGAATGTCGA TGACAGTGGG CATTATCCCC
GACATCAAAG GCTACGACCG ACAGACCATT ATTGACCTGG TACGCAAGGC ACCAAAGGCC
GTACATAACG GTAATCCTGA TCTTCGCCGG ACGTGGTGCG AAAGCTTTCT TGCCGTTCAT
GGTGTTCGCG ATCCGGACTG GTACGAATAT GCGCCTGATA ACACCCCAAC AACCCATGAA
GAAAATGCAG CAAGGCTTCG TCAGGCGGGT AAATGTCTGC GGGATATTGA GGCAGGGAGA
TTTCAGTGTG ATGAAGAAAA ACCACAACCA GCAGGCGAAC TGGCAGATGA ACCAGCAACG
CCTGAAGCAG TGGAACAGGA CACAATTGAA CATCATCCGG ACCCGCAGCC GCTGGAGAAT
GAGCCACCTG TAAGCCAGAC AGAAGCAGGC TACCAGAAAA TACGGGCAGA ACTGTACGAA
GCACGTAAAA ACATTCCACC CAAAAACCCG GTTGATGTTG GTAAACAACT GGCAGCCGCG
CGCGGTGAAT ATGTGGAAGG CATCAGCGAC CCGAACGACC CAAAATGGGT GAAAACCGAG
ACAAGCCATC CGACCACCGA ACCTGAACTG GTTAAAAATG TCGGCAACGG TATTTTCGAC
GTGTCCGCTT TAATGCAGAA CTCATCAACT CATGGCACAG AAACGAATCC GGAGACCACC
AGCAATGTGC AGGTTCAAAA AGCTGACAGT GATGAAAAAC AGGCTGGTGA TGCGGTGCAG
GCAGGCGAAG GCGATCTGGG TACTGGTAAA GAAGCAGTTA CCGTAGAGAA CCAGAATCAG
GCTGAGACGC ACCAGAACAA CGATTCTGTG AGCCAATCTG AACCTGAGGC GCAACAAAAC
GTACCGGAAT CGCAACAAGA AGAGCCAGAA GCAGCCTGGC CGGAATACTT CGAGCCGGGC
CGCTATGAAG GTGTACCAAA CGAGGTTTAC CACGCCGCCA ACGGGATCAG CTCAACTCAG
GTGAAAGATG CTCGCGTGTC GCTGATGTAC TTTAACGCGC GTCACGTAGA GAAGACTATC
GTCAAAGAGC GCTCTCCAGT GCTTGATATG GGCAACCTGG TACATGTTCT GGCTCTACAG
CCGGAAAACC TCGAAGCAGA GTTCAGCGTA GAGCCGGAGA TCCCTGAGGG TGCTTTCACC
ACCACCGCCA CCCTGCGCGA GTTCATCGAC GCGCACAACG CCAGCCTGCC AGCGCTGCTG
AGTGCTGACG ATATCAAAGC GCTGCTGGGA GAGTACAACG CCACCCTGCC GTCGCAGATG
CCGCTTGGAG CTTCGGTAGA TGAAACCTAT GCATCGTATG AGCAGCTTCC CGAAGAATTC
CAGCGCATTG AAAACGGCAC CAAACATACA GCCACGGCGA TGAAAGCCTG CATCAAAGAG
TACAACGCTA CCCTGCCCGC GCCGGTTAAA ACCAGCGGCA GCCGTGACGC GCTGCTGGAG
CAACTGGCAA TAATCAACCC TGACCTGGTC GCTCAGGAAG CGCAAAAATC GTCGCCGTTG
AAAGTCTCTG GCACGAAGGC CGATCTGATT CAGGCCGTGA AATCAGTCAA CCCGGCAGCG
GTATTCGCCG ACGAATTGCT GGATGCGTGG CGGGAGAACA CCGAAGGGAA AGTGCTGGTC
ACCCGCCAAC AGCTCAGCAC CGCGCTGAAC ATTCAGAAAG CCCTGCTGGA GCACCCGACC
GCCGGCAAAT TGCTGACTCA CCCAAGCCGC GCTGTCGAGG TTAGCTATTT TGGGATTGAT
GAGGAAACCG GGTTGGAAGT TCGGGTACGC CCTGACCTTG AGCTCGATAT GGGCGGCCTG
CGCATTGGCG CCGACCTGAA AACTATTAGC ATGTGGAACA TCAAGCAGGA AGGCCTGCGT
GCGAAGTTGC ACCGGGAAAT CATCGATCGG GACTATCACC TGAGCGCGGC CATGTACTGC
GAAACTGCGG CGCTGGACCA GTTTTTCTGG ATTTTCGTCA ACAAAGACGA GAACTACCAC
TGGGTCGCCA TCATTGAGGC GTCTACCGAG TTGCTGGAAC TTGGCATGCT GGAATACCGC
AAAACAATGC GAGAGATAGC AAACGGCTTC GACACTGGTG AATGGTCAGC GCCTATCACA
GAAGACTACA CCGACGAACT GAACGATTTT GATGTGCGCC GCCTTGAAGC GTTGCGCGTA
CAGGCATAA
 
Protein sequence
MEFFYVVKAT QKSGKQDATV WFTAKSEARA NLMLDVVLED AEIETGRGKD YARPIRTNFP 
VVNELPPEGE ISFTFTNYYR LGEDGMTWEQ IPGVTLPSSE AAAVARQHIV DGVDTETGEV
LEDHSENFEN KGNSPTPAPE LTVVATMPLR HRVLAQHIGE GEYLYHVDAS QKKEILRLEM
DTDNSYVQNL LLAAENVEAF KKAIEHDIHK AVNAHKQVFP VDGKVPELCT TIKFFKEWFS
AEHINRGLLV KEWAERLKNK PAPVKKSGPH KVIVADVNKP ERPRRSEKPT HRTINYELAC
GFCEELDLNN LRPAMDFAKR IIAEDREDWK RMSMTVGIIP DIKGYDRQTI IDLVRKAPKA
VHNGNPDLRR TWCESFLAVH GVRDPDWYEY APDNTPTTHE ENAARLRQAG KCLRDIEAGR
FQCDEEKPQP AGELADEPAT PEAVEQDTIE HHPDPQPLEN EPPVSQTEAG YQKIRAELYE
ARKNIPPKNP VDVGKQLAAA RGEYVEGISD PNDPKWVKTE TSHPTTEPEL VKNVGNGIFD
VSALMQNSST HGTETNPETT SNVQVQKADS DEKQAGDAVQ AGEGDLGTGK EAVTVENQNQ
AETHQNNDSV SQSEPEAQQN VPESQQEEPE AAWPEYFEPG RYEGVPNEVY HAANGISSTQ
VKDARVSLMY FNARHVEKTI VKERSPVLDM GNLVHVLALQ PENLEAEFSV EPEIPEGAFT
TTATLREFID AHNASLPALL SADDIKALLG EYNATLPSQM PLGASVDETY ASYEQLPEEF
QRIENGTKHT ATAMKACIKE YNATLPAPVK TSGSRDALLE QLAIINPDLV AQEAQKSSPL
KVSGTKADLI QAVKSVNPAA VFADELLDAW RENTEGKVLV TRQQLSTALN IQKALLEHPT
AGKLLTHPSR AVEVSYFGID EETGLEVRVR PDLELDMGGL RIGADLKTIS MWNIKQEGLR
AKLHREIIDR DYHLSAAMYC ETAALDQFFW IFVNKDENYH WVAIIEASTE LLELGMLEYR
KTMREIANGF DTGEWSAPIT EDYTDELNDF DVRRLEALRV QA