Gene SeD_A0603 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A0603 
Symbol 
ID6875860 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp617064 
End bp618227 
Gene Length1164 bp 
Protein Length387 aa 
Translation table11 
GC content50% 
IMG OID642783821 
Productprophage DLP12 integrase 
Protein accessionYP_002214507 
Protein GI198242564 
COG category[L] Replication, recombination and repair 
COG ID[COG4974] Site-specific recombinase XerD 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.591827 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones53 
Fosmid unclonability p-value0.109932 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAATCT TCCGTAGAGG TGAAATATGG TACGCCTCAT ACTCGCTCCC GGGCGGGAAG 
CGAATTAAGG AATCTCTTGG GACAGCGGAC AAGCGGCAAG CTCAGGAGTT GCACGACAAA
AGAAAGGCTG AACTCTGGCG AGTAGACAAG CTCGGCGACT TTCCTGAAGT GACTTTTGAA
GAAGCATGCC TCCGCTGGCT GGAAGAGAAA GCAGACAAGA AATCGCTCGA TACCGATAAA
GGCCGGATGG GATTCTGGCT TGAGCATTTC GAAGGAGTAA GGATAAAGGA TATCACTGAG
GCGAAGATTT ACGCCGCGGT GAGCAGGATG CAAAACAGGA AGGTAAAGGA GATATGGCAG
CAGAAAGTTG AATCTGCCAA GAGAAAGGGT AAAGAAGCGC CAGTATTTGA GCCCAAGCCG
GTCACCACAT CGACAAAGGC AAAGCACCTC GCACTGATAA AGGCCATTCT CCGGGCGGCA
GAACGTGACT GGAAATGGCT GGAGAAAGCG CCTGTAATCA AGGTTCCTTC TGTCAGAAAC
AAGCGCGTCA GATGGCTTGA GCGTGATGAG GCAAAAAGAC TTATTGAAGA ATGTCCGGAA
CCGTTGAAAT CTGTTGTTAA ATTTGCGCTG GCAACGGGAC TTAGGCGGTC TAACATCATC
AATATGGAGT GGCAACAGAT CGACATGCAG CGTCGTGTTG CCTGGGTGAA CCCTGAAGAC
AGCAAGTCAA ACCGCGCTAT TGGCGTAGCG CTAAATGACA CTGCCTGTAA GGTATTGCGT
GACCAGATTG GTAAGCATCA TAAATGGGTG TTCGTGCATA CGAAAGAAGG CATCCGGCCT
GATGGTTCAA AGACGCCAAC CGTGAGAAAG ATGCGCGTCG ATGATCAGCG GGCGTGGAAT
GCAGCTTGCC GCCGGGCCGG AATTGAGGAT TTTCGCTTCC ACGATCTGAG GCACACGTGG
GCCAGCTGGC TGATTCAGTC CGGAGTTCCG CTTTCTGTTT TGCAGGAAAT GGGAGGATGG
GAGAGCATCG AGATGGTTCG ACGATATGCT CACCTTGCGC CGAACCATTT AACGGAACAC
GCGAAGCAAA TTGACTCGAT TTTCAGTGAT GATGTCCCAA ATATGTCCCA TATGGAAAAT
AATGATGGAA TTAAAGAGGC GTAA
 
Protein sequence
MSIFRRGEIW YASYSLPGGK RIKESLGTAD KRQAQELHDK RKAELWRVDK LGDFPEVTFE 
EACLRWLEEK ADKKSLDTDK GRMGFWLEHF EGVRIKDITE AKIYAAVSRM QNRKVKEIWQ
QKVESAKRKG KEAPVFEPKP VTTSTKAKHL ALIKAILRAA ERDWKWLEKA PVIKVPSVRN
KRVRWLERDE AKRLIEECPE PLKSVVKFAL ATGLRRSNII NMEWQQIDMQ RRVAWVNPED
SKSNRAIGVA LNDTACKVLR DQIGKHHKWV FVHTKEGIRP DGSKTPTVRK MRVDDQRAWN
AACRRAGIED FRFHDLRHTW ASWLIQSGVP LSVLQEMGGW ESIEMVRRYA HLAPNHLTEH
AKQIDSIFSD DVPNMSHMEN NDGIKEA