Gene SeD_A3066 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A3066 
Symbol 
ID6871814 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp2955002 
End bp2956027 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content49% 
IMG OID642786095 
Productphage integrase 
Protein accessionYP_002216741 
Protein GI198243118 
COG category[L] Replication, recombination and repair 
COG ID[COG4974] Site-specific recombinase XerD 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones72 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCATCA GAAAACAGCC TAACGGAAAA TGGTTGTGTG AATGCTACCC GAACGGACGG 
GATGGCAAAC GTGTACGCAA GCAATTTGCG ACTAAAGGCG AGGCCATAGC ATTCGAAAAC
CACACCATGG ATGAGGTGAA CAAAAAACCG TGGCTGGGGG AGAAGGAAGA TCGGCGGCAT
TTGTCAGAAG TGATTGATCA GTGGCATTTA CTTTATGGGC AGACGCTGGC AGACCCCAAA
CGCCTGATGG CAAAACGCAG CATTATTTGT AATGGCTTGG GCGATCCCAT TGCCTCAGAG
TTAACCGCAG GCGATTTTAC GAAATACAGG GAAGCACGGT TAAAAGGTGA AGTAAAAAAT
GAAGATGGCG TGCTTATGTC GCCAGTTAAG CCCCGTACGG TAAACCTTGA ACAACGTAAC
CTATCATCTG TTTTTGGCAC ACTGAAAAAG CTGGGCCACT GGTCAGCACC CAACCCGCTC
GCTGGGCTAC CAACATTCAA AATCGCAGAG GGCGAACTGG CGTTCCTGGC ACCGGAAGAA
ATTAAACGTC TACTGGATGC CTGTGCTGAT TCTCAGAGTC CCAGTTTGCT GATGATTGCA
AAAGTATGCC TGGCAACTGG CGCCCGATGG AGTGAAGCTG AAAACCTGCA GGGCCATCAG
CTATCAAAAT ACCGCATCAC TTATACCAAG ACGAAGGGCA AGAAAAACCG TACCGTGCCA
ATATCTCAGG ATCTGTATGA AGAACTCCCC AAAAACAGAG GGAAGCTATT CACGCCATGC
AGAAAAGCTT TTGAGCGTGC AGTAAAAAGA GCTGGTATTG AGCTACCAGA AGGCCAATGT
ACCCACGTGC TGCGCCATAC CTTCGCCAGC CACTTTATGA TGAACGGCGG AAACATACTG
GTACTGCGCG ATATTCTGGG CCACGCAGAT ATAAAAATGA CGATGGTTTA CGCTCACTTT
GCCCCTGACC ACCTCGAAGA CGCAGTGACA AAAAACCCGC TTCACAACCT CAATTGGAAA
CGCTAA
 
Protein sequence
MSIRKQPNGK WLCECYPNGR DGKRVRKQFA TKGEAIAFEN HTMDEVNKKP WLGEKEDRRH 
LSEVIDQWHL LYGQTLADPK RLMAKRSIIC NGLGDPIASE LTAGDFTKYR EARLKGEVKN
EDGVLMSPVK PRTVNLEQRN LSSVFGTLKK LGHWSAPNPL AGLPTFKIAE GELAFLAPEE
IKRLLDACAD SQSPSLLMIA KVCLATGARW SEAENLQGHQ LSKYRITYTK TKGKKNRTVP
ISQDLYEELP KNRGKLFTPC RKAFERAVKR AGIELPEGQC THVLRHTFAS HFMMNGGNIL
VLRDILGHAD IKMTMVYAHF APDHLEDAVT KNPLHNLNWK R