Gene SeD_A1074 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A1074 
Symbol 
ID6872825 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp1078187 
End bp1079527 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content49% 
IMG OID642784259 
Productintegrase 
Protein accessionYP_002214933 
Protein GI198243350 
COG category[L] Replication, recombination and repair 
COG ID[COG0582] Integrase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value0.0000184312 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAAAGCA GTACACTCGT CAATGCTCCT GGACGTCAGG AGGGATTAAT GGCTAATGCA 
TCATACCCGA CAGGCGTCGA AAACCACGGC GGTTCGCTCC GCATCTGGTT TCTGTATAAA
GGTAAACGTG TCAGGGAAAA CCTTGGTATC CCTGACACTG CAAAAAATCG CAAGATAGCT
GGCGAACTGC GTTCTTCGGT TTGTTTTGCG ATAAGGATGG GGAATTTTAA CTATGTGGAA
AAATTCCCAA ACTCACCGAA CCTTGCCCGG TTCGGTCAGG ATAGAAAGGA AATTACTGTG
CTGGAGCTTA CCGAAAGATG GTCCGAGCTG AAGAGAATGG AGATCAGCTC TAATACCATG
AGTAGGTACG AATCTATCAT AAAAAACATG CTTCCACTCA TCGGCGAAAA CAAAATGGTT
TCTGCGGTGA CTACTGAGGA TTTGCTGTAT GTCAGGAAGG AGTTGCTGAC GGGCTTTCAG
GTAATGAAGA AGGATCACCG GACTCAGGTT AAAGGCCGGA AATCGTCCAC AGTGAATAAT
TACATGATGC TGATGGCCGA GATCTTCCAG TTTGGAACAG ATAACGGCTA TGCAAAGGAA
AACCCGTTTA GCGGAATTAA CCGTCTCAAG AAAGCGAAAG GGGAACCAGA TCCACTCACG
ACAGACGAGT TCATCAGGTT TATCCAGGCA TGCGGACACC AGCAGATGAG AAATCTCTGG
TCACTGGCAG TCTATACCGG AATGAGGCAT GGGGAGTTGT GCGGTCTGGC CTGGGAAGAT
ATCGATCTGC ATGCCGGGAC GATCATTGTG AAGCGCAACC TTACCCAGAC GGATGAGTTC
ACCCTGCCAA AAACCAACGC AGGTACTGAC AGGGTGATAT ATCTCATTCA ACCAGCTATT
GATGCCCTGA GGAATCAGGC CCAGTTGACA CGCCTTGGCC GGCAGTTTGA GGTTGAAGTG
AAGTTGCGGG AATATGGACA ATCTGTCATT CAGCCCTGCA CGTTCGTATT CAGCCCTCAA
TGCGTCAAAC GTGGACCTCG CACAGGATAT CACTACGCGG TTAATTCCAT TAATAAAATT
TGGGCCCCGA TAATCAAGCG TGCCGGCATT CGTTACCGTA ACGCGTATCA GTCACGACAT
ACCTATGCAT GCTGGTCATT ATCAGCTGGT GCTAACCCAA ACTTTATAGC AACGCAGATG
GGGCATACCG ATGCACAGAT GGTTTACAAG GTGTATGGAA AGTGGATGTC AGAGAAGAGC
GCAGAACAGG TTTCTCTGCT CAACCAGGCA CTTTCCCGCT ATGCCCCATC ACTGCCCCAA
AGCATGGTAG CAGCGCAGTA G
 
Protein sequence
MKSSTLVNAP GRQEGLMANA SYPTGVENHG GSLRIWFLYK GKRVRENLGI PDTAKNRKIA 
GELRSSVCFA IRMGNFNYVE KFPNSPNLAR FGQDRKEITV LELTERWSEL KRMEISSNTM
SRYESIIKNM LPLIGENKMV SAVTTEDLLY VRKELLTGFQ VMKKDHRTQV KGRKSSTVNN
YMMLMAEIFQ FGTDNGYAKE NPFSGINRLK KAKGEPDPLT TDEFIRFIQA CGHQQMRNLW
SLAVYTGMRH GELCGLAWED IDLHAGTIIV KRNLTQTDEF TLPKTNAGTD RVIYLIQPAI
DALRNQAQLT RLGRQFEVEV KLREYGQSVI QPCTFVFSPQ CVKRGPRTGY HYAVNSINKI
WAPIIKRAGI RYRNAYQSRH TYACWSLSAG ANPNFIATQM GHTDAQMVYK VYGKWMSEKS
AEQVSLLNQA LSRYAPSLPQ SMVAAQ