Gene SeHA_C1107 details

Gene Information

Locus tag: SeHA_C1107
Symbol:
ID: 6488185
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Heidelberg str. SL476
Kingdom: Bacteria
Replicon accession: NC_011083
Strand:
Start bp: 1102597
End bp: 1103937
Gene Length: 1341 bp
Protein Length: 446 aa
Translation table: 11
GC content: 49%
IMG OID: 642741349
Product: integrase
Protein accession: YP_002045001
Protein GI: 194448103
COG category: [L] Replication, recombination and repair
COG ID: [COG0582] Integrase
TIGRFAM ID:
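
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the gene record above.
start_bp = 1102597
end_bp = 1103937

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: one codon per amino acid, with the terminal stop codon excluded.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")     # 1341 bp
print(protein_length, "aa")  # 446 aa
```

Under these assumptions the computed values agree with the reported Gene Length and Protein Length.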


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 71
Fosmid unclonability p-value: 0.781225
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAGCA GTACACTCGT CAATGCTCCT GGACGTCAGG AGGGATTAAT GGCTAATGCA 
TCATACCCGA CAGGCGTCGA AAACCACGGC GGTTCGCTCC GCATCTGGTT TCTGTATAAA
GGTAAACGTG TCAGGGAAAA CCTTGGTATC CCTGACACTG CAAAAAATCG CAAGATAGCT
GGCGAACTGC GTTCTTCGGT TTGTTTTGCG ATAAGGATGG GGAATTTTAA CTATGTGGAA
AAATTCCCAA ACTCACCGAA CCTTGCCCGG TTCGGTCAGG ATAGAAAGGA AATTACTGTG
CTGGAGCTTA CCGAAAGATG GTCCGAGCTG AAGAGAATGG AGATCAGCTC TAATACCATG
AGTAGGTACG AATCTATCAT AAAAAACATG CTTCCACTCA TCGGCGAAAA CAAAATGGTT
TCTGCGGTGA CTACTGAGGA TTTGCTGTAT GTCAGGAAGG AGTTGCTGAC GGGCTTTCAG
GTAATGAAGA AGGATCACCG GACTCAGGTT AAAGGCCGGA AATCGTCCAC AGTGAATAAT
TACATGATGC TGATGGCCGA GATCTTCCAG TTTGGAACAG ATAACGGCTA TGCAAAGGAA
AACCCGTTTA GCGGAATTAA CCGTCTCAAG AAAGCGAAAG GGGAACCAGA TCCACTCACG
ACAGACGAGT TCATCAGGTT TATCCAGGCA TGCGGCCACC AGCAGATGAG AAATCTCTGG
TCACTGGCAG TCTATACCGG AATGAGGCAT GGGGAGTTGT GCGGTCTGGC CTGGGAAGAT
ATCGATCTGC ATGCCGGGAC GATCATTGTG AAGCGCAACC TTACCCAGAC GGATGAGTTC
ACCCTGCCAA AAACCGACGC AGGTACTGAC AGGGTGATAT ATCTCATTCA ACCAGCTATT
GATGCCCTGA GGAATCAGGC CCAGTTGACA CGCCTTGGCC GGCAGTTTGA GGTTGAAGTG
AAGTTGCGGG AATATGGACA ATCTGTCATT CAGCCCTGCA CGTTCGTATT CAGCCCTCAA
TGCGTCAAAC GTGGACCTCG CACAGGATAT CACTACGCGG TTAATTCCAT TAATAAAATT
TGGGCCCCGA TAATCAAGCG TGCCGGCATT CGTTACCGTA ACGCGTATCA GTCACGACAT
ACCTATGCAT GCTGGTCATT ATCAGCTGGT GCTAACCCAA ACTTTATAGC AACGCAGATG
GGGCATACCG ATGCACAGAT GGTTTACAAG GTGTATGGAA AGTGGATGTC AGAGAAGAGC
GCAGAACAGG TTTCTCTGCT CAACCAGGCA CTTTCCCGCT ATGCCCCATC ACTGCCCCAA
AGCATGGTAG CAGCGCAGTA G
 
Protein sequence
MKSSTLVNAP GRQEGLMANA SYPTGVENHG GSLRIWFLYK GKRVRENLGI PDTAKNRKIA 
GELRSSVCFA IRMGNFNYVE KFPNSPNLAR FGQDRKEITV LELTERWSEL KRMEISSNTM
SRYESIIKNM LPLIGENKMV SAVTTEDLLY VRKELLTGFQ VMKKDHRTQV KGRKSSTVNN
YMMLMAEIFQ FGTDNGYAKE NPFSGINRLK KAKGEPDPLT TDEFIRFIQA CGHQQMRNLW
SLAVYTGMRH GELCGLAWED IDLHAGTIIV KRNLTQTDEF TLPKTDAGTD RVIYLIQPAI
DALRNQAQLT RLGRQFEVEV KLREYGQSVI QPCTFVFSPQ CVKRGPRTGY HYAVNSINKI
WAPIIKRAGI RYRNAYQSRH TYACWSLSAG ANPNFIATQM GHTDAQMVYK VYGKWMSEKS
AEQVSLLNQA LSRYAPSLPQ SMVAAQ
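
The relationship between the gene sequence and the protein sequence can be illustrated by translating codons directly. This is a minimal sketch, not the database's own method: the `translate` helper and the table-building code are hypothetical, and it relies on the understanding that translation table 11 assigns the same amino acids to sense codons as the standard genetic code (the tables differ in permitted start codons). Only the first and last lines of the gene sequence are used here; the last line is in frame because the 22 preceding lines each hold 60 nucleotides.

```python
from itertools import product

# Compact codon table in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna):
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)

# First and last lines of the gene sequence, copied from the record.
first_line = "ATGAAAAGCA GTACACTCGT CAATGCTCCT GGACGTCAGG AGGGATTAAT GGCTAATGCA"
last_line = "AGCATGGTAG CAGCGCAGTA G"

print(translate(first_line))  # MKSSTLVNAPGRQEGLMANA
print(translate(last_line))   # SMVAAQ
```

The two outputs match the start and end of the protein sequence listed above; passing the full gene sequence to the same helper would be the natural next check.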