Gene SeHA_C4738 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C4738 
Symbol 
ID6491848 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp4619069 
End bp4620259 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content50% 
IMG OID642744792 
Productintegrase 
Protein accessionYP_002048369 
Protein GI194450328 
COG category[L] Replication, recombination and repair 
COG ID[COG0582] Integrase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.657187 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones76 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCACTTA CAGCCAGACA GGTCGAAACA GCCAGACCTA AAGAAAAAGA CTATAAACTC 
TCTGACGAAC GTGGTTTATA TCTGCTGGTA AAAACCACGG GTGCCCGCTA CTGGCGGCTT
AAATACCGGA TAGCAGGAAA AGAGAAAAAA CTGGCCCTCG GCGTCTATCC CGACGTCTCC
CTTGCTGAGG CCAGAATCAA ACGCGACGAT GCCCGAAAAA TCATCTCCGA AGGTGGTGAC
CCGGGCGAAA AGAAGCGAAA GGAAAAACTC ACTCAGAAAA TCTCTGCCAC CAATACGTTC
CATGCCCTCG CTACGGAATG GCACCAGCAT AAATCTTTGT CATGGTCTGA AAGTTACGCC
AGAAGCGTAC TGGAAGCGCT GGATAAAGAT ATTTTCCCGT ATCTGGGCAA ACGAAGCGTT
ACGGATATCC TCCCGCTGGA AATGCTGGAA ATTCTGCGCC GCATAGAAAA ACGTGGCTCG
CTGGAAAAAC TTCGTAAGGT GCGTCAATAC TGTAATCAGA TTTTTCGTTA TGCCATCGCC
ACCGGACGAG CCACTGTCAA TCCGGCATCT GAACTGACCA GTACGCTGGC GGCGCCAAAA
GCTGCACATT TCCCCCACCT GAGAGCAGAT GAGCTCCCTG TTTTTCTCCG GAAGCTCGCT
GAGTATCATG GCAGTCCTGT TACCCGCATG GCGACAAATC TGCTGCTTCT GACAGGCCTC
AGAACGATTG AACTACGGTC CGCTGAATGG TCAGAAATTG ATTTTGATAA TGCCCTGTGG
ACAATCCCTG AAAGCCGCAT GAAAATGCGA CGTAAACATG TCGTACCACT GTCACGACAG
GCCACTGACA TTCTGCTGCA GCTCAAAACT TTCTCCGGAC AATACCGGCT GGTTTTCCCG
GGACGTTGTG ATATCAACAA GCCAATGAGC GAAGCCAGCA TCAATATGGT GCTCAAACGT
ATCGGTTACG ATGGCAGGGC AACCGGTCAT GGTTTTCGTC ACACCATGAG TACCATTCTG
CACGAACAGG GCTTTAATTC TGCCTGGATT GAAATGCAGT TAGCTCATGT GGATAAAAAC
GCCATCAGGG GTACCTATAA TCATGCCCAG TATCTCGATG GTCGCCGTGA AATGATGCAA
TGGTACGCAG ATTACATTGA TTCGCTTTCC AGGCAAGAGA GTCAGGGTTA A
 
Protein sequence
MALTARQVET ARPKEKDYKL SDERGLYLLV KTTGARYWRL KYRIAGKEKK LALGVYPDVS 
LAEARIKRDD ARKIISEGGD PGEKKRKEKL TQKISATNTF HALATEWHQH KSLSWSESYA
RSVLEALDKD IFPYLGKRSV TDILPLEMLE ILRRIEKRGS LEKLRKVRQY CNQIFRYAIA
TGRATVNPAS ELTSTLAAPK AAHFPHLRAD ELPVFLRKLA EYHGSPVTRM ATNLLLLTGL
RTIELRSAEW SEIDFDNALW TIPESRMKMR RKHVVPLSRQ ATDILLQLKT FSGQYRLVFP
GRCDINKPMS EASINMVLKR IGYDGRATGH GFRHTMSTIL HEQGFNSAWI EMQLAHVDKN
AIRGTYNHAQ YLDGRREMMQ WYADYIDSLS RQESQG