Gene SeHA_C1124 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C1124 
Symbol 
ID6491236 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp1112864 
End bp1113844 
Gene Length981 bp 
Protein Length326 aa 
Translation table11 
GC content55% 
IMG OID642741366 
ProductIS5 transposase 
Protein accessionYP_002045018 
Protein GI194448842 
COG category[L] Replication, recombination and repair 
COG ID[COG3039] Transposase and inactivated derivatives, IS5 family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones72 
Fosmid unclonability p-value0.756281 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCATC AACTCACCTT CGCCGATAGT GAATTCAGCA CTAAGCGCCG TCAGACCCGA 
AAAGAGATTT TCCTCTCCCG CATGGAGCAG ATTCTGCCAT GGCAGAATAT GACCGCTGTC
ATCGAGCCGT TTTATCCCAA GGCGGGCAAT GGCCGACGGC CCTATCCGCT GGAGACCATG
CTGCGTATTC ACTGCATGCA GCATTGGTAC AACCTGAGCG ACGGTGCCAT GGAAGATGCC
CTGTACGAAA TCGCCTCCAT GCGCCTGTTT GCCCGATTAT CCCTGGATAG CGCCCTGCCG
GATCGCACCA CCATCATGAA TTTCCGCCAC CTGCTCGAGC AGCATCAACT GGCCCGTCAA
TTGTTCAAGA CCATCAATCG CTGGCTGGCC GAAGCAGGCG TCATGATGAC CCAAGGCACT
TTGGTGGATG CCACCATCAT TGAGGCACCC AGCTCTACCA AGAACAAAGA GCAGCAACGC
GATCCGGAGA TGCATCAGAC CAAGAAAGGC AATCAGTGGC ACTTTGGCAT GAAGGCCCAC
ATTGGTGTCG ATGCCAAGAG TGGCCTGACC CACAGCCTAG TCACCACCGC GGCCAACGAG
CATGACCTCA ATCAGCTGGG TAATCTGCTT CATGGAGAGG AGCAATTTGT CTCAGCCGAT
GCCGGCTACC AAGGAGCGCC ACAGCGCGAG GAGCTGGCCG AGGTGGATGT GGACTGGCTG
ATCGCCGAGC GTCCCGGCAA GGTAAAAACC TTGAAGCAGC ATCCGCGCAA GAACAAAACG
GCCATCAACA TCGAATACAT GAAAGCCAGC ATCCGTGCCA AGGTAGAGCA CCCGTTTCGC
ATCATCAAGC GGCAGTTCGG CTTCGTGAAA GCCAGATACA AGGGGCTGCT GAAAAACGAT
AACCAACTGG CGATGTTATT CACCCTGGCC AACCTGTTTC GGGTGGACCA AATGATACGT
CAGTGGGAGA GATCTCACTA A
 
Protein sequence
MSHQLTFADS EFSTKRRQTR KEIFLSRMEQ ILPWQNMTAV IEPFYPKAGN GRRPYPLETM 
LRIHCMQHWY NLSDGAMEDA LYEIASMRLF ARLSLDSALP DRTTIMNFRH LLEQHQLARQ
LFKTINRWLA EAGVMMTQGT LVDATIIEAP SSTKNKEQQR DPEMHQTKKG NQWHFGMKAH
IGVDAKSGLT HSLVTTAANE HDLNQLGNLL HGEEQFVSAD AGYQGAPQRE ELAEVDVDWL
IAERPGKVKT LKQHPRKNKT AINIEYMKAS IRAKVEHPFR IIKRQFGFVK ARYKGLLKND
NQLAMLFTLA NLFRVDQMIR QWERSH