Gene SeHA_C0452 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C0452 
Symbol 
ID6489290 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp450457 
End bp452415 
Gene Length1959 bp 
Protein Length652 aa 
Translation table11 
GC content46% 
IMG OID642740721 
Producttype III restriction-modification system StyLTI enzyme mod 
Protein accessionYP_002044388 
Protein GI194451308 
COG category[L] Replication, recombination and repair 
COG ID[COG2189] Adenine specific DNA methylase Mod 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value1.31948e-23 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGTTGAAAG ATAACCAAAA ACACAACGAG TCTGTTGCCC CGAATAGCGT CTTTCTGTCT 
GAGTTACAAC GTGCATTACC GGAATTTTTT ACCGCCGATT GCTATAACGA GCAGGGCGAA
CTGATCGCGA AAGGCGGATT TGATCTCGCC AAATTTGAGC GCGCGCTGAA AGCGCGTAAT
ATTGATGAGC TGACTAGCGG TTATCAGATT GATTTTATTG GCAAAGATTA CGCAAAAAAA
CAGGCGGGTG AAAAATCCGT TACCGTTATC GTTCCTGACG TGGAACACAA TACTCTGGCA
GAAAACAAAA ACAGCCATAA TCTTTTTCTG ACCGGGGATA ATCTGGATGT TTTACGCCAT
CTGCAAAATA ATTACGCCGA TACCGTCGAT ATGATCTATA TAGATCCCCC TTATAACACC
GGATCGGACG GGTTTGTTTA TCCCGATCAT TTTGAATATA GCGATCGGGC GTTGCAGGAT
ATGTTTGGTC TTAATGATAC TGAACTGGCA CGTTTAAAAT CCATTCAGGG TAAATCGACG
CACTCCGCGT GGTTATCTTT TATGTATCCG CGTCTTTTCC TGGCCAGGAA GCTCCTGAAA
GATACCGGAT TTATTTTTAT CTCTATCGAC GATAATGAGT ACGCCAATCT TAAATTAATG
ATGGATGAGA TTTTTGGCGA AGGCGGATTT GTCACCAATG TGATGTGGAA GCGCAAAAAA
GAGATTTCTA ACGACTCTGA TAACGTTTCC ATCCAGGGGG AATACATTCT TGTTTATGCC
AAAACCGGTC AGGGCGCTTT ACGTTTAGAA CCGCTTTCTA AAGAGTATAT TCAGAAATCC
TATAAAGAAC CGACCGAACA GTTTCCAGAA GGGAAATGGC GGCCGGTGCC GTTAACGGTG
TCAAAAGGGC TGAGCGGCGG CGGCTATACC TATAAAATTA CCACGCCGAA CGGTACGGTA
CACGAAAGAC TATGGGCTTA TCCTGAAGCC AGTTACCAAA AACTGGTGGC CGATAATCTG
GTCTATTTTG GCAAAGATAA CGGCGGAATT CCCCAGCGAG TCATGTACGC GCATCACAGT
AAAGGGCAGC CAACGACCAA TTACTGGGAT AACGTAGCGT CGAATAAAGA GGGGAAAAAG
GAGATTCTGG ATCTCTTCGG CGACAACGTT TTTGATACGC CGAAACCGAC CGCATTATTG
AAGAAAATCA TCAAGCTCGC TATCGATAAA GACGGCGTCG TTCTGGACTT TTTTGCCGGT
TCCGGCACCA CGGCCCATGC GGTAATGGCG CTGAATGAAG AAGATGGGGG GCAGCGCACG
TTTATTCTGT GCACTATCGA TCAGGCATTA AGCAATAACA CTATCGCGAA AAAAGCAGGT
TATAACACTA TTGATGAAAT CAGCCGCGAG CGAATTACAC GCGTTGCGGC GAAGATCCGC
GCCAACAATC CCGCGACCAA TAGCGATCTC GGTTTTAAAC ATTATCGTTT TGCCACTCCG
ACACAGCAGA CGCTGGACGA TCTGGATAGC TTCGATATTG CTACCGGCCA TTTTATCAAT
ACCAGCGGTC AACTGGCCGC TTTCACCGAG TCAGGATTTA CCGACATGAT CAATCCTTTT
TCCGCCAGAG GATTGGGCGT GCCGGGCGGC GCAAGCGGCG AAGAGACCTT ATTAACGACA
TGGCTGGTCG CCGATGGTTA TAAAATGGAT ATTGACGTAC AGACCATTGA TTTTTCCGGC
TATTGCGCCA GGTATGTTGA TAATACGCGC CTGTATCTGA TTGATGAACG ATGGGGAACA
GAGCAGACCC GCGATCTTCT CAACCACATT GGTACGCACC AGCTTCCGGT TCAGACCATT
GTCATTTACG GCTACTCTTT CGACCTTGAA TCCATTCGTG AACTGGAAAT CGGCTTAAAA
CAGCTTGATC AAAAAGTGAA CCTGGTGAAG CGTTATTAA
 
Protein sequence
MLKDNQKHNE SVAPNSVFLS ELQRALPEFF TADCYNEQGE LIAKGGFDLA KFERALKARN 
IDELTSGYQI DFIGKDYAKK QAGEKSVTVI VPDVEHNTLA ENKNSHNLFL TGDNLDVLRH
LQNNYADTVD MIYIDPPYNT GSDGFVYPDH FEYSDRALQD MFGLNDTELA RLKSIQGKST
HSAWLSFMYP RLFLARKLLK DTGFIFISID DNEYANLKLM MDEIFGEGGF VTNVMWKRKK
EISNDSDNVS IQGEYILVYA KTGQGALRLE PLSKEYIQKS YKEPTEQFPE GKWRPVPLTV
SKGLSGGGYT YKITTPNGTV HERLWAYPEA SYQKLVADNL VYFGKDNGGI PQRVMYAHHS
KGQPTTNYWD NVASNKEGKK EILDLFGDNV FDTPKPTALL KKIIKLAIDK DGVVLDFFAG
SGTTAHAVMA LNEEDGGQRT FILCTIDQAL SNNTIAKKAG YNTIDEISRE RITRVAAKIR
ANNPATNSDL GFKHYRFATP TQQTLDDLDS FDIATGHFIN TSGQLAAFTE SGFTDMINPF
SARGLGVPGG ASGEETLLTT WLVADGYKMD IDVQTIDFSG YCARYVDNTR LYLIDERWGT
EQTRDLLNHI GTHQLPVQTI VIYGYSFDLE SIRELEIGLK QLDQKVNLVK RY