Gene SeD_A0389 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A0389 
Symbol 
ID6873325 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp406724 
End bp408682 
Gene Length1959 bp 
Protein Length652 aa 
Translation table11 
GC content47% 
IMG OID642783623 
Producttype III restriction-modification system StyLTI enzyme mod 
Protein accessionYP_002214310 
Protein GI198243123 
COG category[L] Replication, recombination and repair 
COG ID[COG2189] Adenine specific DNA methylase Mod 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value1.57999e-20 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGTTGAAAG ATAACCAAAA ACACAACGAG TCTGTTGCCC CGAATAGCGC CTTTCTGTCT 
GAGTTACAAC GTGCATTACC GGAATTTTTT ACCGCCGATC GCTATAACGA GCAGGGCGAA
CTGATCGCGA AAGGCGGATT TGATCTCGCC AGATTTGAGC GCGCGCTGAA AGCGCGTAAT
ATTGATGAGC TGACTAGCGG TTATCAGATT GATTTTATTG GCAAAGATTA CGCAAAAAAA
CAGGCGGGTG AAAAATCCGT TACCGTTATC GTTCCTGACG TGGAACACAA TACTCTGGCA
GAAAATAAAA ACAGCCATAA TCTTTTTCTG ACCGGAGATA ATCTGGATGT TTTACGCCAT
CTGCAAAATA ATTACGCCGA TACCGTCGAT ATGATCTATA TCGATCCCCC TTATAACACC
GGATCGGACG GGTTTGTCTA TCCCGATCAT TTTGAATATA GCGATCGGGC GTTGCAGGAT
ATGTTTGGTC TTAATGATAC CGAACTGGCA CGTTTAAAAT CCATTCAGGG TAAATCGACG
CACTCCGCGT GGTTATCTTT CATGTATCCG CGTCTTTTCC TGGCCAGGAA GCTCCTGAAA
GATACCGGAT TTATTTTTAT CTCTATCGAC GATAATGAGT ACGCCAATCT TAAATTAATG
ATGGATGAGA TTTTTGGCGA AGGCGGATTT GTCACCAATG TGATGTGGAA GCGCAAAAAA
GAGATTTCTA ACGACTCTGA TAACGTTTCC ATCCAGGGGG AATACATTCT TGTTTACGCC
AAAACCGGTC AGGGCGCTTT ACGTTTAGAA CCGCTTTCTA AAGAGTATAT TCAGAAATCC
TATAAAGAAC CGACCGAACA GTTTCCAGAA GGGAAATGGC GGCCGGTGCC GTTAACGGTG
TCAAAAGGGC TGAGCGGCGG CGGCTATACC TATAAAATTA CCACGCCGAA CGGTACGGTA
CACGAAAGAC TATGGGCTTA TCCTGAAGCC AGTTACCAAA AACTGGTGGC CGATAATCTG
GTCTATTTTG GCAAAGATAA CGGCGGTATT CCCCAGCGAG TCATGTACGC GCATCACAGT
AAGGGGCAGC CAACGACCAA TTACTGGGAT AACGTAGCGT CGAATAAAGA GGGGAAAAAG
GAGATTCTGG ATCTCTTCGG CGACAACGTT TTTGATACGC CGAAACCGAC CGCATTATTG
AAGAAAATCA TCAAGCTCGC TATCGATAAA GACGGCGTCG TCCTGGACTT TTTTGCCGGT
TCCGGCACTA CGGCCCATGC GGTAATGGCG CTGAATGAAG AAGATGGGGG GCAGCGCACG
TTTATTCTGT GTACTATCGA TCAGGCATTA AGCAATAACA CTATCGCGAA AAAAGCAGGT
TATAACACTA TTGATGAAAT CAGCCGCGAG CGAATTACAC GCGTTGCGGC GAAGATCCGC
GCCAACAATC CCGCGACCAA TAGCGATCTC GGTTTTAAAC ATTATCGTTT TGCCACTCCG
ACACAGCAGA CGCTGGACGA TCTGGATAGC TTCGATATTG CTACCGGCCA TTTTATCAAT
ACCAGCGGTC AACTGGCCGC TTTCACCGAG TCAGGATTTA CCGACATGAT CAATCCTTTT
TCCGCCAGAG GATTGGGCGT GCCGGGCGGC GCAAGCGGCG AAGAGACCTT ATTAACGACA
TGGCTGGTCG CCGATGGTTA TAAAATGGAT ATTGACGTAC AGACCGTTGA TTTTTCCGGC
TATTGCGCCA GGTATGTTGA TAATACGCGC CTGTATCTGA TTGATGAACG ATGGGGAACA
GAGCAGACCC GCGATCTTCT CAACCACATT GGTACGCACC AGCTTCCGGT TCAGACCATT
GTCATTTACG GCTACTCTTT CGACCTTGAA TCCATTCGTG AACTGGAAAT CGGCTTAAAA
CAGCTTGATC AAAAAGTGAA CCTGGTGAAG CGTTATTAA
 
Protein sequence
MLKDNQKHNE SVAPNSAFLS ELQRALPEFF TADRYNEQGE LIAKGGFDLA RFERALKARN 
IDELTSGYQI DFIGKDYAKK QAGEKSVTVI VPDVEHNTLA ENKNSHNLFL TGDNLDVLRH
LQNNYADTVD MIYIDPPYNT GSDGFVYPDH FEYSDRALQD MFGLNDTELA RLKSIQGKST
HSAWLSFMYP RLFLARKLLK DTGFIFISID DNEYANLKLM MDEIFGEGGF VTNVMWKRKK
EISNDSDNVS IQGEYILVYA KTGQGALRLE PLSKEYIQKS YKEPTEQFPE GKWRPVPLTV
SKGLSGGGYT YKITTPNGTV HERLWAYPEA SYQKLVADNL VYFGKDNGGI PQRVMYAHHS
KGQPTTNYWD NVASNKEGKK EILDLFGDNV FDTPKPTALL KKIIKLAIDK DGVVLDFFAG
SGTTAHAVMA LNEEDGGQRT FILCTIDQAL SNNTIAKKAG YNTIDEISRE RITRVAAKIR
ANNPATNSDL GFKHYRFATP TQQTLDDLDS FDIATGHFIN TSGQLAAFTE SGFTDMINPF
SARGLGVPGG ASGEETLLTT WLVADGYKMD IDVQTVDFSG YCARYVDNTR LYLIDERWGT
EQTRDLLNHI GTHQLPVQTI VIYGYSFDLE SIRELEIGLK QLDQKVNLVK RY