Gene YpAngola_A1242 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A1242 
Symbol 
ID5799707 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp1295481 
End bp1296806 
Gene Length1326 bp 
Protein Length441 aa 
Translation table11 
GC content32% 
IMG OID641339213 
Productmodification methylase 
Protein accessionYP_001605783 
Protein GI162421405 
COG category[L] Replication, recombination and repair 
COG ID[COG0863] DNA modification methylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.00503065 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.00000000592753 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAACGACA TAGACAAAGA CAATATGCTC TTGAATAAGG TAATTTCAAA TTCGAAAGGT 
GACAAAGCTT ATTGGTCATT TAAGGGGCGA GCAAAAAGAC AATACTGTCA TGCTTTGATT
CAATATCCAG CAATGATGGT TCCAGCTATG CAAGGGGAAC TTATAGATGC TGTATTAGAA
ATAGATTCAA ACGTAACAAG TATAATAGAT CCCTTTGTCG GTTCAGGGAC TACTCTTGGT
GAAGCTATGC GCCGAGGGCT TGATTTTGTC GGGATGGATA TCAATCCTCT TTCAATTTTA
GCATGTGAAG TAAAAAGTGG CCCATTGTAT ATTGATTCTT TTAAAGAAAA ATTCATTTTT
CTATTGGATA AAATACGTGC TGATTTACGA GTAGACATAG CTGTTCAAAT AAAAAATATT
GATAAATGGT TTACTAAAGA AGTCCAGTTA GATTTAAGTA AGATATATAG GGCTATTCAG
GACGAAAGTT CTAAGTGGGC TAGGAAGGTT TTTTGGTTGT GCATGAGTAA TACTGTTCGT
ACCGTTTGTA ATTCAAGAAG TTCAACGTTT AAACTTCATA TTAAATCGTT CAATCAAATC
GAGAATATAC CTAATACATT AGAAGTTTTT ACAAAAAACA TTGAAAAAAG TATAGTTGCG
TTATGTGAGC AGAAGAACAT ACTTACTGAA ATGGGCAGTT TAAAAAGATC TATTTCTAAG
AGTAAAATTA AAATAATACA TGCTGACACT TCTAAGAAGA TGAAAAAAAA TATCCAATGT
GATCTTCTGG TAAGTTCTCC ACCATACGGA GATAATAATA CTACCGTCAC TTATGGGCAA
TTTTCTTATT TATCCTTAAA ATGGATTGAT GAAAAAGATA TAGATAATTT GAAAATTAAA
GGTTTGCTTG AAACTCAAAA CAAAATTGAT TCTAGCAGCC TTGGTGGCTC CCTTAAGGGA
GCCAATGAAA AAATGGAACT CTTGAAATAT AAGTCTCCTA CTTTGCTAAA TACTGTAAAA
GAAATTTCAA AAGTTAACTC TGAAAATGTT AAGAAGTTAA TTACATTTAT TTTTGATCTA
GATCTTGCCT TGGATAATGC GTTGTTATAT CTGCGGAAAA ATGCGTATAT GATATGGACA
CTCGGTAACA GAAGAATTTC CAACATTGAA GTCCCATTAG ACAGAATAAT GAGAGAGATT
CTAGAATATA AAGGTTGTGT TTTTATTCAT CAAATAGAAA GGGAAATATT ATCTAAGAGA
ATGGCGATGA AAAATAGCAT TGCTAATACG ATGGATAAAG AATTGATTCT TATAATGAGG
AAGTAA
 
Protein sequence
MNDIDKDNML LNKVISNSKG DKAYWSFKGR AKRQYCHALI QYPAMMVPAM QGELIDAVLE 
IDSNVTSIID PFVGSGTTLG EAMRRGLDFV GMDINPLSIL ACEVKSGPLY IDSFKEKFIF
LLDKIRADLR VDIAVQIKNI DKWFTKEVQL DLSKIYRAIQ DESSKWARKV FWLCMSNTVR
TVCNSRSSTF KLHIKSFNQI ENIPNTLEVF TKNIEKSIVA LCEQKNILTE MGSLKRSISK
SKIKIIHADT SKKMKKNIQC DLLVSSPPYG DNNTTVTYGQ FSYLSLKWID EKDIDNLKIK
GLLETQNKID SSSLGGSLKG ANEKMELLKY KSPTLLNTVK EISKVNSENV KKLITFIFDL
DLALDNALLY LRKNAYMIWT LGNRRISNIE VPLDRIMREI LEYKGCVFIH QIEREILSKR
MAMKNSIANT MDKELILIMR K