Gene YpAngola_A1239 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A1239 
Symbol 
ID5799704 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp1291081 
End bp1293144 
Gene Length2064 bp 
Protein Length687 aa 
Translation table11 
GC content48% 
IMG OID641339210 
Producthypothetical protein 
Protein accessionYP_001605780 
Protein GI162419185 
COG category[V] Defense mechanisms 
COG ID[COG1401] GTPase subunit of restriction endonuclease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.793158 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.00100345 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGTTCAGG ATTCCTCTGC GTTAGCCTCT TTCTTTCGTT CTCATATCGC AAAAAATAAT 
CAATACAGCC CCGACTGGGG CAAACACTAT CGTCAGCTCG TAGAACAGGT CCAAGCCGAA
AAACCCAACT TCTCTGCGGA TACAATTCGG CAGATTTGGT ACGAACGTAG CAATGGTATC
TCCAGCCTCA GGCAGGGCGG TATGTCCCAG TTTGAATTCG AAACGGCGCA AAAACAGCTG
CTCGCTCTCA CCCGTCAGAT GGCAGATGAG TGTACGCAGG AGAACTACGA TCGGGTGATT
GAGCGCCTTG TTAAACTTAA AACAGATGGG ATATTAAACA AAGTATATTG GGCACTTTGC
CATCGTGCCT TTGCCGCGCT CTACCCGTCG AAAATTACCA ACGTTGTGAA TGTCAGCAAT
TTTTTCAGCA GTTATAACTA TTGCAATAAC CATTTTCAAC TTGACCTTTC TGAGGATCAG
GAGTGGTTCG CTCGCAACCT TGATCTCAAG AAAGTGTTGC ATCGAGCATT GGGCGATGAT
GTCGATCCTA TCGAACTGAA TATGTCGCTC TGGCATCTCT ACACGCAGGT GATTCAGAAG
AAGAATGATA TTGGGTTAAT CGGTAGTGAG ACTGCCTTGC CAGAGGATGA GCAAGACGAG
GAGGTTACCA CTCCGCAACT GCCAAAAAAT ACTATCCTCT ACGGACCTCC AGGTACCGGT
AAAACTTACT GTACCATCGA GCTAGCCGTA CGCGCCTGCG AACCAGCAGC CTACTCGCTG
CAGGAAGGCA AAGAAGAGAA CGAAAAACGC CGTGAGCTGA AAAAGGTTTA TGACCGACTG
ATTGCTGAAA AGCGGGTGCG CTTTATCACC TTCCATCAGA GTTTTGGCTA TGAAGAGTTT
ATCGAAGGGC TAAGAGCAGA AACCACCGAT GATGGCAATG TACGCTATGA AGTCAAAGCG
GGCATTTTTA AGCAAATTTG TGAAGATGCT GCTTTTGGTC ACGCCGGTGT TCAGCAGAAG
CTTGACGAGG CGCTTGCCCG CCTACAAGAA CGTTTATCTG AAAGTGGAAG CATTACCCTT
GAAACGCTGC AGGGCAAAGC CTTCCAACTG GCTTACAAGA GCCAGACCAC CTTTGGTATT
TTCCCGTCGC AATCGAAAAA AGAGGATTTA GGGCAGGGTT ACAATGCTTA CCTCAAGAAT
ATCAGTCTGG TTTACCAAAA CCCTCAGGCA AAGGTACACA ATCCTTCATA TGTACGTAGT
ATCCTTAATT ACCTCATTAA ACAGGAAGGT TTGCCTTCCA ACCCACAGGA ATCTGTATCT
GAAAAACGGC AAAACTACGT TCTGATTATT GACGAGATTA ACCGTGGCAA CATCTCCAAA
ATCTTCGGCG AACTGATCAC CCTGATAGAA ACCTCTAAAC GTGCCGGGGA ACCGGAGGCG
CTTAGCGTGA TTCTGCCTTA CTCTTCCAGT TCGTTCAGCG TACCGAATAA TCTCTATTTA
ATCGGCACCA TGAACACTGC TGACCGTTCG CTCACCGCAC TGGATACCGC CCTGCGTCGC
CGATTCGAGT TTGAAGCGAT GCTGCCGGAT ATCACCGTTC TAAAAGAAAC TGTTGTCAAA
GGTATTGATC TGCCACGTCT ACTGCAGACC TTGAATGACC GCATTGAGGT GCTGTACGAC
AGAGAACATA CACTAGGCCA TGCCTTTTTC ATCCCAGTAG TTCAGGTAAA AGAGGACGAA
GACCTGGCAT TTGAAAGGCT TAAGCGGATC ATGCGAAACA AGGTGCTGCC TCTGTTGGAG
GAGTACTTCT ACAACGACTG GCAAAAAATT CGTATGGTGT TGGGCGACAA TCAGAAAAGC
GGAAACCCGC AGCTACAGTT CGTGTGCGAG GTGAAAGACC AGAAACAGTT TGCCGATCTT
TTTGGCAATA ACGGAACTGA AGATCTGCAT GATATAGGTG CCAGTTTCCA TCTAGCCCCT
GAGGACGACA AGGTTTGGGA TAACCCGCTC GCCTGGCAAC AAATTTATGC GCCACATAAA
GGTAACCCGG TGAGTGGGGA ATGA
 
Protein sequence
MVQDSSALAS FFRSHIAKNN QYSPDWGKHY RQLVEQVQAE KPNFSADTIR QIWYERSNGI 
SSLRQGGMSQ FEFETAQKQL LALTRQMADE CTQENYDRVI ERLVKLKTDG ILNKVYWALC
HRAFAALYPS KITNVVNVSN FFSSYNYCNN HFQLDLSEDQ EWFARNLDLK KVLHRALGDD
VDPIELNMSL WHLYTQVIQK KNDIGLIGSE TALPEDEQDE EVTTPQLPKN TILYGPPGTG
KTYCTIELAV RACEPAAYSL QEGKEENEKR RELKKVYDRL IAEKRVRFIT FHQSFGYEEF
IEGLRAETTD DGNVRYEVKA GIFKQICEDA AFGHAGVQQK LDEALARLQE RLSESGSITL
ETLQGKAFQL AYKSQTTFGI FPSQSKKEDL GQGYNAYLKN ISLVYQNPQA KVHNPSYVRS
ILNYLIKQEG LPSNPQESVS EKRQNYVLII DEINRGNISK IFGELITLIE TSKRAGEPEA
LSVILPYSSS SFSVPNNLYL IGTMNTADRS LTALDTALRR RFEFEAMLPD ITVLKETVVK
GIDLPRLLQT LNDRIEVLYD REHTLGHAFF IPVVQVKEDE DLAFERLKRI MRNKVLPLLE
EYFYNDWQKI RMVLGDNQKS GNPQLQFVCE VKDQKQFADL FGNNGTEDLH DIGASFHLAP
EDDKVWDNPL AWQQIYAPHK GNPVSGE