Gene YpAngola_A2284 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A2284 
Symbol 
ID5800754 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp2392363 
End bp2394387 
Gene Length2025 bp 
Protein Length674 aa 
Translation table11 
GC content47% 
IMG OID641340177 
Producthypothetical protein 
Protein accessionYP_001606722 
Protein GI162420199 
COG category[S] Function unknown 
COG ID[COG4458] Uncharacterized protein conserved in bacteria, putative virulence factor 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value0.869648 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGGAGG ATATTTATTC GTTGGATGAG CAACAGATTA CCGAACATCT ACGGCAGTTG 
ATGATGCATA GGCGGCCAAT CATTAACGCT GGGATCAATA GCGATGATGT GATTGCACTT
TGGGACTCCC TTGCCCACCA CGACACACAA CGCCAGAAAG AACTGGCGAC TCACTTCTGG
CCTACCGCGA TTGAACTGGC ACCGTATCTG AGTATTGATG ACCGGGCCAA ATTATTTTCG
TTATTGTGGG GAGAAAACGA TCCACTCACT GACGCTTATC GCCATTTTTC CTATATTTTA
CAGCACCTTA GCGGCACACG GAAATTATTA GCACCGCTAA GTGTTTTGGT TGACGATACC
CTGTTACCTG CCAATGGTGT CATGAATATT GCCACGCTTG GTGACTTGAA TACACCTGCA
GATAACCCTA TCCAAGTGCT TCCCTTGATC AACGGTCATA CCGCGAAATG CGTGACACTG
TCACAAGCGG AATTGACGCT GTTAGCGGTG GAACTGAAGA TCCCACTCGA TAAACCCGCG
CGTGAAAGTG CATTTGAATC CGTTGAATTG CTTAATTTTC CTGATGCTCG CGGTTCACAA
ACTATCCCGG CATTAATGGA AAACGCCGCT TATCCGCTGG CCTCGCTACT GTCACAGGCT
AAAAATGCCT ATTTACTCGA ACGTTATACC AACCAACAGC AAATCAACCT GTTATTGGTG
TGTACCGCCA CCGATCAGCG TTCAGAGATA AAAAGTACCA GCAAGGCATT GGATTATTGG
GTTAAACAGA CTCAGGGAGA GAGCGTTCAG ATACGCTCGC GTCGTAATCC GGGTTTAATC
TGGGCCTTAA CGCCCCATGA TCAGCGCATT ACCGCGAATC TACCTCTCTC CACCCCGACA
GACGCCCAAA CGCATGACGC GAATATCAAA AATTACGATG AAGCGGTACA ACGCTATGTT
GGTAATCCCG GTGACAGTTG GGGAACCCTA TTGGCATTGG ATGCACGTGG GGTTGAACGG
ATGATCTCGT ACCTATCGAA AGAAATCCTT CGGGATATCA AATCAGAACG TCTAACCGAA
CAACTTCACG AGTTACAACG GGAACTGACC AATAATCTGT TCACGGGTTG GTATCAACCT
TCGGTTACGG ACGAACGACA GCAAAAACAA CGTATTGTAG AAATATTACT TAAAGCGTTG
CAAACCCGTA CTGGGGTACA TGGTGAATTG TTGGAGCAAT TACTTCCTTC CCGCGATGAA
CTACGCCGCC TTTATCTGCA ACAGCACCTT TATCTATCAC CCCGCCTTTA TCAACAAGAG
AAAAAAGGGG TTGCGGATTT TTTCATACCA CTCGCCAGCA CTGAACCTTT CAGTATCGGT
ATCGATATCG ATCTGTTTAG TGACCTTCCC ACTCCCACTG AGCCATCGCT GACACCACAA
GCCAGACATG ATAACGATGA AGCGGAGTAT GCCGCTCATG TGCATTATTA TTGGATAAAT
CATCTACGCC AACTGCCCGA TAATACTGCG CTACTTGAAT TACTGGGCGT CACTAAACCC
ACGATAGAAC TGCTGGTCGC GGAATTCATC ACTGCCAGTA TTCGGCTGGA TATTGCTAGG
AATTTACGAC AGGCACTGGC TGATAACGAA CCGGCGGATC TGCATCGTGC AGCCAAAGCG
GACCGTCAGG TTTCTCGTGC GCTCACTGTA CTCGGTGATT TCATTGCCTG GTTAGGTTTT
TTGCAAATCA GCGAAGATAA GCGCCCGAAT AGCCGCATCA ACCGAGGGTA TAAAATTTTT
GCCCAGCCCC CTAAATCAGT ATCAACTTTA GGGGTCTCTC ACCGGTTAAC GCAACTGGCA
TTAACCCCAA CCAACAGTAC CGCATTTTAT ATCTATGATT GGTTAGTGGG TTTGGGTGAA
ATGATTATTC AAAACGTGGG ATATTCAGCC AGTAATGAGA TAAGCCCGGC ACAGCGGCAA
CAACTGGCGG CAATATTATC GGTGATAAAA CCCGCAAATG ATTAA
 
Protein sequence
MPEDIYSLDE QQITEHLRQL MMHRRPIINA GINSDDVIAL WDSLAHHDTQ RQKELATHFW 
PTAIELAPYL SIDDRAKLFS LLWGENDPLT DAYRHFSYIL QHLSGTRKLL APLSVLVDDT
LLPANGVMNI ATLGDLNTPA DNPIQVLPLI NGHTAKCVTL SQAELTLLAV ELKIPLDKPA
RESAFESVEL LNFPDARGSQ TIPALMENAA YPLASLLSQA KNAYLLERYT NQQQINLLLV
CTATDQRSEI KSTSKALDYW VKQTQGESVQ IRSRRNPGLI WALTPHDQRI TANLPLSTPT
DAQTHDANIK NYDEAVQRYV GNPGDSWGTL LALDARGVER MISYLSKEIL RDIKSERLTE
QLHELQRELT NNLFTGWYQP SVTDERQQKQ RIVEILLKAL QTRTGVHGEL LEQLLPSRDE
LRRLYLQQHL YLSPRLYQQE KKGVADFFIP LASTEPFSIG IDIDLFSDLP TPTEPSLTPQ
ARHDNDEAEY AAHVHYYWIN HLRQLPDNTA LLELLGVTKP TIELLVAEFI TASIRLDIAR
NLRQALADNE PADLHRAAKA DRQVSRALTV LGDFIAWLGF LQISEDKRPN SRINRGYKIF
AQPPKSVSTL GVSHRLTQLA LTPTNSTAFY IYDWLVGLGE MIIQNVGYSA SNEISPAQRQ
QLAAILSVIK PAND