Gene YpAngola_A0004 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A0004 
SymbolyieM 
ID5798466 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp2525 
End bp3991 
Gene Length1467 bp 
Protein Length488 aa 
Translation table11 
GC content48% 
IMG OID641338031 
Producthypothetical protein 
Protein accessionYP_001604649 
Protein GI162419510 
COG category[R] General function prediction only 
COG ID[COG2425] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value0.729255 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCAGTC TGGCAACACT GGATATGCTG CTGTCGATAA GTGAAGGTGA ACTTATCGAA 
GAGATGGTGG TAGGTTTACT GGCAGCACCT CAGTTGGCGA TTTTTTTTGA AAAATTCCCG
CGGATTAAAC GGGCATTAAT GAAAGATATT CCCGGCTGGA AACAAAATTT GCAACAACGG
ATCCGTGAAG CTAGCGTTCC GCCAGGGCTG GCGAATGAAT TTTCCTTATA TCAACAGAGT
CTGTTGGAGG ATAGCCCACA GTTTTATGCG CATCTGCCCG ATATCGTGGC TCAGTTACAA
GATTTACATT CCCCGTTTGC AACGCAGGCT AAAACATTAG TGCAAACCGC AGACCTGGCA
AAAAACCCGC CAGGCGGAGA TAGCCTGCAA ACGCTCTTTT TACAACGTTG GCGTGTCAGT
CTGATCCTAC AAACCATCAC AATTCATCAT CAGTTGTTAG AACAAGAACG GGAACAGCTC
TTAGCTGAGC TACAGCGAAG ATTAGCGCTC AGTGGTGCAC TGGAACCCAT TCTCACCACC
AATGACAATG CCGCTGGGCG TCTTTGGGAT ATGAGTCAGG GGCATTTGCA ACGTGGCGAT
TATCAATTAT TGCTGCAATA TGGCGACTTT CTGCAACAGC AACCTGAATT AATACGTTTA
GCCGAACAGC TCGGCCGCAG CCGCTCAGCC AAAGCACAAC CGGCACCTGA CGCCCGCTAT
GAGCCTTACA CCGTTATGGT ACGCCAGCCA GATTCGGTGC CTGAGGAGGT CAGTGGTATT
CATCAGAGTA ATGACATCCT GCGATTATTG CCTACTGAAT TGGTGATGTT AGGGATGAGT
GAGTTGGAGT TTGAGTTTTA TCGCCGCTTA CTGGAACGGC GTTTGCTCAC ATACCGATTA
CAAGGAGACA ACTGGCAGGA AAAAACGCAA CAGAGACCCG TCAGCCTTAA ACAAAATGAT
GAGCAACCCC GTGGGCCATT TATTGTCTGC GTCGATACCT CAGGCTCGAT GGGAGGCTTC
AACGAGCAAT GCGCCAAAGC ATTTTGTCTG GCCTTACTCC GTATCGCGTT AGCCGATAAT
CGCCGCTGTT ACATCATGTT ATTTGCTACT GAAATAATTC ACTACGAGCT ATCAGCTGAC
AATGGTATTG AGCAAGCCAT ACGTTTCCTC AACCAACATT TTCGTGGTGG TACGGATCTG
GCGGCATGCT TAGCCAACAC GTTAAATAAA ATGGAAGACA GAGAATGGTA TGACGCAGAC
GCGGTGATCA TTTCAGACTT TATCGCCCAG CGTTTGCCAG AAGAGTTAGT CAGAAAGATA
AAAATCCAGC AGCAGGCCCA CCAGCACCGT TTCCATGCTG TTGCCATGTC CGCTTATGGC
AAACCAGGGA TCATGCGTAT TTTTGATCAT ATCTGGCGTT TTGATACAAG TTTAAAAAGC
CGTTTGATAC GCCGCTGGAA ACGATAA
 
Protein sequence
MLSLATLDML LSISEGELIE EMVVGLLAAP QLAIFFEKFP RIKRALMKDI PGWKQNLQQR 
IREASVPPGL ANEFSLYQQS LLEDSPQFYA HLPDIVAQLQ DLHSPFATQA KTLVQTADLA
KNPPGGDSLQ TLFLQRWRVS LILQTITIHH QLLEQEREQL LAELQRRLAL SGALEPILTT
NDNAAGRLWD MSQGHLQRGD YQLLLQYGDF LQQQPELIRL AEQLGRSRSA KAQPAPDARY
EPYTVMVRQP DSVPEEVSGI HQSNDILRLL PTELVMLGMS ELEFEFYRRL LERRLLTYRL
QGDNWQEKTQ QRPVSLKQND EQPRGPFIVC VDTSGSMGGF NEQCAKAFCL ALLRIALADN
RRCYIMLFAT EIIHYELSAD NGIEQAIRFL NQHFRGGTDL AACLANTLNK MEDREWYDAD
AVIISDFIAQ RLPEELVRKI KIQQQAHQHR FHAVAMSAYG KPGIMRIFDH IWRFDTSLKS
RLIRRWKR