Gene YpAngola_A1948 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A1948 
Symbol 
ID5800418 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp2022520 
End bp2024286 
Gene Length1767 bp 
Protein Length588 aa 
Translation table11 
GC content50% 
IMG OID641339872 
Producthypothetical protein 
Protein accessionYP_001606422 
Protein GI162419790 
COG category[S] Function unknown 
COG ID[COG1944] Uncharacterized conserved protein 
TIGRFAM ID[TIGR00702] uncharacterized domain 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.00321676 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.0629552 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCAAA CTTTTATTCC CGGCAAAGAC GCTGCGCTGG AAGACTCTAT CTCCCGTTTT 
CAGCAAAAAC TGAGCGACCT CGGTTTTAAT ATTGAAGAAG CCTCTTGGCT TAACCCTGTT
CCTCACGTTT GGTCGGTACA TATCCGCGAC CGCGACTGCC CGTTGTGCTT TACCAACGGC
AAAGGCGCCA GTAAAAAAGC GGCTCTGGCA TCAGCTCTGG GCGAATATTT TGAGCGTTTA
TCCACCAACT ATTTCTTTGC TGACTTCTAT CTGGGCAAAG CCATCGCTGA GGGTGATTTC
GTTCATTACC CAAACGAGAA GTGGTTTCCG ATCCCGGCGG ATAATCTGCT ACCGGAAGGC
ATTCTGGATG AGCGTCTGCT GGCGTTTTAC GACCCAGAGC AAGAGTTGGT CGCCAGTGAT
TTGGTTGATT TGCAGTCGGG TAATGCCAAG CGTGGTATCT GTTCACTGCC CTTTACTCGT
CAATCAGATT TAGAAACTGT CTATATCCCG ATGAATATTA TCGGCAATCT GTATGTTTCA
AACGGTATGT CAGCAGGGAA TACCGCTAAC GAAGCCCGCG TACAGGCACT GTCTGAAGTT
TTTGAGCGTT ATGTGAAAAA CCGCATTATT GCGGAGTCCA TCAGCTTGCC AGAGATCCCC
GCCGAGGTAT TGAATCGCTA TCCAGGCGTG GTAGAAGCTA TCACTAAGCT GGAAGAAGAA
GGTTTCCCTA TCCTGTCTTA CGACGCCTCT CTGGGGGGCG CTTATCCGGT TATCTGTGTC
GTGCTATTTA ACCCGTCAAA CGGTACCTGT TTCGCGTCAT TCGGCGCACA TCCCGATTTT
GGTGTGGCAT TAGAACGCAC GGTGACGGAG TTATTACAGG GCCGCAGTCT GAAAGATCTT
GATGTCTTTA CTGCACCAAC CTTTGATGAC GAAGAAGTGG CAGAACATAC CAACCTGGAA
ACCCACTTTA TCGATTCAAG CGGTCTAATT AGTTGGGATA TGTTTAAGCA GGATGCAGAT
TACCCATTTG TTGACTGGAG CTTCAAGGGC ACCACAGAAG AAGAGTTCGC GACATTGATG
GCTATCTTCC AACAAGAAGG TGCCGAAGTG TACATCGCAG ATTATGAGCA CTTAGGTGTC
TACGCCTGCC GGATCTTGGT GCCGGGGATG TCTGATATCT ACCCTGCCGA AGACTTGCTG
ATGGCGAACA ACACCATGGG TGTACATCTG CGTGATACTT TACTGGCCTT GCCAGATACA
GACTGGCAGC CAGCACAGTA CTTGGAATTG ATTCAGCAGA TTGATGATGA AGGCTTGGAT
GATTTTGCCC GTGTGCGTGA ACTGCTGGGT ATCGCCTCCG GTAAAGATAA TGGCTGGTAC
ACCCTGCGGG TTGGCGAACT GAAATCAATG CTGGCACTGG CGGGCGGTGA TCTCGAGCAA
GCGCTGATCT GGGTTGAATG GACACAAGAC TTTAACGCTT CCGTCTTTAC AGCAAAACAG
GCCAACTATT ACCGTTGCCT GCAAACGCTA CTGCTCTTGA ACCAAGAGCC TGATCGTGAC
CCGATGCAAT ATTACAATGC ATTCGTCAAG ATGTATGGTC AGGAAGCGGT CGATATTGCC
TCAGCGGCCT TATCGGGTGA AATTCGCTTC AACGGCCTGT TCAGTGTTGA CGAAGATCTG
AAAGCATTAC CCGCTCATCA GGCTTTACTC GGCGCTTATG CTAAGTTGCA GGCGGCTAAA
CGTCGCCATT GGGCTAAAAG CGAGTAA
 
Protein sequence
MTQTFIPGKD AALEDSISRF QQKLSDLGFN IEEASWLNPV PHVWSVHIRD RDCPLCFTNG 
KGASKKAALA SALGEYFERL STNYFFADFY LGKAIAEGDF VHYPNEKWFP IPADNLLPEG
ILDERLLAFY DPEQELVASD LVDLQSGNAK RGICSLPFTR QSDLETVYIP MNIIGNLYVS
NGMSAGNTAN EARVQALSEV FERYVKNRII AESISLPEIP AEVLNRYPGV VEAITKLEEE
GFPILSYDAS LGGAYPVICV VLFNPSNGTC FASFGAHPDF GVALERTVTE LLQGRSLKDL
DVFTAPTFDD EEVAEHTNLE THFIDSSGLI SWDMFKQDAD YPFVDWSFKG TTEEEFATLM
AIFQQEGAEV YIADYEHLGV YACRILVPGM SDIYPAEDLL MANNTMGVHL RDTLLALPDT
DWQPAQYLEL IQQIDDEGLD DFARVRELLG IASGKDNGWY TLRVGELKSM LALAGGDLEQ
ALIWVEWTQD FNASVFTAKQ ANYYRCLQTL LLLNQEPDRD PMQYYNAFVK MYGQEAVDIA
SAALSGEIRF NGLFSVDEDL KALPAHQALL GAYAKLQAAK RRHWAKSE