Gene YpAngola_A1961 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A1961 
Symbol 
ID5800431 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp2041111 
End bp2042538 
Gene Length1428 bp 
Protein Length475 aa 
Translation table11 
GC content49% 
IMG OID641339884 
Producthypothetical protein 
Protein accessionYP_001606434 
Protein GI162419810 
COG category[S] Function unknown 
COG ID[COG5383] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value0.588491 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCCCCC AACAGTTTGT GCATCCCGAT GAGATACGGG CCAAATTCTC CAGCGCTATG 
TCGGATATGT ATCAGACCGA AGTCCCTCTA TATAGCACAT TATTACGGCT CGTTGCGGAC
ACGAATACAC AAGAGATGGT GCAAGATCAG AAACTTACCC GTCATTTGCA GCAAACTGGG
GAAATTGAAC GCCTAACGAT GGAAAGACAT GGCGCCATCC GCGTCGGGAC AGCCGAAGAA
CTCAAGATGC TGCGGCGCTT GTTTGCAGTC ATGGGAATGG TGCCGGTCGG TTATTATGAT
TTAGCACCAG CGGGTGTCCC GGTTCACTCC ACCGCCTTCC GTGCGGTGCA TGAAACCTCA
CTGCAAGCCT GCCCTTTCAG AGTCTTCACC TCATTACTGC GTCTGGAGTT GATTGAGCAA
CCTACACTAC GGCAACTGGC TGCGGATATT CTGGCGAAAA GAACAATTTT CACCCCACAG
GCGATTAAAC TAATTGTTCA GCATGAAACA TCGGGGGGGC TCAATCGCCA TCAAGCCGAT
GATTTCATTG CTCAGTCGCT GGAAACATTT CGCTGGCATC ATCAGGCAAC GGTCAGCACT
GAAACCTACC AACAGCTACA CGATCAACAC CGTCTGATTG CTGATGTGGT CGCTTTCAAA
GGTCCGCATA TTAACCACCT AACTCCACGG ACCTTAAATA TCGACGTCGT ACAAACAGCA
ATGGCAGAGC ATAATATGAT GCCCAAAGCC GTTATCGAAG GGCCACCCCC CCGCCACTGC
CCAATACTGT TACGCCAAAC CAGCTTCAAG GCGCTGGAGG AAAAAATCGC GTTTGTATCA
AATGGTGGGC AAATAATACC AGGCCACCAT ACGGCCCGCT TCGGGGAGAT AGAGCAACGA
GGAGCCGCTT TGACGGCCAA AGGCCGTAAT CTGTATGACC ACTTACTGCA AAGCGCACAG
GATCAATTGA ACGTTCCTGT AAATGAGAAC AATGCGGCGC AATATAGCGC GATCCTGAGC
GAGAAATTTA GTCAATTCCC TGATGATTAC CCAACAATGC GAGCAGAGAA ACTCGCCTTC
TTCCGCTATT TTCCTACAGA AAAAGGCCTT ATCACTGCAT CAATACAGGA AATACAGGAA
ATACAGGAAA TACAGGAAAT GACATTAGAC GAACTTATTG ATAACGGCTT TATTCAATAT
GAACCGTTGG TTTATGAAGA TTTCCTGCCA GTCAGCGCCG CTGGAATATT CCAGTCAAAC
TTAGGCGAGA AAGGGCAGAG TCACTTTACC GGGCACTCCA ATAAAGCAGA CTTCCAGCGG
GATTTGGGGA TTGCGGTTAT TGATGAACTG CAACTCTACG AAGCAACCCA GCAACGTTCC
GTTGCTGAAT GTGCAGCCGC CCTCAAACTA ACGTTGTTAA GCCAATAA
 
Protein sequence
MPPQQFVHPD EIRAKFSSAM SDMYQTEVPL YSTLLRLVAD TNTQEMVQDQ KLTRHLQQTG 
EIERLTMERH GAIRVGTAEE LKMLRRLFAV MGMVPVGYYD LAPAGVPVHS TAFRAVHETS
LQACPFRVFT SLLRLELIEQ PTLRQLAADI LAKRTIFTPQ AIKLIVQHET SGGLNRHQAD
DFIAQSLETF RWHHQATVST ETYQQLHDQH RLIADVVAFK GPHINHLTPR TLNIDVVQTA
MAEHNMMPKA VIEGPPPRHC PILLRQTSFK ALEEKIAFVS NGGQIIPGHH TARFGEIEQR
GAALTAKGRN LYDHLLQSAQ DQLNVPVNEN NAAQYSAILS EKFSQFPDDY PTMRAEKLAF
FRYFPTEKGL ITASIQEIQE IQEIQEMTLD ELIDNGFIQY EPLVYEDFLP VSAAGIFQSN
LGEKGQSHFT GHSNKADFQR DLGIAVIDEL QLYEATQQRS VAECAAALKL TLLSQ