Gene YpAngola_A2952 details

Gene Information

Locus tag: YpAngola_A2952
Symbol:
ID: 5801424
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pestis Angola
Kingdom: Bacteria
Replicon accession: NC_010159
Strand:
Start bp: 3112747
End bp: 3114354
Gene Length: 1608 bp
Protein Length: 535 aa
Translation table: 11
GC content: 51%
IMG OID: 641340798
Product: hypothetical protein
Protein accession: YP_001607328
Protein GI: 162418688
COG category: [S] Function unknown
COG ID: [COG3455] Uncharacterized protein conserved in bacteria
TIGRFAM ID: [TIGR03349] type IV / VI secretion system protein, DotU family
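
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption; the record does not state its coordinate convention) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Coordinates as listed in the record (assumed 1-based, inclusive).
start_bp = 3112747
end_bp = 3114354

# Inclusive span: both endpoints count as bases of the gene.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the last codon is assumed
# to be a stop codon, which does not contribute an amino acid.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(f"Gene length: {gene_length} bp (remainder {remainder})")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions the computed values agree with the listed Gene Length (1608 bp) and Protein Length (535 aa).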


Plasmid Coverage information

Num covering plasmid clones: 38
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value: 0.314306
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACGAAT TTGAACGCCA GATCCGTGCA GCCATTTCCG CAGCACGCAA TGGCGCAAAA 
CATGCGGAAC AGTCACTGAC TACACCAATG TGGCAAGCCA AAAGCACCGT AGCCTCATTG
GGTGGGATTG TCCCTAGAAG TGGCTCTTCG TCAACGTCAC AGGCGGAGAA CTATAAGGAA
GGTCTCGCGG ACCAGGCTGC CTCGGGCAAC AACATGGCGC GCACGAGTGC GCCACCGGTC
ACTTTGTATC AGCAACAGCC AAATGCGAAT GACAGCTATC CAAACGGGAA TAACAACAAT
CCAAACGGGG ATAACAACAA TCCAAACGGG AGTAACAACA ATATAGCGAG AGTACAGCGT
ATGCCGCATG GCATTTCCAG GGGCTTATAT GAGCGCCCTG GGATGTTATT GGGTGCCTGG
GATAACGCCT ATATTGCTGC GGCTATGCCT TTGCTGCTGG TGGAAAATAT TCGTAGCTGG
CCGACGCGTA ACGCCGCAGA GGTCAGGCCA CCGATTGTGC GGGAATTACA ATATTTCCAG
CAACATTTGC AGAAAAAGAA CTACCCGCAA GAAGACATTA ACCACCTGTC TTACCTGCTA
TGTACCTATA TCGATGGCAT TTTTAACGGG CTGCAAACCC CAGACTCCTA CAACCAAAGT
CTGTTAGTGG AGTTTCACCG TGATGCCTGG GGGGGTGAGG ACTGCTTCGA ACATCTGCGG
GTCTATATGA ACTCGCCGAA ACAGTACCGG GAAGTTCTGG AATTCTATGA TCTGATTATG
TGCCTTGGTT TTGACGGTAA ATACCAGATG ATAGAGCATG GTGCGGTTCT GCTGATGGAT
TTACGCAGCC GTCTCCACAC GCAACTCTAC GGTCAGGACG CCACACAATC TTTGGCTATC
GCGCAAGCGG TCAAAGGTTC TCCGCGTCGC CAATATATCA AGGCGCTGAA AATCTTCACC
TATGGTTTCG CACTGTGCCT TTGTGCTTAC GGCGTCACGG CGTGGTATCT GCACCAGCAA
TCCCAACAGA TCCGCAGCAA CATTCTGACG TGGGTACTGC CTGAACCGCG GAAAATCAAC
ATCATGGAGA CCTTGCCGAA TCCGCTATCC AACATCCTGA ATGAAGGGTG GCTGGAGGTC
AGGAAAGATC CGCGTGGATG GCTATTAATC TTCACCTCCG ACGGCGCGTT CCGCACGGGT
GAAGCGACCC TCTCGGAAGA GTTTATCAAC AAGAAGAATA TCGAACGTCT TGGGCTGGCA
TTAGCCCCAT GGCCGGGAGA TATCGAGGTT ATTGGTCATA CGGATAACAA ACCGTTCCGT
AGCACTTCCG GTAACAACAA CCTCAAACTT TCCGCGGCCA GAGCATCGGT GGTGGCAGAT
AAACTGCGGG AATCCACTCA AATCAACGAA ACCCATCAGC GAGAAATAAG TGCCATCGGA
CGGGGGGAGA GCGATCCTTT AGCTGACAAT GCAACGGAAG AAGGGCGCAA GCGTAACCGG
CGTGTGGATA TCCTATGGAA AATTGGTCAG CGCGATGCCG ATAAGGCCAT GAAGCAATTC
CTGGAGAACC CAACACCAGA AGTTCAAGGA ACGAATACCC AACAATAG
 
Protein sequence
MNEFERQIRA AISAARNGAK HAEQSLTTPM WQAKSTVASL GGIVPRSGSS STSQAENYKE 
GLADQAASGN NMARTSAPPV TLYQQQPNAN DSYPNGNNNN PNGDNNNPNG SNNNIARVQR
MPHGISRGLY ERPGMLLGAW DNAYIAAAMP LLLVENIRSW PTRNAAEVRP PIVRELQYFQ
QHLQKKNYPQ EDINHLSYLL CTYIDGIFNG LQTPDSYNQS LLVEFHRDAW GGEDCFEHLR
VYMNSPKQYR EVLEFYDLIM CLGFDGKYQM IEHGAVLLMD LRSRLHTQLY GQDATQSLAI
AQAVKGSPRR QYIKALKIFT YGFALCLCAY GVTAWYLHQQ SQQIRSNILT WVLPEPRKIN
IMETLPNPLS NILNEGWLEV RKDPRGWLLI FTSDGAFRTG EATLSEEFIN KKNIERLGLA
LAPWPGDIEV IGHTDNKPFR STSGNNNLKL SAARASVVAD KLRESTQINE THQREISAIG
RGESDPLADN ATEEGRKRNR RVDILWKIGQ RDADKAMKQF LENPTPEVQG TNTQQ
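
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the codon-to-amino-acid assignments of translation table 11, which are the same as those of the standard code, and it translates only the first 60-base line of the gene sequence shown above. The names `CODON_TABLE` and `translate` are hypothetical helpers written for this illustration, not part of any database tool.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G order.
# The amino acid string is the one used for translation table 11
# (identical codon assignments to the standard code); '*' marks a stop.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, copied from the record.
first_block = "ATGAACGAAT TTGAACGCCA GATCCGTGCA GCCATTTCCG CAGCACGCAA TGGCGCAAAA"
print(translate(first_block))
```

The result can be compared with the first 20 residues of the protein sequence listed above (MNEFERQIRA AISAARNGAK).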