Gene YpAngola_A2044 details


Gene Information

Locus tag: YpAngola_A2044
Symbol:
ID: 5800514
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pestis Angola
Kingdom: Bacteria
Replicon accession: NC_010159
Strand:
Start bp: 2132568
End bp: 2133746
Gene Length: 1179 bp
Protein Length: 392 aa
Translation table: 11
GC content: 51%
IMG OID: 641339964
Product: N-acetylneuraminic acid mutarotase
Protein accession: YP_001606514
Protein GI: 162420044
COG category: [S] Function unknown
COG ID: [COG3055] Uncharacterized protein conserved in bacteria
TIGRFAM ID: [TIGR03547] mutarotase, YjhT family
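
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive (the record does not state its coordinate convention) and that the final codon is a stop codon. The function names are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Number of amino acids for a CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length_bp = gene_length(2132568, 2133746)
print(length_bp)                  # 1179
print(protein_length(length_bp))  # 392
```

Both values agree with the Gene Length and Protein Length fields of this record.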


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.00301941
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value: 0.0393599
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCCAGT TATACCCACA ATATAAGAAA CAATTGACGC CAAAAATAGT GCTGTTTAGT 
GCGTTATCTC TATTGATGAT GGCATCGCTA CCTAATACCT ATGCGGAACA ATATCCTGAC
GTTCCGGTAC CATTTAAAAA CGGTACGGGT GGCAAAGTGG AGAACAGCTT ATATGTGGGG
CTGGGGAGCG CCGGTGTGTC GTGGTTCCGT CTGGATACAG ATAAAACCGG TGCTGGGTGG
CAGAAAGTGG CTAATTTCCC AGGCCAGCCT CGTGAAAAGG CTGTGACGGT GGTATTGGCA
GGGAAACTGT ACGTGTTCGG TGGGGTGGGG AAAACGAACG CCAACGATAC CCAAGTACGG
GCACTGGATG ATGCGTATCG TTTTGATCCA CAAACTAATC AATGGCAGCA ACTGGCTACC
CGCGCTCCCC GAGGTCTGGT GGGGACCGTG GCCACGACAC TGGACGGCTC CCAGGCGGTG
CTGTTAGGCG GTGTAAATAA AGCGATTTTT GACGGTTATT TTACTGATCT TGCGTCAGCC
GGAAGTGATG AAGTACGCAA GAGTGCGGTA ATAAACGCCT ATTTTAATCA GGCTCCGGCT
GATTATTTCT ACAACCGTGA TGTCCTTATC TATGATCCGC AAAAAAATCA GTGGAAAAGC
GGCGGTTTGC TGCCATTTTT AGGGACAGCC GGGTCAGCCA TCAGCCGCAT GGATAATCGC
TTGATATTAA TCAATGGCGA AATTAAACCT GGTTTACGCA CAGCGGCTGT ATGGCAGGGG
CTAATGCAAG GGAATGTACT GGAGTGGCAG CCACAACCGG ACTTAATCGG AGCTGAAACG
GGATCCGCAC AGGAAGGATT GGCTGGCGCA TTTTCCGGTA TCAGTCATAA AACAGTCTTA
GTCGCCGGAG GCGCTAATTT CCCTGGTGCC TGGAAGCAAT TTAATCGCGG TCACCTATAT
GCTCACCAAG GGCTGGAAAA ACAGTGGCAC CAGCAGGTGT ATGCGTTGGT CGATAATCAG
TGGAGAATCG CCGGGAAGTT GCCTCAACCG CTGGGTTATG GCGTCTCTAT TCAGGGGCCA
GATAAGGTCA TTTTGATTGG CGGCGAAACC ACCGGAGGTA CGGCCACTTC AGCGGTCACA
CAGCTATCTT GGCAGGGAGG AAAACTGCAT ATTGAGTAA
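
The GC content reported for this gene can be recomputed from the sequence above. The following is a minimal Python sketch, assuming the sequence is pasted as plain text in 10-base blocks as shown here. The function name `gc_percent` is hypothetical, and the way the database rounds its reported value is not stated in the record.

```python
def gc_percent(sequence):
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = bases.count("G") + bases.count("C")
    return 100.0 * gc_count / len(bases)


# The first line of the gene sequence above, used only as example input.
first_line = "ATGACCCAGT TATACCCACA ATATAAGAAA CAATTGACGC CAAAAATAGT GCTGTTTAGT"
print(round(gc_percent(first_line), 1))
```

Passing the full 1179 bp sequence instead of the first line gives the figure to compare with the GC content field.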
 
Protein sequence
MTQLYPQYKK QLTPKIVLFS ALSLLMMASL PNTYAEQYPD VPVPFKNGTG GKVENSLYVG 
LGSAGVSWFR LDTDKTGAGW QKVANFPGQP REKAVTVVLA GKLYVFGGVG KTNANDTQVR
ALDDAYRFDP QTNQWQQLAT RAPRGLVGTV ATTLDGSQAV LLGGVNKAIF DGYFTDLASA
GSDEVRKSAV INAYFNQAPA DYFYNRDVLI YDPQKNQWKS GGLLPFLGTA GSAISRMDNR
LILINGEIKP GLRTAAVWQG LMQGNVLEWQ PQPDLIGAET GSAQEGLAGA FSGISHKTVL
VAGGANFPGA WKQFNRGHLY AHQGLEKQWH QQVYALVDNQ WRIAGKLPQP LGYGVSIQGP
DKVILIGGET TGGTATSAVT QLSWQGGKLH IE
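
The protein sequence corresponds to the gene sequence translated with translation table 11. The following is a minimal Python sketch of that translation, assuming the conventional TCAG codon ordering for the table. For elongation codons, table 11 assigns the same amino acids as the standard code; it differs in the set of permitted start codons, which does not matter here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    dna = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGACCCAGT TATACCCACA ATATAAGAAA"))  # MTQLYPQYKK
```

The printed peptide matches the first ten residues of the protein sequence listed above.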