Gene YpAngola_A1042 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A1042 
Symbol 
ID5799505 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp1066802 
End bp1068358 
Gene Length1557 bp 
Protein Length518 aa 
Translation table11 
GC content46% 
IMG OID641339030 
Producthypothetical protein 
Protein accessionYP_001605602 
Protein GI162420162 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value0.0653884 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATAAGGC CCCATCAAAT GGCTGAATTT ATAGTGTCCT CATTTGAAAC AATTAGTGAT 
GAATTACACC TTCTCTGTCG GCGTTATCGG GCAGTAGCAC TCACTCTTGA TGGTAAAAGC
CTTTCGATAG CTTCAGCCCA GACCGTTGAT GAAGCGTTAT TGACCGCGTT GCGTTTCACA
TGTGGCCGTC AAGTCAGAGT GGAATATTGG CCCGAAGCCA AGATTGAACA ATCATTGTAT
TTGGGGAGCC CGACGAAAAA TAACCTAAAC AGTAGTATTA ACAACGGGCT AAATAGCGCG
TTGTTATCCC AAAAGGGAGG TGATCCGTCG CGAGTAAAAC CGCACAATCA GCAGCAACAG
ACAATAGATG ATACTGTGTT TGATAATGAG AGTGACACAC CCGTCATTCA ATTCATAACC
CAGACGCTAA GTCTAGCCAT TCAAAAACGC GCTTCAGATA TCCATTTTGA GCCTTATCAA
CACCACTATC GTGTTCGTTT AAGAATTGAT GGTGTGTTGC ATGAATTCAC CCCACCCGAG
GCCGAATGGG CAGCTCGGAT TAGCAGTTGC CTGAAGGTCA TGGCGAAATT AAATATTGCT
GAACGGCGAT TACCACAAGA TGGTCAACTA ACCTTACCCT TTGGTGATTC ACACTATTCA
ATGCGGATAG CGACTCTCCC TACGCAATAT GGTGAAAAAG TGGTATTGCG TATTCTTCAA
ATACAACAGC AAACCACGTT AGAAAAGCTG GGGATGACGG ATGCGGCACT GAAACAATTA
ACACAGGCAT TATCAGCACC ACAAGGGCTG ATTCTGGTCA CCGGCCCTAC CGGTAGTGGC
AAAACCATTA CGTTATATTG CAGTTTAGCG CGGCTGAATC AGACACAAAG AAACATCTGT
AGCGTCGAAG ATCCTATTGA GATCCCCGTC AATGGCATTA ACCAAACCCA GGTAAACAGC
AAGATCGGTC TGGATTTCTC TCGAATACTA CGAGCCATCC TACGACAGGA CCCTGATGTC
ATTATGGTTG GTGAAATTCG TGATAATGAA ACCGCCAGTA TCGCAGTTAA CGCGGCCCAG
ACCGGGCATT TGGTCCTATC GACGCTGCAC ACTAACTCAA CAGCAGAAAC GCTGATACGC
ATGGCACAAA TGGGAATAGA ACGCCATTTA ATCGCCTCAA GTCTAAAACT CGTCATTGCT
CAACGCTTGG TGCGCCGCCT ATGTTTACAT TGCCGCCAGG CTGCATCTCA CCCCTTTATT
CCACCAGCTC ACATAAGGTC TGGTCCGATC CAACACTATC TAGCTGTAGG TTGTGAGCAT
TGTTGTACGG GTTATTATGG CCGAACGGGT ATTTATGAAA TGCTGAGTGT AACGCCGCAG
ATTCAGCAAG CCATACTCAA TAATGCCAGC CCTGTAAAAC TGGTACAAAT TGCCCAGAAG
CAAGAACAAA CAGCCTTACT CTGCTCAGGT TTAGCTTTAA TCGAAAAAGG CATCACTACC
CTTAGTGAAA TTAATCGTGT TGTGGGCTTC GTAGCAGAAA CAGAGGTCAC CTCTTGA
 
Protein sequence
MIRPHQMAEF IVSSFETISD ELHLLCRRYR AVALTLDGKS LSIASAQTVD EALLTALRFT 
CGRQVRVEYW PEAKIEQSLY LGSPTKNNLN SSINNGLNSA LLSQKGGDPS RVKPHNQQQQ
TIDDTVFDNE SDTPVIQFIT QTLSLAIQKR ASDIHFEPYQ HHYRVRLRID GVLHEFTPPE
AEWAARISSC LKVMAKLNIA ERRLPQDGQL TLPFGDSHYS MRIATLPTQY GEKVVLRILQ
IQQQTTLEKL GMTDAALKQL TQALSAPQGL ILVTGPTGSG KTITLYCSLA RLNQTQRNIC
SVEDPIEIPV NGINQTQVNS KIGLDFSRIL RAILRQDPDV IMVGEIRDNE TASIAVNAAQ
TGHLVLSTLH TNSTAETLIR MAQMGIERHL IASSLKLVIA QRLVRRLCLH CRQAASHPFI
PPAHIRSGPI QHYLAVGCEH CCTGYYGRTG IYEMLSVTPQ IQQAILNNAS PVKLVQIAQK
QEQTALLCSG LALIEKGITT LSEINRVVGF VAETEVTS