Gene YpAngola_A3268 details

Gene Information

Locus tag: YpAngola_A3268
Symbol: gspE
ID: 5801745
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pestis Angola
Kingdom: Bacteria
Replicon accession: NC_010159
Strand:
Start bp: 3466340
End bp: 3467851
Gene Length: 1512 bp
Protein Length: 503 aa
Translation table: 11
GC content: 39%
IMG OID: 641341094
Product: general secretion pathway protein E
Protein accession: YP_001607616
Protein GI: 162418936
COG category: [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB
TIGRFAM ID: [TIGR02533] general secretory pathway protein E
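
The coordinates and lengths listed above can be cross-checked with a few lines of arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming its last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(3466340, 3467851)
print(length, protein_length(length))  # prints: 1512 503
```

Under these assumptions the result agrees with the listed gene length (1512 bp) and protein length (503 aa).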


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000698586
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000000000000127286
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGACAGATA TTGAGTCTGA GTTCCATACT TTGCCATTAA CGTTTAGTTG GTCAAAGAGT 
CATGGTGTGC TTATTCTGCC TATGGCACAG GGCAGTCAGT TGATTTGTCG TAAATCAGCG
ACATTGGAGG CTATACTTGA AGGGAATCGG GTGGCTCATA GTCCAATACT TTTTAATTTA
GTTACTGATG AAGAGTTTGA AAATAATTTA ACTGAACGCT ACCAAAATAA TTCGGCTGAC
CCATATCAGA CAATGAGCGT TATCAGCAAT GAGATCGACA TTTACTCTTT TGCTAACGAA
TTATCTGACG ATGATGATTT ATTGGATATA AATGATGAAG CACCTGTAAT AAAATTAATC
AATAGCATTC TGGTCGAAGC AATTAAAGAG TTAGCATCTG ATATACACAT AGAGTCTTTC
GATAAAAAAC TTACTGTAAG ATTTCGTATT GATGGTGTGT TAAGGAAAAT ATTGGAATTA
CAGCGCCATG TTGCACTATT ATTAGTGTCG CGTATAAAAG TCATGGCCAA ATTAGATATT
GCAGAAAAAC GTATTCCACA AGATGGGCGC ATAGCTCTAA ATTTAGCGGG TAGAGCATTG
GATGTTCGTG TTTCGGTATT ACCATCTAGC CATGGTGAAC GGGTCGTTAT GCGTTTGCTT
GACAAAAATA GCATTAAATT GGATTTACCC TCATTAGGTA TGTCGAAAGA TAACTGTATT
TCTATGAATA CGCAGGTACA TAAACCCCAT GGCATTATCT TGGTTGTTGG CCCAACGGGT
TCGGGTAAAA GCACTACGTT ATATGCTGCT TTAATGGGGA TTGACGCGAA TGAGAAAAAT
ATTATGACTG TAGAAGACCC TATTGAGTTC GACATACCTG GTATATCCCA AACGCAAGTA
AACCCAAAAA TTGAAATGAC ATTTGCTCGT AGCTTAAGGG CAATATTACG CCAAGATCCT
GACGTTATTC TTATCGGTGA AATTCGTGAT ATTGAAACGG CACAGATAGC GGTCCAAGCT
TCTCTAACCG GCCATTTAGT TCTATCTACA TTACATGCCA ACAGTGCGAT AGGTGCAGTG
ACCAGACTGA AGGATATGAA TGTAGAGGCA TTTATGCTTT CAAGTTCATT ATTGGCCATC
ATATCTCAGC GGTTGGTTAG AAGATTATGT ATAGCGTGTC GCCAAGAGTC GTCAGTGACG
GAGAAGGTAT TACAACGTCT GAATATTGAT GATAAATGCT CTTTGGTTGG TTCGGTGTAC
CGGGCTAAAG GTTGTAACAA ATGTAATCTT ACTGGTTATC GTGGACGTAT TGCTCTCCAT
GAATTTCTCG TAGTGGATAA CTTATTGCGG GGGGCTATTT ATAAAGGGCT AGGAGAGTTT
GAACTGGCTA AATTGGCCAG TGGGGCTATA AACAGCCTTC TATCTGATGG GATAAGCAAA
GTTATTGCAG GGTTAACCAC TATAGAGGAA CTGGTAAGAA TATGCCAAGA GGATGATAAT
GGCTGTATTT AA
 
Protein sequence
MTDIESEFHT LPLTFSWSKS HGVLILPMAQ GSQLICRKSA TLEAILEGNR VAHSPILFNL 
VTDEEFENNL TERYQNNSAD PYQTMSVISN EIDIYSFANE LSDDDDLLDI NDEAPVIKLI
NSILVEAIKE LASDIHIESF DKKLTVRFRI DGVLRKILEL QRHVALLLVS RIKVMAKLDI
AEKRIPQDGR IALNLAGRAL DVRVSVLPSS HGERVVMRLL DKNSIKLDLP SLGMSKDNCI
SMNTQVHKPH GIILVVGPTG SGKSTTLYAA LMGIDANEKN IMTVEDPIEF DIPGISQTQV
NPKIEMTFAR SLRAILRQDP DVILIGEIRD IETAQIAVQA SLTGHLVLST LHANSAIGAV
TRLKDMNVEA FMLSSSLLAI ISQRLVRRLC IACRQESSVT EKVLQRLNID DKCSLVGSVY
RAKGCNKCNL TGYRGRIALH EFLVVDNLLR GAIYKGLGEF ELAKLASGAI NSLLSDGISK
VIAGLTTIEE LVRICQEDDN GCI
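
The two sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout, compute GC content, and translate the gene sequence. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in permitted start codons), and the helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order. Assumption: table 11 uses the
# same codon-to-amino-acid assignments as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Codons containing characters other than A, C, G, T become 'X'.
    """
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE.get(seq[pos:pos + 3], "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
print(translate("ATGACAGATA TTGAGTCTGA GTTCCATACT"))  # prints: MTDIESEFHT
```

The translated fragment matches the first block of the listed protein sequence. Passing the full gene sequence to `gc_content` and `translate` gives a way to compare against the listed GC content and protein sequence.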