Gene B21_02787 details


Gene Information

Locus tag: B21_02787
Symbol: pulE
ID: 8113498
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21
Kingdom: Bacteria
Replicon accession: NC_012892
Strand:
Start bp: 2967992
End bp: 2969485
Gene Length: 1494 bp
Protein Length: 497 aa
Translation table: 11
GC content: 58%
IMG OID: 644848977
Product: hypothetical protein
Protein accession: YP_003000550
Protein GI: 251786246
COG category: [N] Cell motility
              [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB
TIGRFAM ID: [TIGR02533] general secretory pathway protein E
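
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2967992
END_BP = 2969485
GENE_LENGTH_BP = 1494
PROTEIN_LENGTH_AA = 497


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one terminal stop codon (assumed present)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for the values listed in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 2969485 - 2967992 + 1 gives 1494 bp, and 1494 / 3 - 1 gives 497 aa, which agrees with the listed gene and protein lengths.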


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000039107
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTGCCTG TAGCACAGGA AACCACTGCT AACACCGTGC GTCTGCCCTA CAGTTTCAGC 
CGTCGGTTTA GCCTGGTGGC ATGGTGCGAA GCGTCGCTGG AGATCCTCCA TGTTCATCCG
TTGTCGCTCT CTGTTTTGCA GGAGCTGCAG CGGGGGCTGA ACGCGCCCTT TACGCTGCGG
CAAATCGACG AAGCCGAATT TGAACAGCGG CTGAATGCGG TCTGGCAGCG GGACTCTTCC
GAGGCTCGCC AACTGATGGA AGATCTCGGT TCTGCCGAGG ACTTTTTTAC CCTCGCAGAA
GAACTGCCGG AAACGGAAGA TCTGCTGGAA AGTGACGACG ATGCGCCGAT CATCAAACTG
ATCAACGCCA TGCTGGCAGA GGCGATTAAA GAAGGCGCTT CGGATATCCA CATCGAGACG
TTTGAAAAGA GTCTGGTGAT CCGTTTTCGT GTTGACGGCA CATTACATGA AATGTTGCGC
CCCGGTCGCA AGCTGGCCTC GCTGCTGGTC TCGCGTATCA AGGTGATGGC GCGTCTGGAT
ATCGCCGAAA AGCGCGTACC ACAGGATGGC CGTATTGCGC TGCTGCTGGG CGGTCGGGCG
ATCGACGTGC GCGTCTCCAC CATGCCTTCC GCCTGGGGCG AGCGCGTGGT GCTGCGACTG
CTGGACAAAA ACCAGGCCCG CCTGACGCTG GAGCGTCTGG GTTTAAGTCA CGAACTGACT
GCGCAGTTGC GCCAGCTGTT ACACAAACCG CACGGCATCT TTCTGGTGAC GGGGCCGACA
GGTTCCGGCA AAAGCACCAC GCTGTACGCC GGATTGCAGG AGCTGAACAA CCATTCGCGC
AACATTCTCA CGGTTGAAGA TCCCATCGAA TACATGATTG AAGGGATCGG TCAGACACAG
GTTAACACCC GCGTCGGCAT GACCTTCGCC CGTGGCCTGC GCGCGATTTT GCGTCAGGAC
CCGGATGTAG TGATGGTCGG TGAAATCCGC GATACCGAAA CCGCAGAAAT CGCCGTCCAG
GCTTCACTTA CCGGACACCT GGTCCTTTCC ACGCTGCATA CCAACACGGC GGTGGGGGCG
ATTACGCGTT TGCAGGATAT GGGGGTAGAG CCTTTCCTGC TCTCTTCCAG TCTGACGGGC
GTGATGGCGC AGCGACTGGT CCGTACGCTG TGCCCCGACT GCCGCCAGGC CCTGCCTGCC
ACTGACGAAG AAAAACGCCT GCTGGGAATT ACCGACGCCC GTACCGTCAC TCTGTACCAT
CCACAGGGCT GTCCCGCCTG TAATCACAAA GGTTTTCGCG GGCGTACTGC CATCCATGAG
TTGATCGTGG TGGACGCCAC ATTGCGTGAT TTGATCCACC GTCAGGCCGG GGAACTGGAG
CTGGAACGTT ATGTCCGGCA ACACTCTGCG GGCATCCGCA GCAATGGCAT TGAGAAAGTG
CTCGCCGGAG AAACCTCTCT CGATGAAGTT CTGCGGGTAA CCATGGAGGC GTAA
 
Protein sequence
MVPVAQETTA NTVRLPYSFS RRFSLVAWCE ASLEILHVHP LSLSVLQELQ RGLNAPFTLR 
QIDEAEFEQR LNAVWQRDSS EARQLMEDLG SAEDFFTLAE ELPETEDLLE SDDDAPIIKL
INAMLAEAIK EGASDIHIET FEKSLVIRFR VDGTLHEMLR PGRKLASLLV SRIKVMARLD
IAEKRVPQDG RIALLLGGRA IDVRVSTMPS AWGERVVLRL LDKNQARLTL ERLGLSHELT
AQLRQLLHKP HGIFLVTGPT GSGKSTTLYA GLQELNNHSR NILTVEDPIE YMIEGIGQTQ
VNTRVGMTFA RGLRAILRQD PDVVMVGEIR DTETAEIAVQ ASLTGHLVLS TLHTNTAVGA
ITRLQDMGVE PFLLSSSLTG VMAQRLVRTL CPDCRQALPA TDEEKRLLGI TDARTVTLYH
PQGCPACNHK GFRGRTAIHE LIVVDATLRD LIHRQAGELE LERYVRQHSA GIRSNGIEKV
LAGETSLDEV LRVTMEA
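
The gene and protein sequences above are printed in blocks of ten characters. The following sketch shows one way to strip that layout, translate the nucleotide sequence, and compute a GC fraction. It is a minimal illustration and not the database's own method. It assumes the amino acid assignments of translation table 11, which are the same as those of the standard code, and it does not handle alternative start codons, which is harmless here because the gene begins with ATG. All names in the code are hypothetical.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First three blocks of the gene sequence listed above.
    print(translate("ATGGTGCCTG TAGCACAGGA AACCACTGCT"))
```

Applied to the first thirty bases of the gene sequence, `translate` yields MVPVAQETTA, which matches the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.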