Gene B21_03128 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_03128 
SymbolgspE 
ID8115700 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp3317783 
End bp3319264 
Gene Length1482 bp 
Protein Length493 aa 
Translation table11 
GC content52% 
IMG OID644849311 
Producthypothetical protein 
Protein accessionYP_003000884 
Protein GI251786580 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB 
TIGRFAM ID[TIGR02533] general secretory pathway protein E 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.182977 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAATTC ACTCACCGTA CCCCGCCAGT TGGGCGCTGG CACAACGAAT TGGTTATCTC 
TATTCAGAGG GCGAGATTAT TTATCTCGCC GATACGCCAT TCGAGCGGTT ACTCGATATT
CAACGTCAGG TTGGCCAGTG CCAGACCATG ACCAGCTTGT CACAGGCTGA TTTTGAAGCT
CGGCTGGAAG CGGTATTCCA TCAGAATACC GGTGAGTCGC AACAGATTGC GCAGGATATC
GATCAATCCG TCGATCTTCT CTCGCTTTCG GAAGAGATGC CCGCAAATGA AGATCTCCTG
AATGAAGATT CAGCGGCACC GGTTATCCGC TTGATCAATG CGATTTTGAG TGAGGCCATC
AAAGAAACCG CCTCTGATAT CCACATTGAA ACCTATGAAA AAACAATGTC GATCCGTTTT
CGCATCGACG GCGTTTTGCG GACAATTTTA CAGCCAAACA AAAAACTGGC GGCACTGCTT
ATCTCCCGAA TTAAGGTCAT GGCTCGTCTT GATATCGCCG AAAAACGTAT TCCACAGGAT
GGAAGAATTA GTTTGCGTAT CGGGCGACGT AACATAGATG TCCGCGTATC CACACTGCCG
TCCATCTATG GTGAACGCGC CGTACTCCGC CTGCTGGATA AAAACAGCCT CCAGCTTTCA
TTGAACAACC TGGGGATGAC GGCAGCGGAT AAGCAGGATT TAGAAAATCT CATTCAGCTT
CCGCACGGTA TTATCCTGGT GACAGGGCCG ACAGGCTCCG GTAAAAGCAC CACGCTCTAC
GCCATCCTTT CGGCGCTGAA TACTCCCGGC CGCAATATTC TGACGGTAGA AGATCCCGTG
GAATATGAGC TGGAAGGCAT TGGGCAAACG CAGGTGAATA CCCGTGTGGA TATGTCTTTC
GCTCGCGGCC TGCGCGCCAT ACTTCGCCAG GACCCGGATG TCGTCATGGT GGGGGAAATT
CGTGATACAG AAACCGCGCA GATTGCGGTT CAGGCCTCGC TCACCGGCCA TCTGGTACTC
TCAACACTCC ACACTAACAG TGCATCAGGC GCAGTGACCC GGCTCCGCGA CATGGGCTTC
GAATCATTCC TGCTTTCGTC TTCCCTGGCA GGGATTATCG CGCAACGTCT GGTTCGTCGC
CTGTGTCCGC AATGCCGACA ATTCACGCCC GTATCACCCC AACAAGCGCA GATGTTTAAA
TATCATCAGC TCGCGGTGAC AACAATTGGC ACTCCCGTAG GCTGCCCTCA TTGCCATCAA
TCCGGCTATC AGGGGCGCAT GGCGATCCAC GAAATGATGG TGGTGACGCC GGAATTACGG
GCCGCTATTC ATGAAAATGT GGATGAACAA GCACTGGAGC GACTAGTCCG GCAACAACAC
AAGGCCTTAA TCAAAAATGG CCTGCAAAAA GTGATAAGCG GTGACACCTC TTGGGATGAG
GTTATGCGCG TCGCCAGTGC CACGCTGGAG AGCGAAGCAT GA
 
Protein sequence
MRIHSPYPAS WALAQRIGYL YSEGEIIYLA DTPFERLLDI QRQVGQCQTM TSLSQADFEA 
RLEAVFHQNT GESQQIAQDI DQSVDLLSLS EEMPANEDLL NEDSAAPVIR LINAILSEAI
KETASDIHIE TYEKTMSIRF RIDGVLRTIL QPNKKLAALL ISRIKVMARL DIAEKRIPQD
GRISLRIGRR NIDVRVSTLP SIYGERAVLR LLDKNSLQLS LNNLGMTAAD KQDLENLIQL
PHGIILVTGP TGSGKSTTLY AILSALNTPG RNILTVEDPV EYELEGIGQT QVNTRVDMSF
ARGLRAILRQ DPDVVMVGEI RDTETAQIAV QASLTGHLVL STLHTNSASG AVTRLRDMGF
ESFLLSSSLA GIIAQRLVRR LCPQCRQFTP VSPQQAQMFK YHQLAVTTIG TPVGCPHCHQ
SGYQGRMAIH EMMVVTPELR AAIHENVDEQ ALERLVRQQH KALIKNGLQK VISGDTSWDE
VMRVASATLE SEA