Gene EcolC_0387 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_0387 
Symbol 
ID6066774 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp435110 
End bp436591 
Gene Length1482 bp 
Protein Length493 aa 
Translation table11 
GC content52% 
IMG OID641599786 
Productgeneral secretory pathway protein E 
Protein accessionYP_001723392 
Protein GI170018438 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB 
TIGRFAM ID[TIGR02533] general secretory pathway protein E 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0402565 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAATTC ACTCACCGTA CCCCGCCAGT TGGGCGCTGG CACAACGAAT TGGTTATCTC 
TATTCAGAGG GCGAGATTAT TTATCTCGCC GATACGCCAT TCGAGCGGTT ACTCGATATT
CAACGTCAGG TTGGCCAGTG CCAGACCATG ACCAGCTTGT CACAGGCTGA TTTTGAAGCT
CGGCTGGAAG CGGTATTCCA TCAGAATACC GGTGAGTCGC AACAGATTGC GCAGGATATC
GATCAATCCG TCGATCTTCT CTCGCTTTCG GAAGAGATGC CCGCAAATGA AGATCTCCTG
AATGAAGATT CAGCGGCACC GGTTATCCGC TTGATCAATG CGATTTTGAG TGAGGCCATC
AAAGAAACCG CCTCTGATAT CCACATTGAA ACCTATGAAA AAACAATGTC GATCCGTTTT
CGCATCGACG GCGTTTTGCG GACAATTTTA CAGCCAAACA AAAAACTGGC GGCACTGCTT
ATCTCCCGAA TTAAGGTCAT GGCTCGTCTT GATATCGCCG AAAAACGTAT TCCACAGGAT
GGAAGAATTA GTTTGCGTAT CGGGCGACGT AACATAGATG TCCGCGTATC CACACTGCCG
TCCATCTATG GTGAACGCGC CGTACTCCGC CTGCTGGATA AAAACAGCCT CCAGCTTTCA
TTGAACAACC TGGGGATGAC GGCAGCGGAT AAGCAGGATT TAGAAAATCT CATTCAGCTT
CCGCACGGTA TTATCCTGGT GACAGGGCCG ACAGGCTCCG GTAAAAGCAC CACGCTCTAC
GCCATCCTTT CGGCGCTGAA TACTCCCGGC CGCAATATTC TGACGGTAGA AGATCCCGTG
GAATATGAGC TGGAAGGCAT TGGGCAAACG CAGGTGAATA CCCGTGTGGA TATGTCTTTC
GCTCGCGGCC TGCGCGCCAT ACTTCGCCAG GACCCGGATG TCGTCATGGT GGGGGAAATT
CGTGATACAG AAACCGCGCA GATTGCGGTT CAGGCCTCGC TCACCGGCCA TCTGGTACTC
TCAACACTCC ACACTAACAG TGCATCAGGC GCAGTGACCC GGCTCCGCGA CATGGGCGTC
GAATCATTCC TGCTTTCGTC TTCCCTGGCA GGGATTATCG CGCAACGTCT GGTTCGTCGC
CTGTGTCCGC AATGCCGACA ATTCACGCCC GTATCACCCC AACAAGCGCA GATGTTTAAA
TATCATCAGC TCGCGGTGAC AACAATTGGC ACTCCCGTAG GCTGCCCTCA TTGCCATCAA
TCCGGCTATC AGGGGCGCAT GGCGATCCAC GAAATGATGG TGGTGACGCC GGAATTACGG
GCCGCTATTC ATGAAAATGT GGATGAACAA GCACTGGAGC GACTAGTCCG GCAACAACAC
AAGGCCTTAA TCAAAAATGG CCTGCAAAAA GTGATAAGCG GTGACACCTC CTGGGATGAG
GTTATGCGCG TCGCCAGTGC CACGCTGGAG AGCGAAGCAT GA
 
Protein sequence
MRIHSPYPAS WALAQRIGYL YSEGEIIYLA DTPFERLLDI QRQVGQCQTM TSLSQADFEA 
RLEAVFHQNT GESQQIAQDI DQSVDLLSLS EEMPANEDLL NEDSAAPVIR LINAILSEAI
KETASDIHIE TYEKTMSIRF RIDGVLRTIL QPNKKLAALL ISRIKVMARL DIAEKRIPQD
GRISLRIGRR NIDVRVSTLP SIYGERAVLR LLDKNSLQLS LNNLGMTAAD KQDLENLIQL
PHGIILVTGP TGSGKSTTLY AILSALNTPG RNILTVEDPV EYELEGIGQT QVNTRVDMSF
ARGLRAILRQ DPDVVMVGEI RDTETAQIAV QASLTGHLVL STLHTNSASG AVTRLRDMGV
ESFLLSSSLA GIIAQRLVRR LCPQCRQFTP VSPQQAQMFK YHQLAVTTIG TPVGCPHCHQ
SGYQGRMAIH EMMVVTPELR AAIHENVDEQ ALERLVRQQH KALIKNGLQK VISGDTSWDE
VMRVASATLE SEA