Gene EcolC_0386 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_0386 
Symbol 
ID6066755 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp433917 
End bp435113 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content51% 
IMG OID641599785 
Productgeneral secretion pathway protein F 
Protein accessionYP_001723391 
Protein GI170018437 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG1459] Type II secretory pathway, component PulF 
TIGRFAM ID[TIGR02120] general secretion pathway protein F 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.120721 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATTATC GCTATCGCGC CATGACCCAG GATGGTCAAA AATTGCAAGG GATCATTGAT 
GCTAACGATG AACGTCAGGC ACGACTGCGG CTGCGTGAAG AAGGGCTTTT CCTGCTGGAT
ATTCGCCCCC AAAAAAGTTC GGGAGTAAAA ACACGTCGCC CGAGGATCAG CCATAGTGAA
CTGACGCTTT TCACCCGGCA GTTGGCAACC TTAAGCGCAG CGGCATTACC CCTGGAAGAG
AGCCTTGCCG TAATCGGTCA ACAAAGCAGT AATAAACGAC TGGGTGACGT GTTAAATCAG
GTACGCAGCG CCATCCTTGA AGGGCATCCC CTTTCCGATG CATTACAGCA TTTTCCCACG
CTTTTCGATT CGCTCTATCG TACCCTGGTA AAAGCGGGCG AAAAGAGCGG GCTGCTGGCC
CCGGTGTTGG AAAAGCTGGC TGATTACAAT GAAAACCGGC AGAAAATCCG CAGCAAGCTC
ATTCAGTCAC TGATCTACCC CTGTATGCTC ACTACGGTGG CGATTGGGGT CGTGATTATT
CTCCTCACTG CTGTCGTGCC CAAAATTACC GAACAGTTCG TGCATATGAA GCAGCAACTG
CCGCTGAGTA CACGCATTCT TTTAGGTCTG AGCGACACGT TGCAACGTAC CGGCCCGACA
TTATTAGCGA CAGTGTTTAT TGTCGCTGTA GGTTTCTGGC TCTGGTTAAA ACGCGGCAAT
AACCGCCACC GTTTTCATGC CATGTTGCTG CGCGTTGCGC TCATCGGCCC GCTGATTTGC
GCCATTAACA GCGCACGCTA TCTCCGCACT TTAAGTATTT TGCAATCCAG CGGCGTCCCT
CTGCTGGATG GGATGAATTT GTCCACCGAA AGCCTCAACA ACCTTGAAAT TCGCCAGCGT
CTGGCAAATG CGGCAGAGAA CGTTCGCCAG GGTAACAGCA TTCATCTTTC GCTGGAACAA
ACCGCAATTT TCCCGCCGAT GATGCTCTAC ATGGTGGCCT CTGGCGAAAA AAGCGGGCAG
CTCGGCACAT TAATGGTCAG AGCCGCAGAT AACCAGGAGA CACTCCAACA AAATCGGATC
GCCTTAACGC TCTCCATCTT CGAGCCAGCA CTCATTATTA CGATGGCACT GATCGTCCTG
TTTATTGTCG TGTCGGTACT CCAACCTCTT CTTCAACTTA ACTCAATGAT TAATTAA
 
Protein sequence
MNYRYRAMTQ DGQKLQGIID ANDERQARLR LREEGLFLLD IRPQKSSGVK TRRPRISHSE 
LTLFTRQLAT LSAAALPLEE SLAVIGQQSS NKRLGDVLNQ VRSAILEGHP LSDALQHFPT
LFDSLYRTLV KAGEKSGLLA PVLEKLADYN ENRQKIRSKL IQSLIYPCML TTVAIGVVII
LLTAVVPKIT EQFVHMKQQL PLSTRILLGL SDTLQRTGPT LLATVFIVAV GFWLWLKRGN
NRHRFHAMLL RVALIGPLIC AINSARYLRT LSILQSSGVP LLDGMNLSTE SLNNLEIRQR
LANAAENVRQ GNSIHLSLEQ TAIFPPMMLY MVASGEKSGQ LGTLMVRAAD NQETLQQNRI
ALTLSIFEPA LIITMALIVL FIVVSVLQPL LQLNSMIN