Gene EcHS_A3521 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A3521 
SymbolgspF1 
ID5593610 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp3504160 
End bp3505356 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content51% 
IMG OID640922638 
Productgeneral secretion pathway protein F 
Protein accessionYP_001460119 
Protein GI157162801 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG1459] Type II secretory pathway, component PulF 
TIGRFAM ID[TIGR02120] general secretion pathway protein F 


Plasmid Coverage information

Num covering plasmid clones38 
Plasmid unclonability p-value0.373913 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATTATC GCTATCGCGC CATGACCCAG GATGGTCAAA AATTGCAAGG GATCATTGAT 
GCTAACGATG AACGTCAGGC ACGACTGCGG CTGCGTGAAG AAGGGCTTTT CCTGCTGGAT
ATTCGCCCCC AAAAAAGTTC GGGAGTAAAA ACACGTCGCC CGAGGATCAG CCATAGTGAA
CTGACGCTTT TCACCCGGCA GTTGGCAACC TTAAGCGCAG CGGCATTACC CCTGGAAGAG
AGCCTTGCCG TAATCGGTCA ACAAAGCAGT AATAAACGAC TGGGTGACGT GTTAAATCAG
GTACGCAGCG CCATCCTTGA AGGGCATCCC CTTTCCGATG CATTACAGCA TTTTCCCACG
CTTTTCGATT CGCTCTATCG TACCCTGGTA AAAGCGGGCG AAAAGAGCGG GCTGCTGGCC
CCGGTGTTGG AAAAGCTGGC TGATTACAAT GAAAACCGGC AGAAAATCCG CAGCAAGCTC
ATTCAGTCAC TGATCTACCC CTGTATGCTC ACTACGGTGG CGATTGGGGT CGTGATTATT
CTCCTCACTG CTGTCGTGCC CAAAATTACC GAACAGTTCG TGCATATGAA GCAGCAACTG
CCGCTGAGTA CACGCATTCT TTTAGGTCTG AGCGACACGT TGCAACGTAC CGGCCCGACA
TTATTAGCGA CAGTGTTTAT TGTCGCTGTA GGTTTCTGGC TCTGGTTAAA ACGCGGCAAT
AACCGCCACC GTTTTCATGC CATGTTGCTG CGCGTTGCGC TCATCGGCCC GCTGATTTGC
GCCATTAACA GCGCACGCTA TCTCCGCACT TTAAGTATTT TGCAATCCAG CGGCGTCCCT
CTGCTGGATG GGATGAATTT GTCCACCGAA AGCCTCAACA ACCTCGAAAT TCGCCAGCGT
CTGGCAAATG CGGCAGAGAA CGTTCGCCAG GGTAACAGCA TTCATCTTTC GCTGGAACAA
ACCGCAATTT TCCCGCCGAT GATGCTCTAC ATGGTGGCCT CTGGCGAAAA AAGCGGGCAG
CTCGGCACAT TAATGGTCAG AGCCGCAGAT AACCAGGAGA CACTCCAACA AAATCGGATC
GCCTTAACGC TCTCCATCTT CGAGCCAGCA CTCATTATTA CGATGGCACT GATCGTCCTG
TTTATTGTCG TGTCGGTACT CCAACCTCTT CTTCAACTTA ACTCAATGAT TAATTAA
 
Protein sequence
MNYRYRAMTQ DGQKLQGIID ANDERQARLR LREEGLFLLD IRPQKSSGVK TRRPRISHSE 
LTLFTRQLAT LSAAALPLEE SLAVIGQQSS NKRLGDVLNQ VRSAILEGHP LSDALQHFPT
LFDSLYRTLV KAGEKSGLLA PVLEKLADYN ENRQKIRSKL IQSLIYPCML TTVAIGVVII
LLTAVVPKIT EQFVHMKQQL PLSTRILLGL SDTLQRTGPT LLATVFIVAV GFWLWLKRGN
NRHRFHAMLL RVALIGPLIC AINSARYLRT LSILQSSGVP LLDGMNLSTE SLNNLEIRQR
LANAAENVRQ GNSIHLSLEQ TAIFPPMMLY MVASGEKSGQ LGTLMVRAAD NQETLQQNRI
ALTLSIFEPA LIITMALIVL FIVVSVLQPL LQLNSMIN