Gene EcHS_A3136 details

Gene Information

Locus tag: EcHS_A3136
Symbol: gspF2
ID: 5595255
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 3144205
End bp: 3145428
Gene Length: 1224 bp
Protein Length: 407 aa
Translation table: 11
GC content: 58%
IMG OID: 640922255
Product: general secretion pathway protein F
Protein accession: YP_001459754
Protein GI: 157162436
COG category: [N] Cell motility
              [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG1459] Type II secretory pathway, component PulF
TIGRFAM ID: [TIGR02120] general secretion pathway protein F
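
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length_bp = gene_length(3144205, 3145428)
print(length_bp, protein_length(length_bp))
```

Under those assumptions the results agree with the Gene Length and Protein Length fields of the record.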


Plasmid Coverage information

Num covering plasmid clones: 83
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCACTGT TTTACTATCA GGCGCTGGAG CGTAATGGTC GCAAAACCAA AGGTATGATT 
GAGGCGGATT CCGCGCGTCA TGCCCGCCAG TTGTTGCGCG GTAAAGAGCT TATCCCCGTG
CATATTGAAG CCCGGATGAA TACTTCGTCA GGGGGGATGT TGCAGCGTCG GCGGCACGCA
CATCGTCGCG TGGCGGCGGC AGATCTTGCG CTGTTCACGC GCCAACTGGC AACGCTGGTA
CAGGCAGCAA TGCCGCTGGA AACCTGCTTA CAGGCGGTCA GTGAGCAAAG TGAAAAACTG
CATGTAAAAA GCCTCGGAAT GGCGCTGCGC AGCCGGATTC AGGAAGGTTA CACCCTGTCG
GACAGCCTGC GCGAACATCC CCGCGTCTTT GATTCTCTGT TTTGTTCGAT GGTTGCTGCC
GGAGAAAAAT CCGGGCATCT CGACGTGGTG CTCAATCGCC TGGCAGATTA CACCGAACAG
CGACAGCGCC TGAAATCACG TCTATTGCAG GCCATGCTCT ATCCGCTGGT TCTGCTGGTG
GTGGCAACGG GCGTAGTCAC TATTTTGCTG ACGGCAGTGG TGCCGAAAAT CATCGAACAG
TTCGATCACC TCGGACACGC GCTGCCCGCC ACCACCCGCG CGCTTATCGC CATGAGCGAC
GCGTTACAGG CCAGCGGCGT TTACTGGCTG GCGGGATTGC TGGCGCTTCT GGTGCTGGGG
CAACGGCTAC TTAAAAATCC TGCTATGCGC CTGCGCCGGG ATAAAACCTT GCTGCGTCTG
CCCGTGACGG GCCGTGTTGC GCGCGGGCTG AATACGGCGC GTTTTTCCCG CACATTAAGC
ATCCTCACCG CCAGCAGTGT TCCGCTACTG GAAGGCATTC AGACCGCTGC CGCCGTGTCG
GCAAATCGCT ATGTCGAACA ACAACTACTG CTGGCGGCAG ATCGCGTCCG CGAAGGAAGC
AGTCTGCGTG CCGCGCTGGC GGAGTTGCGC CTGTTCCCCC CGATGATGCT GTACATGATC
GCCTCCGGCG AACAGAGCGG CGAACTGGAA ACCATGCTTG AGCAGGCCGC TGTTAACCAG
GAACGGGAAT TTGATACCCA GGTGGGGCTG GCGTTAGGGC TGTTTGAACC GGCGCTGGTG
GTGATGATGG CGGGCGTGGT GCTGTTTATC GTCATCGCCA TCCTCGAGCC GATGCTGCAA
CTGAACAATA TGGTTGGAAT GTAA
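
The record reports a GC content of 58% and a gene length of 1224 bp. A minimal sketch of how both figures could be recomputed from the sequence text follows; the helper names `clean` and `gc_content` are hypothetical, and the example runs only on the last line of the gene sequence, not the whole gene.

```python
def clean(seq: str) -> str:
    """Remove the whitespace that separates the 10-base blocks and uppercase."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


# Last line of the gene sequence above, used as a small example input.
last_line = "CTGAACAATA TGGTTGGAAT GTAA"
print(len(clean(last_line)), round(100 * gc_content(last_line)))
```

Pasting the full gene sequence into `gc_content` is one way to check the reported percentage.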
 
Protein sequence
MALFYYQALE RNGRKTKGMI EADSARHARQ LLRGKELIPV HIEARMNTSS GGMLQRRRHA 
HRRVAAADLA LFTRQLATLV QAAMPLETCL QAVSEQSEKL HVKSLGMALR SRIQEGYTLS
DSLREHPRVF DSLFCSMVAA GEKSGHLDVV LNRLADYTEQ RQRLKSRLLQ AMLYPLVLLV
VATGVVTILL TAVVPKIIEQ FDHLGHALPA TTRALIAMSD ALQASGVYWL AGLLALLVLG
QRLLKNPAMR LRRDKTLLRL PVTGRVARGL NTARFSRTLS ILTASSVPLL EGIQTAAAVS
ANRYVEQQLL LAADRVREGS SLRAALAELR LFPPMMLYMI ASGEQSGELE TMLEQAAVNQ
EREFDTQVGL ALGLFEPALV VMMAGVVLFI VIAILEPMLQ LNNMVGM
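
The protein sequence should correspond to the gene sequence translated with translation table 11. The sketch below shows one way to check that. It assumes table 11 assigns amino acids exactly as the standard code does (differing only in permitted start codons), and `translate` is a hypothetical helper, not part of the source database.

```python
from itertools import product

# Codon table built in TCAG order; amino-acid assignments are those of the
# standard code, which translation table 11 is assumed to share.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(seq: str) -> str:
    """Translate a DNA sequence codon by codon, stopping at the first stop codon."""
    s = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence above.
first_line = "ATGGCACTGT TTTACTATCA GGCGCTGGAG CGTAATGGTC GCAAAACCAA AGGTATGATT"
last_line = "CTGAACAATA TGGTTGGAAT GTAA"
print(translate(first_line))
print(translate(last_line))
```

The two printed fragments can be compared with the start and end of the protein sequence listed above.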