Gene B21_03129 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_03129 
SymbolgspF 
ID8115701 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp3319261 
End bp3320457 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content51% 
IMG OID644849312 
Producthypothetical protein 
Protein accessionYP_003000885 
Protein GI251786581 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG1459] Type II secretory pathway, component PulF 
TIGRFAM ID[TIGR02120] general secretion pathway protein F 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATTATC GCTATCGCGC CATGACCCAG GATGGTCAAA AATTGCAAGG GATCATTGAT 
GCTAACGATG AACGTCAGGC ACGACTGCGG CTGCGTGAAG AAGGGCTTTT CCTGCTGGAT
ATTCGCCCCC AAAAAAGTTC GGGAGTAAAA ACACGTCGCC CGAGGATCAG CCATAGTGAA
CTGACGCTTT TCACCCGGCA GTTGGCAACC TTAAGCGCAG CGGCATTACC CCTGGAAGAG
AGCCTTGCCG TAATCGGTCA ACAAAGCAGT AATAAACGAC TGGGTGACGT GTTAAATCAG
GTACGCAGCG CCATCCTTGA AGGGCATCCC CTTTCCGATG CATTACAGCA TTTTCCCACG
CTTTTCGATT CGCTCTATCG TACCCTGGTA AAAGCGGGCG AAAAGAGCGG GCTGCTGGCC
CCGGTGTTGG AAAAGCTGGC TGATTACAAT GAAAACCGGC AGAAAATCCG CAGCAAGCTC
ATTCAGTCAC TGATCTACCC CTGTATGCTC ACTACGGTGG CGATTGGGGT CGTGATTATT
CTCCTCACTG CTGTCGTGCC CAAAATTACC GAACAGTTCG TGCATATGAA GCAGCAACTG
CCGCTGAGTA CACGCATTCT TTTAGGTCTG AGCGACACGT TGCAACGTAC CGGCCCGACA
TTATTAGCGA CAGTGTTTAT TGTCGCTGTA GGTTTCTGGC TCTGGTTAAA ACGCGGCAAT
AACCGCCACC GTTTTCATGC CATGTTGCTG CGCGTTGCGC TCATCGGCCC GCTGATTTGC
GCCATTAACA GCGCACGCTA TCTCCGCACT TTAAGTATTT TGCAATCCAG CGGCGTCCCT
CTGCTGGATG GGATGAATTT GTCCACCGAA AGCCTCAACA ACCTCGAAAT TCGCCAGCGT
CTGGCAAATG CGGCAGAGAA CGTTCGCCAG GGTAACAGCA TTCATCTTTC GCTGGAACAA
ACCGCAATTT TCCCGCCGAT GATGCTCTAC ATGGTGGCCT CTGGCGAAAA AAGCGGGCAG
CTCGGCACAT TAATGGTCAG AGCCGCAGAT AACCAGGAGA CACTCCAACA AAATCGGATC
GCCTTAACGC TCTCCATCTT CGAGCCAGCA CTCATTATTA CGATGGCACT GATCGTCCTG
TTTATTGTCG TGTCGGTACT CCAACCTCTT CTTCAACTTA ACTCAATGAT TAATTAA
 
Protein sequence
MNYRYRAMTQ DGQKLQGIID ANDERQARLR LREEGLFLLD IRPQKSSGVK TRRPRISHSE 
LTLFTRQLAT LSAAALPLEE SLAVIGQQSS NKRLGDVLNQ VRSAILEGHP LSDALQHFPT
LFDSLYRTLV KAGEKSGLLA PVLEKLADYN ENRQKIRSKL IQSLIYPCML TTVAIGVVII
LLTAVVPKIT EQFVHMKQQL PLSTRILLGL SDTLQRTGPT LLATVFIVAV GFWLWLKRGN
NRHRFHAMLL RVALIGPLIC AINSARYLRT LSILQSSGVP LLDGMNLSTE SLNNLEIRQR
LANAAENVRQ GNSIHLSLEQ TAIFPPMMLY MVASGEKSGQ LGTLMVRAAD NQETLQQNRI
ALTLSIFEPA LIITMALIVL FIVVSVLQPL LQLNSMIN