Gene EcSMS35_3667 details

Gene Information

Locus tag: EcSMS35_3667
Symbol: hofQ
ID: 6144186
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 3726254
End bp: 3727492
Gene Length: 1239 bp
Protein Length: 412 aa
Translation table: 11
GC content: 54%
IMG OID: 641618494
Product: outer membrane porin HofQ
Protein accession: YP_001745634
Protein GI: 170681657
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG4796] Type II secretory pathway, component HofQ
TIGRFAM ID: [TIGR02515] type IV pilus secretin (or competence protein) PilQ
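
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon; neither convention is stated in the record, and the function names are hypothetical.

```python
# Values copied from the record above.
START_BP = 3726254
END_BP = 3727492
GENE_LENGTH_BP = 1239
PROTEIN_LENGTH_AA = 412


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def coding_length_aa(gene_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming the last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both checks print True when the record is internally consistent.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_length_aa(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 3727492 - 3726254 + 1 gives 1239 bp, and 1239 / 3 - 1 gives 412 aa, which agrees with the listed values.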


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000000045876
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: 37
Fosmid unclonability p-value: 0.0733814
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGCAAT GGATAGCCGC ACTACTGTTG ATGCTGATAC CCGGCGTACA GGCGGCAAAG 
CCGCAAAAAG TGACACTGAT GGTGGATGAC GTTCCGGTAG CTCAGGTGTT GCAGGCGCTG
GCTGAACAGG AGAAGTTGAA CCTGGTGGTG TCGCCAGACG TCAGCGGTAC GGTGTCGTTA
CATCTAACAG ATGTTCCCTG GAAGCAGGCA CTACAAACTG TAGTGAAAAG CGCCGGACTG
ATAACGCGCC AGGAGGGCAA CATTCTCTCA GTGCATTCCA TTGCCTGGCA GAATGACAAT
ATCGCCCGTC AGGAAGCGGA GCAGGCGCGG GCGCAGGCAA ATCTGCCGCT GGAAAATCGC
AATATTACTC TGCAATACGC CGACGCCGGA GAGCTGGCGA AAGCTGGGGA GAAGCTACTG
AGTGCCAAAG GGAGTATGAC CGTCGATAAA CGCACCAATC GCCTTTTGCT GCGAGATAAC
AAAACGGCGT TAAGCGCCCT TGAACAGTGG GTAGCGCAAA TGGATCTGCC GGTCGGGCAA
GTTGAGCTGT CGGCGCATAT TGTCACCATT AATGAAAAAA GTTTGCGTGA GTTAGGCGTG
AAATGGACGC TGGCCGATGC GCAACAAGCT GGTGGCGTTG GGCAAGTCAC CACGCTTGGC
AGCGACCTCT CCGTAGCGAC GGCGACAACG CATGTCGGTT TTAACATTGG GCGTATCAAC
GGACGTTTGC TGGATCTTGA GCTTTCTGCG CTCGAACAAA AACAGCAGCT GGATATTATC
GCCAGTCCGC GTCTGCTGGC CTCACATCTT CAGCCTGCCA GCATTAAACA GGGGAGCGAA
ATTCCATATC AGGTTTCCAG CGGGGAAAGT GGCGCGACGT CGGTGGAATT TAAAGAGGCC
GTCCTGGGGA TGGAGGTCAC GCCCACGGTG TTACAAAAAG GTCGCATCCG GCTGAAATTA
CACATCAGCC AGAACGTTCC GGGGCAGGTG CTACAGCAGG CCGATGGCGA AGTGCTGGCG
ATTGATAAGC AGGAGATCGA AACGCAGGTC GAGGTCAAAA GCGGAGAAAC GTTGGCGCTG
GGCGGCATTT TTACCCGTAA AAATAAATCG GGTCAGGATA GCGTACCGTT GCTTGGCGAC
ATTCCCTGGT TCGGGCAATT ATTTCGTCAT GACGGAAAAG AAGATGAACG ACGCGAGTTA
GTGGTGTTTA TCACGCCACG ACTGGTTTCC AGTGAGTAA
 
Protein sequence
MKQWIAALLL MLIPGVQAAK PQKVTLMVDD VPVAQVLQAL AEQEKLNLVV SPDVSGTVSL 
HLTDVPWKQA LQTVVKSAGL ITRQEGNILS VHSIAWQNDN IARQEAEQAR AQANLPLENR
NITLQYADAG ELAKAGEKLL SAKGSMTVDK RTNRLLLRDN KTALSALEQW VAQMDLPVGQ
VELSAHIVTI NEKSLRELGV KWTLADAQQA GGVGQVTTLG SDLSVATATT HVGFNIGRIN
GRLLDLELSA LEQKQQLDII ASPRLLASHL QPASIKQGSE IPYQVSSGES GATSVEFKEA
VLGMEVTPTV LQKGRIRLKL HISQNVPGQV LQQADGEVLA IDKQEIETQV EVKSGETLAL
GGIFTRKNKS GQDSVPLLGD IPWFGQLFRH DGKEDERREL VVFITPRLVS SE
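
The relationship between the gene sequence and the protein sequence can be reproduced with a short translation routine. This is a sketch only, not the database's own pipeline: it builds the codon table by hand rather than using a bioinformatics library, and the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical. Translation table 11 assigns the same amino acids to codons as the standard code (it differs in the permitted start codons), so a single lookup table suffices here.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order: first base varies slowest,
# each position cycling through T, C, A, G. "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to display the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First 30 bases and last 39 bases of the gene sequence above.
    print(translate("ATGAAGCAAT GGATAGCCGC ACTACTGTTG"))             # MKQWIAALLL
    print(translate("GTGGTGTTTA TCACGCCACG ACTGGTTTCC AGTGAGTAA"))  # VVFITPRLVSSE
```

The two fragments translate to the first ten and last twelve residues of the protein sequence shown above. Passing the full gene sequence to `gc_fraction` would give a value to compare against the listed GC content.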