Gene EcolC_0322 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_0322 
SymbolhofQ 
ID6065266 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp374304 
End bp375464 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content54% 
IMG OID641599721 
Productouter membrane porin HofQ 
Protein accessionYP_001723327 
Protein GI170018373 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG4796] Type II secretory pathway, component HofQ 
TIGRFAM ID[TIGR02515] type IV pilus secretin (or competence protein) PilQ 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000700239 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.175294 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGGATG ACGTTCCGGT AGCTCAGGTG TTGCAGGCGC TGGCTGAACA GGAGAAGTTG 
AACCTGGTCG TGTCGCCAGA CGTCAGCGGT ACGGTGTCGT TACATCTAAC AGATGTTCCC
TGGAAGCAGG CACTACAAAC TGTAGTGAAA AGCGCCGGAC TGATAACGCG CCAGGAGGGC
AACATTCTCT CAGTGCATTC CATTGCCTGG CAGAATAACA ATATCGCCCG CCAGGAGGCG
GAGCAGGCGC GGGCGCAGGC AAATCTGCCG CTGGAAAATC GCAGTATAAC CCTGCAATAC
GCCGACGCGG GAGAACTGGC GAAAGCGGGG GAGAAGCTAC TGAGTGCCAA AGGGAGTATG
ACCGTCGATA AACGCACCAA TCGCCTTTTG CTACGAGATA ACAAAACGGC GTTAAGCGCA
CTTGAACAGT GGGTAGCGCA AATGGATCTG CCGGTCGGGC AGGTTGAGCT GTCGGCGCAT
ATTGTCACCA TTAATGAAAA AAGTTTGCGT GAGTTAGGCG TGAAATGGAC GCTGGCCGAT
GCGCAACACG CTGGTGGCGT TGGGCAAGTC ACCACGCTTG GTAGCGACCT CTCCGTAGCG
ACGGCGACAA CGCATGTCGG TTTTAACATT GGGCGCATCA ACGGACGTTT GCTGGATCTT
GAGCTTTCTG CGCTCGAACA AAAACAGCAG CTGGATATTA TCGCTAGTCC GCGTCTGCTG
GCCTCACATC TTCAGCCTGC CAGCATTAAA CAGGGGAGCG AAATTCCATA TCAGGTTTCC
AGCGGGGAAA GTGGCGCGAC GTCGGTGGAA TTTAAAGAGG CCGTCCTGGG GATGGAGGTC
ACGCCCACGG TGTTACAAAA AGGTCGCATC CGGCTGAAAT TACACATCAG CCAGAACGTT
CCGGGGCAGG TGCTACAGCA GGCTGATGGC GAAGTGCTGG CGATTGATAA GCAGGAGATC
GAAACGCAGG TCGAGGTCAA AAGCGGAGAA ACGTTGGCGC TGGGCGGCAT TTTTACCCGT
AAAAATAAAT CGGGTCAGGA TAGCGTACCG TTGCTTGGCG ACATTCCCTT GTTCGGGCAA
TTATTTCGTC ATGACGGAAA AGAAGATGAA CGACGCGAGT TAGTGGTGTT TATCACGCCA
CGACTGGTTT CCAGTGAGTA A
 
Protein sequence
MVDDVPVAQV LQALAEQEKL NLVVSPDVSG TVSLHLTDVP WKQALQTVVK SAGLITRQEG 
NILSVHSIAW QNNNIARQEA EQARAQANLP LENRSITLQY ADAGELAKAG EKLLSAKGSM
TVDKRTNRLL LRDNKTALSA LEQWVAQMDL PVGQVELSAH IVTINEKSLR ELGVKWTLAD
AQHAGGVGQV TTLGSDLSVA TATTHVGFNI GRINGRLLDL ELSALEQKQQ LDIIASPRLL
ASHLQPASIK QGSEIPYQVS SGESGATSVE FKEAVLGMEV TPTVLQKGRI RLKLHISQNV
PGQVLQQADG EVLAIDKQEI ETQVEVKSGE TLALGGIFTR KNKSGQDSVP LLGDIPLFGQ
LFRHDGKEDE RRELVVFITP RLVSSE