Gene EcolC_3552 details

Gene Information

Locus tag: EcolC_3552
Symbol:
ID: 6067403
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 3882482
End bp: 3883867
Gene Length: 1386 bp
Protein Length: 461 aa
Translation table: 11
GC content: 53%
IMG OID: 641602969
Product: hypothetical protein
Protein accession: YP_001726493
Protein GI: 170021539
COG category: [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB
TIGRFAM ID:
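
The coordinates and lengths listed above are mutually consistent, which a short calculation can confirm. The sketch below is only an illustration: the helper name cds_lengths is hypothetical, and it assumes 1-based inclusive coordinates and an unspliced CDS that ends in a single stop codon.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon,
    which is not translated into an amino acid.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon except the final stop codon contributes one residue.
    return gene_len, gene_len // 3 - 1


gene_len, protein_len = cds_lengths(3882482, 3883867)
print(gene_len, protein_len)  # 1386 461
```

Both values match the Gene Length and Protein Length fields of this record.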


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000180133
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAATATTC CACAGCTCAC TGCCCTGTGT CTGCGTTATC ATGGAGTCTT GCTGGATGCC 
AGCGAAGAGG TGGTTCATGT TGCGGTAGTC GATGCACCTT CGCATGAGCT ACTGGACGCA
TTGCATTTCG CTACCACCAA ACGTATTGAG ATCACCTGCT GGACGCGCCA ACAAATGGAA
GGTCACGCCA GTCGCACACA ACAGACATTG CCCGTAGCTG TTCAGGAGAA GCATCAGCCC
AAAGCAGAGT TGCTAACTCG AACGTTACAA TCTGCGCTGG AACAACGCGC GTCTGATATT
CATATCGAAC CAGCGGACAA TGCCTACCGC ATCCGCTTGC GTATCGACGG CGTATTGCAT
CCTTTACCGG ATGTTTCACC GGATGCCGGA GTCGCATTAA CCGCCAGATT AAAAGTGCTG
GGAAACCTGG ATATTGCGGA ACATCGCCTG CCGCAGGACG GGCAATTCAC TGTCGAACTG
GCAGGAAACG CCGTCTCATT TCGTATTGCG ACCTTACCAT GTCGGGGTGG TGAAAAGGTG
GTATTAAGGT TGTTACAGCA GGTGGGTCAG GCACTGGATG TCAACACGCT TGGAATGCAG
CCGTTACAAC TGGCGGACTT TGCTCATGCC TTGCAACAAC CACAGGGACT GGTGCTGGTA
ACTGGCCCTA CCGGCAGCGG CAAAACGGTC ACGCTTTATA GTGCCCTGCA AAAGCTGAAT
ACCGCTGACA TTAATATTTG TAGCGTCGAA GATCCAGTTG AGATCCCCAT AGCCGGACTA
AACCAGACGC AAATCCATCC GCGTGCCGGA CTCACCTTTC AGGGCGTTTT GCGTGCGTTA
TTGCGCCAGG ATCCTGACGT CATCATGATC GGAGAGATCC GCGATGGCGA AACAGCAGAG
ATCGCTATTA AAGCGGCGCA AACTGGTCAC CTGGTGTTGT CTACCCTACA CACTAATTCC
ACCTGCGAAA CGCTGGTACG TTTACAGCAA ATGGGGGTCG CCCGCTGGAT GCTATCATCG
GCGCTTACGC TGGTAATAGC CCAGCGTCTG GTACGCAAAC TTTGCCCACA TTGTCGCCGG
CAGCAAGGGG AGCCCATCCA CATTCCAGAC AATGTATGGC CATCGCCGCT GCCCCACTGG
CAGGCACCCG GTTGTGTACA TTGCTACCAC GGTTTTTATG GTCGTACGGC CTTATTTGAA
GTTCTGCCCA TAACGCCGGT CATTCGTCAG CTTATTTCCG CTAATACCGA CGTTGAATCG
CTGGAAACGC ACGCACGACA GGCGGGTATG CGTACGCTTT TTGAAAACGG CTGCCTGGCC
GTAGAGCAAG GCTTAACCAC CTTTGAAGAG TTAATCCGCG TACTGGGGAT GCCGCATGGC
GAGTAA
 
Protein sequence
MNIPQLTALC LRYHGVLLDA SEEVVHVAVV DAPSHELLDA LHFATTKRIE ITCWTRQQME 
GHASRTQQTL PVAVQEKHQP KAELLTRTLQ SALEQRASDI HIEPADNAYR IRLRIDGVLH
PLPDVSPDAG VALTARLKVL GNLDIAEHRL PQDGQFTVEL AGNAVSFRIA TLPCRGGEKV
VLRLLQQVGQ ALDVNTLGMQ PLQLADFAHA LQQPQGLVLV TGPTGSGKTV TLYSALQKLN
TADINICSVE DPVEIPIAGL NQTQIHPRAG LTFQGVLRAL LRQDPDVIMI GEIRDGETAE
IAIKAAQTGH LVLSTLHTNS TCETLVRLQQ MGVARWMLSS ALTLVIAQRL VRKLCPHCRR
QQGEPIHIPD NVWPSPLPHW QAPGCVHCYH GFYGRTALFE VLPITPVIRQ LISANTDVES
LETHARQAGM RTLFENGCLA VEQGLTTFEE LIRVLGMPHG E
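
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The sketch below shows one way to reproduce that mapping; it is a minimal illustration, not the tool that generated this record. The names CODON_TABLE and translate are hypothetical. Table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons; since this gene begins with ATG, the sketch does not handle alternative starts.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons, listed in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
# '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string codon by codon, ignoring whitespace.

    Stop codons are returned as '*'; trailing bases that do not
    form a complete codon are dropped.
    """
    dna = "".join(dna.split()).upper()
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


# First 30 bases of the gene sequence, copied with their block spacing.
print(translate("ATGAATATTC CACAGCTCAC TGCCCTGTGT"))  # MNIPQLTALC
```

The output matches the first ten residues of the protein sequence. The final six bases, GAGTAA, translate to E followed by a stop, consistent with the protein ending in E.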