Gene EcHS_A3278 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A3278 
Symbol 
ID5592096 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp3291248 
End bp3293950 
Gene Length2703 bp 
Protein Length900 aa 
Translation table11 
GC content46% 
IMG OID640922396 
Producthypothetical protein 
Protein accessionYP_001459890 
Protein GI157162572 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3188] P pilus assembly protein, porin PapC 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value0.887347 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATTTCA GGATTGCCAT GGATAAAAAA TTACTGGCTC TTTTGATCCT GGCGAGTCTC 
AGCCCGGCAG AGGCGGCATT AACCAAAATC CCCGCAGGGT TTGAGGTTAT TGCTCAGGGA
CAGCAGGAGT ATATCGAGGT TTATTTTTCA GGGAAAAATC TCGGTAAATA TTATGCAATG
GTTAATCTTG ATACCGTAAC ATTTCTTGAT CCAGCAAGTT TATATAACAA GCTGGAACTG
GATGTCGACG ATCAGAAAAT CGCGCATATA GTGAAAGAAA AATTATCGCA GCCGCTAGCT
CGCCACGGTG AATTGGCTTG CGGTTATGTA CGTACTGACT CAGGGTGTGG TTTTCTGAAT
ACCGATACGC TGGAAATAAT CTATAATGAT GAAGAAAGTT CGGCAACGTT GTTTATTAAT
CCGCAATGGA ATTCAGCTTT CGATGCGAAG TCATTATATT TAAATCCAGA CAAAAATACG
GTTAACGCTT TTATACATCA GCAAGACATC AATGTTCTGG CACAGGATGA TTACCAATCG
TTGTCTATTC AGGGAAACGG TGCGCTGGGA ATAACAGAAA ATAGCTATAT TGGTGCACAC
TGGAATTTCA ACGGTTATGA TGCAGATGAT GTCAGTGACA GTAATGCTGA TGTCAGCGAT
CTCTATTATC GTTATGATTT TTTACGTCGT TATTATGTGC AGGCGGGGCG CATGGACAAC
CGCACGCTAT TTAATGCACA AGGCGGGAAC TTTACCTTTA ACTTTTTGCC ACTCGGTGCA
ATCGACGGGA TGCGTATCGG GTCGACTCTT AGCTATTTAA ACCAGGCGCA AAGCCAGCAG
GGAACCCCGG TAATGGTCCT GCTTTCGCGC AATTCTCGTG TTGACGCTTA TCGTAATGAG
CAACTTCTGG GATCGTTTTA TCTCAATAGT GGTTCGCAAT TTATTGATAC CAGTTCCTTT
CCCCCGGGTA GCTATAGCGT AGCGTTAAAA GTCTATGAAA ATAACCAACT CACCCGCACC
GAGCTAGTAC CGTTTACCAA AACAGGCGGT CTGACGGACG GAAATGCGCA ATGGTTCTTA
CAGGCAGGTA AAACTACATC ACAGGCTTCT GATGATGAAA GTTCAGCTTA TCAACTGGGG
GTACGCCTGC CATTACATCC GCAATATGAG CTCTACGCAG GGCTGGCGAA TGCCGATAAT
GTGAGTGCTT TCGAGTTAGG TAATGACTGG ACGGCAAATT TAGGCGGGGC AGGGAATCTT
GCAATCAGCG CCAGCGTGTT CCGTAACGAT GACGGCGGCA AAGGTGATAT GCAACAGGCC
AACTGGAGTA ATTCGGGATG GCCGACGTTG GGCTTTTATC GGACCAACTC TGACGGCGAT
GCTTGTGCAA CCGACAGCAG AGAGAGCTAT AACGCCTTAA GCTGTTATGA AAGTATTTCC
GCGACGGTTT CACAGAATTT TGTCGGCTGG AATATGATGC TGGGTTATTC CCGCACACAA
AATAACACTG ATGATAGTTT GCGTTGGGAT AAACAGCAGA GCTTTGAAAA TAACTATCTT
CGCCAGACAA CTGCGCAAAG TATCTCCGAA ACTGTACAAC TTAGCGCTTC CCGCGCTTTT
GTGATGCGTG ACTGGATCTT GAGTACTTCG GTTGGTGTTT TCCATCGTAA TGACAACGGT
GGCGATAACG ACGACAACGG CTTGTACTTA TCGTTTTCGT TATCTGACAC GCCAACGATG
GACAGCAATA ACAACAGCCA TTCAACCAAT GTTTCTACGG ATTATCGTTA TAGCGATCAG
GATGGCGATC AAACGTCATG GCAGTTATCC CATACTTTTT ATAACGATTC ATTCAGCCAT
AAAGAACTTG GCGTAACCGT TGGGGGCCTG AACACCGATA CCATAAACAG CGCGGTTAAC
GGGCGTTGGG ATGGTCAATA CGGAAATGTC TACGCTACCG TATCTGACAG TTATGACCGT
AAGAATCATG ATCATCTCTC GGCCTTTACG GGGACTTACA GCTCTACACT GGCTGTCAGT
CGCTATGGCG TTAATTTGGG TGCCAGTGGT ACAGACGATT TGCTGGGTGC GGTATTGGTG
GATGTGAAAG GCTTCTCTGA ACAGGATGAA GAGAGTCAGG ATCTGCAACT CGAAGCGCGG
GTGGCAGGCA GCCGAACGTT GCAGCTTGGT CAAAGCGACA GTGTGTTGTT CCCTTATCCT
GGATTTCAGT CTGGTTTTGT TGAGGTTAAC GACAGTAGCC AGGGCAATCA ACAAGGGACA
ACAAACATCA TTAACGGTGC GGGGAATCGT GAATTAATGT TGTTGCCTGG CAAACTGCGC
TATCGCGAAG TGTCTGCCAG CTTTAATTAC AACTATATCG GTCGCTTGTT ATTACCAGCA
TCGGTAGAGA AATTCCCGCT GGTTGGTCTG AATAGCGCCA TGTTACTGGT AGCTGAAGAT
GGCGGATTTA CACTTGAAAT TAACGGTAGC GAAAAAGAGC TATATCTGCT TTCCGGGCAG
CAATTCCTTA AGTGTCCGCT GAGTGTTGTA AAGAAACGCG CCAGCATTCG TTACAGCGGA
GATGTCACAT GTAGTGTGGT GACTTATTCA CAATTACCGG AGTCCATTCA GGTTCAGGCA
CAGTTGAAAC AGCCTAAATT ACGTGGAAAC GTTCAGACGG CGCAAAGGGA GGTTGCACCA
TGA
 
Protein sequence
MDFRIAMDKK LLALLILASL SPAEAALTKI PAGFEVIAQG QQEYIEVYFS GKNLGKYYAM 
VNLDTVTFLD PASLYNKLEL DVDDQKIAHI VKEKLSQPLA RHGELACGYV RTDSGCGFLN
TDTLEIIYND EESSATLFIN PQWNSAFDAK SLYLNPDKNT VNAFIHQQDI NVLAQDDYQS
LSIQGNGALG ITENSYIGAH WNFNGYDADD VSDSNADVSD LYYRYDFLRR YYVQAGRMDN
RTLFNAQGGN FTFNFLPLGA IDGMRIGSTL SYLNQAQSQQ GTPVMVLLSR NSRVDAYRNE
QLLGSFYLNS GSQFIDTSSF PPGSYSVALK VYENNQLTRT ELVPFTKTGG LTDGNAQWFL
QAGKTTSQAS DDESSAYQLG VRLPLHPQYE LYAGLANADN VSAFELGNDW TANLGGAGNL
AISASVFRND DGGKGDMQQA NWSNSGWPTL GFYRTNSDGD ACATDSRESY NALSCYESIS
ATVSQNFVGW NMMLGYSRTQ NNTDDSLRWD KQQSFENNYL RQTTAQSISE TVQLSASRAF
VMRDWILSTS VGVFHRNDNG GDNDDNGLYL SFSLSDTPTM DSNNNSHSTN VSTDYRYSDQ
DGDQTSWQLS HTFYNDSFSH KELGVTVGGL NTDTINSAVN GRWDGQYGNV YATVSDSYDR
KNHDHLSAFT GTYSSTLAVS RYGVNLGASG TDDLLGAVLV DVKGFSEQDE ESQDLQLEAR
VAGSRTLQLG QSDSVLFPYP GFQSGFVEVN DSSQGNQQGT TNIINGAGNR ELMLLPGKLR
YREVSASFNY NYIGRLLLPA SVEKFPLVGL NSAMLLVAED GGFTLEINGS EKELYLLSGQ
QFLKCPLSVV KKRASIRYSG DVTCSVVTYS QLPESIQVQA QLKQPKLRGN VQTAQREVAP