Gene EcHS_A1049 details


Gene Information

Locus tag: EcHS_A1049
Symbol:
ID: 5591789
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 1060831
End bp: 1063431
Gene Length: 2601 bp
Protein Length: 866 aa
Translation table: 11
GC content: 48%
IMG OID: 640920214
Product: outer membrane usher protein fimD-like protein
Protein accession: YP_001457779
Protein GI: 157160461
COG category: [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3188] P pilus assembly protein, porin PapC
TIGRFAM ID:
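
The start, end, gene length, and protein length fields in this record are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and a single terminal stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(1060831, 1063431)
print(length_bp)                  # 2601
print(protein_length(length_bp))  # 866
```

Both values match the Gene Length and Protein Length fields listed in the record.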


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value: 0.607471
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTATAGAA CTCACCGACA ACACAGCCTG TTAAGCTCTG GTGGAGTGCC ATCGTTTATT 
GGTGGGCTGG TGGTGTTTGT GTCGGCAGCG TTCAATGCAC AAGCTGAAAC CTGGTTCGAA
CCTGCCTTTT TCAAAGATGA TCCCTCAATG GTGGCCGATT TGTCTCGTTT CGAAAAAGGA
CAAAAAATAA CGCCAGGGGT TTATCGTGTC GATATTGTTC TGAATCAGAC AATTGTAGAT
ACGCGCAACG TCAATTTTGT TGAGTTAACG CCAGAGAAGG GGATTGCCGC CTGTTTGACG
ACTGAAAGCC TGGATGCAAT GGGTGTGAAT ACTGATGCGT TTCCGGCTTT TAAACAACTG
GACAAACAAG CGTGTGCGCT ATTGGCGGAG ATTATTCCGG ATGCCAGCGT AACTTTTAAT
GTGAATAAAC TCCGTCTGGA AATTTCAGTA CCGCAAATTG CTATAAAAAG TAACGCTCGT
GGTTATGTCC CCCCTGAACG TTGGGATGAA GGGATCAACG CGCTATTACT GGGATATTCA
TTTAGCGGGG CTAACAGTAT TCATAGCAGC GCAGACAGTG ATTCTGGCGA CAGTTATTTT
CTGAATTTAA ACAGTGGCGT TAATTTAGGC CCATGGAGAT TGCGCAACAA TTCAACATGG
AGTCGCAGTA GTGGCCAAAC CGCAGAATGG AAGAATCTCA TCAGCTATTT GCAGCGGGCG
GTTATTCCAC TAAAAGGCGA ACTGACCGTA GGTGATGATT ATACTGCAGG CGATTTTTTC
GATAGTGTCA GCTTTCGTGG TGTGCAGCTG GCGTCAGATG ACAACATGCT GCCAGACAGC
CTGAAAGGGT TTGCGCCTGT GGTGCGTGGT ATCGCCAAAA GCAATGCCCA GATAACGATT
AAGCAAAATG GTTACACCAT TTACCAAACT TATGTATCGC CTGGTGCTTT TGAAATTAGT
GATCTCTATT CCACGTCGTC GAGCGGTGAT TTGTTAGTTG AAATCAAAGA AGCGGACGGC
AGCGTCAATA GCTACAGCGT ACCGTTTTCC AGCGTGCCAT TACTCCAGCG TCAGGGGCGA
ATCAAATACG CGGTGACACT GGCGAAATAC AGAACCAATA GTAATGAACA GCAGGAGAGC
AAATTTGCCC AGGCCACGTT GCAGTGGGGC GGACCGTGGG GAACGACATG GTATGGTGGT
GGACAATATG CTGAATATTA CCGTGCCGCC ATGTTTGGTC TGGGATTTAA CCTTGGCGAT
TTCGGAGCAA TTTCGTTCGA TGCGACCCAG GCGAAGAGTA CGCTGGCAGA CCAAAGCGAA
CATAAAGGTC AGTCATATCG TTTTCTGTAT GCCAAAACGC TCAACCAATT GGGCACTAAT
TTTCAATTGA TGGGCTATCG CTATTCGACG TCGGGTTTCT ACACCCTTTC CGACACCATG
TATAAACATA TGGATGGCTA CGAATTTAAT GACGGTGATG ATGAAGATAC GCCGATGTGG
TCGCGTTATT ACAATTTGTT TTACACCAAA CGTGGCAAAC TGCAGGTCAA TATCTCCCAG
CAATTAGGCG AGTACGGTTC GTTTTATTTA AGTGGTAGCC AGCAAACTTA CTGGCATACC
GATCAACAGG ATCGGCTATT ACAGTTTGGC TACAACACGC AAATTAAAGA TCTCTCGCTG
GGGGGTTCCT GGAACTACAG TAAGTCCCGT GGTCAACCTG ATGCTGATCA GGTGTTTGCA
CTAAATTTTT CCCTGCCGCT CAATCTGTTG CTCCCCAGAA GTAATGATAG CTATACCAGG
AAAAAAAATT ACGCCTGGAT GACCTCTAAC ACCAGTATCG ATAACGAAGG GCACATTACA
CAAAACCTGG GTTTAACGGA GACACTACTC GATGACGGTA ACCTGAGCTA CAGCGTGCAA
CAGGGATATA ACAGCGAGGG GAAAACGGCT AATGGTAGCG CCAGCATGGA CTACAAAGGG
GCGTTTGCAG ATGCCCGAGT GGGCTACAAC TACAGCGATA ACGGCAGTCA ACAACAACTG
AACTACGCTC TTTCAGGCAG TTTAGTTGCC CATTCACAGG GCATTACCCT GGGGCAATCG
CTGGGGGAAA CTAACGTTCT GATTGCAGCA CCAGGCGCAG AGAATACTCG TGTGGCGAAC
AGCACCGGGC TGAAAACTGA CTGGCGCGGA TATACCGTTG TTCCTTATGC CACTTCTTAT
CGGGAAAATC GAATCGCACT TGATGCGGCG TCGTTAAAAC GTAACGTGGA TCTTGAAAAT
GCAGTAGTCA ACGTGGTTCC CACCAAAGGG GCGTTGGTTC TGGCGGAGTT CAATGCCCAT
GCGGGTGCAA GGGTATTAAT GAAAACATCA AAGCAGGGTA TACCGCTGCG TTTTGGCGCG
ATAGCGACGC TGGACGGCGT ACAGGCTAAT AGCGGCATAA TTGATGATGA TGGCTCGCTC
TATATGGCGG GTTTACCGGC GAAGGGAACA ATAAGCGTGC GCTGGGGCGA AGCTCCCGAT
CAAATTTGTC ATATCAATTA CGAGCTTACC GAACAACAAA TTAACTCTGC GATTACGCGA
ATGGATGCCA TATGCAGATA A
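
The gene sequence is displayed in blocks of 10 bases, 60 per line, so it needs to be joined before any analysis. The sketch below shows one way to do that and to compute GC content; the helper names `clean_sequence` and `gc_percent` are hypothetical, and the rounding used for the GC content field in this record is not known.

```python
def clean_sequence(text: str) -> str:
    """Strip spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# Last displayed line of the gene sequence, copied as shown.
tail = clean_sequence("ATGGATGCCA TATGCAGATA A")
print(len(tail))  # 21
```

Applied to the full pasted sequence, `len(clean_sequence(...))` can be compared against the 2601 bp gene length, and `gc_percent` against the GC content field.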
 
Protein sequence
MYRTHRQHSL LSSGGVPSFI GGLVVFVSAA FNAQAETWFE PAFFKDDPSM VADLSRFEKG 
QKITPGVYRV DIVLNQTIVD TRNVNFVELT PEKGIAACLT TESLDAMGVN TDAFPAFKQL
DKQACALLAE IIPDASVTFN VNKLRLEISV PQIAIKSNAR GYVPPERWDE GINALLLGYS
FSGANSIHSS ADSDSGDSYF LNLNSGVNLG PWRLRNNSTW SRSSGQTAEW KNLISYLQRA
VIPLKGELTV GDDYTAGDFF DSVSFRGVQL ASDDNMLPDS LKGFAPVVRG IAKSNAQITI
KQNGYTIYQT YVSPGAFEIS DLYSTSSSGD LLVEIKEADG SVNSYSVPFS SVPLLQRQGR
IKYAVTLAKY RTNSNEQQES KFAQATLQWG GPWGTTWYGG GQYAEYYRAA MFGLGFNLGD
FGAISFDATQ AKSTLADQSE HKGQSYRFLY AKTLNQLGTN FQLMGYRYST SGFYTLSDTM
YKHMDGYEFN DGDDEDTPMW SRYYNLFYTK RGKLQVNISQ QLGEYGSFYL SGSQQTYWHT
DQQDRLLQFG YNTQIKDLSL GGSWNYSKSR GQPDADQVFA LNFSLPLNLL LPRSNDSYTR
KKNYAWMTSN TSIDNEGHIT QNLGLTETLL DDGNLSYSVQ QGYNSEGKTA NGSASMDYKG
AFADARVGYN YSDNGSQQQL NYALSGSLVA HSQGITLGQS LGETNVLIAA PGAENTRVAN
STGLKTDWRG YTVVPYATSY RENRIALDAA SLKRNVDLEN AVVNVVPTKG ALVLAEFNAH
AGARVLMKTS KQGIPLRFGA IATLDGVQAN SGIIDDDGSL YMAGLPAKGT ISVRWGEAPD
QICHINYELT EQQINSAITR MDAICR
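
The protein sequence can be cross-checked against the gene sequence by translating it. The following is a minimal pure-Python sketch, not the pipeline that produced this record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code; the table differs only in which codons may act as initiators, which this sketch does not handle (the gene here starts with ATG, so that is not needed). The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; translation table 11 uses the
# same codon-to-amino-acid assignments as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon.

    Whitespace is ignored; any character other than A, C, G, T raises KeyError.
    """
    seq = "".join(seq.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 21 bases of the gene sequence, copied as displayed.
print(translate("ATGTATAGAA CTCACCGACA ACACAGCCTG"))  # MYRTHRQHSL
print(translate("ATGGATGCCA TATGCAGATA A"))           # MDAICR
```

The two outputs correspond to the first ten and last six residues of the protein sequence listed in this record.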