Gene EcolC_2937 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_2937 
Symbol 
ID6065552 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp3203041 
End bp3205500 
Gene Length2460 bp 
Protein Length819 aa 
Translation table11 
GC content50% 
IMG OID641602349 
Productfimbrial biogenesis outer membrane usher protein 
Protein accessionYP_001725891 
Protein GI170020937 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3188] P pilus assembly protein, porin PapC 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACACCG TGAATATTTA TCGACTCTCT TTTGTATCCT GCCTGGTCGT GGCGATGCCT 
TGCGCATTGG CGGTCGAATT CAACCTTAAT GTTCTCGATA AATCGATGCG CGACCGCATT
GATATTTCAT TATTAAAGGA AAAAGGAGTC ATTGCTCCCG GTGAGTATTT TGTTAGCGTT
GCGGTAAATA ATAACCAAAT CAGTAACGGG CAAAAGATTA ACTGGCACAA AAATGACGAT
AAAACCATTC CGTGCATCAA TGATTTACTG GTCGATAAAT TTGGCTTAAA ACCTGAAGTC
CGTCAGTCGT TACCATTGAT AAATCAGTGC GTCGATTTTA GCTCCCGACC TGAAATGCTC
TTCAATTTCG ATCAAGCCAA TCAGCAACTA AATATCACCA TTCCGCAAGC CTGGCTGGCG
TGGCACTCAG AAAACTGGAC CCCACCCTCC ACATGGAAAG AAGGTGTCGC CGGTATCCTG
ATGGATTACA ACTTGTTTGC CAGCAGCTAC CGCCCACAGG ACGGCAGCAG CAGCACTAAC
CTGAACGCCT ACGGTACCAC CGGAATTAAC GCCGGGGCAT GGCGCTTACG TAGTGATTAT
CAGTTGAATC AGACTGATAG CGATGATAAC CATGAACAGT CAGGCGGAAT ATCGCGCACC
TATCTTTTTC GTCCATTACC GCAATTAGGC TCTAAATTAA CCCTCGGCGA AACGGATTTT
AGTTCCAATA TTTTCGACGG TTTTTCTTAT ACCGGCGCGG CACTGGCAAG TGACGAGCGA
ATGTTGCCAT GGGAACTACG CGGCTACGCC CCACAAATTA GCGGTATTGC ACAGACCAAT
GCCACGGTGA CGATCAGTCA ATCAGGCCGC GTCATTTACC AGAAAAAAGT CCCACCAGGC
CCATTTATCA TTGACGACCT TAATCAGTCT GTTCAGGGCA CACTGGATGT CAAAGTGACG
GAAGAAGATG GTCGGGTGAA CAATTTCCAG GTTTCGGCAG CATCGACGCC CTTCCTGACT
CGTCAGGGAC AGGTTCGCTA TAAACTGGCC GCGGGTCAGC CACGGCCCTC CATGTCACAT
CAAACTGAAA ATGAAACCTT TTTTAGCAAT GAAGTTTCCT GGGGGATGCT GTCAAACACC
TCGCTGTACG GCGGCCTGCT GCTTTCTGGT GATGACTACC ATTCTGCCGC AATGGGTATT
GGGCAAAATA TGCTGTGGCT TGGTGCGCTG TCGTTTGATG TCACGTGGGC CAGTAGCCAT
TTTGATACTC AGCAGGACGA GCGGGGCTTA AGCTACCGTT TTAATTACAG CAAACAAGTG
GATGCCACTA ACAGCACGAT TTCCCTCGCC GCTTATCGTT TCTCCGATCG TCATTTTCAC
AGCTACGCCA ACTATCTGGA TCACAAATAC AACGACAGCG ATGCGCAGGA CGAAAAACAG
ACGATCAGCT TATCTGTGGG CCAACCGATT ACCCCACTAA ACCTCAATCT TTACGCCAAC
CTGCTACATC AAACCTGGTG GAATGCAGAC GCCTCCACGA CCGCCAACAT CACAGCCGGT
TTTAATGTTG ATATTGGTGA CTGGAGAGAT ATCTCTATTT CGACGTCATT CAATACAACC
CATTACGAAG ATAAAGATCG CGACAACCAG ATTTACCTGT CGATTTCGCT CCCCTTCGGT
AATGGTGGTC GGGTTGGTTA TGACATGCAA AACAGTAGCC ACAGCACCAC ACACCGCATG
TCGTGGAACG ATACGCTGGA TGAACGTAAT AGCTGGGGCA TGTCTGCCGG ACTGCAATCC
GACCGTCCTG ACAATGGAGC CCAGGTGAGC GGTAACTATC AGCACCTGAG TTCAGCGGGT
GAGTGGGATA TTTCTGGTAC CTATGCCGCC AATGATTACA GTTCCGTCAG CAGCAGCTGG
AGCGGTTCTT TCACCGCAAC CCAATATGGT GCAGCGTTTC ATCGCCGCAG CTCCACCAAT
GAACCTCGCC TGATGGTCAG CACCGATGGC GTGGCAGATA TTCCGGTTCA GGGCAATCTC
GACTACACCA ACCATTTTGG CATTGCGGTG GTGCCGTTGA TTTCCAGTTA TCAGCCTTCC
ACCGTGGCGG TGAACATGAA TGACTTACCC GACGGCGTAA CAGTTACAGA AAACGTTATC
AAAGAAACGT GGATTGAAGG CGCGATAGGT TACAAATCAC TGGCTTCCCG TTCCGGTAAA
GACGTTAACG TCATCATTCG CAACGCCAGC GGTCAGTTTC CTCCCCTCGG AGCGGATATC
CGCCAGGATG ACAGCGGCAT TAGCGTGGGG ATGGTTGGCG AGGAAGGACA TGCCTGGTTA
AGCGGAGTCG CTGAAAATCA AAAGTTTACC GTGGTCTGGG GTGATAGCCA GCATTGCTCG
CTCCATCTTC CTGAACATAT GGAAGACACC GCAAATCGCC TGATTTTACC TTGTCATTAA
 
Protein sequence
MDTVNIYRLS FVSCLVVAMP CALAVEFNLN VLDKSMRDRI DISLLKEKGV IAPGEYFVSV 
AVNNNQISNG QKINWHKNDD KTIPCINDLL VDKFGLKPEV RQSLPLINQC VDFSSRPEML
FNFDQANQQL NITIPQAWLA WHSENWTPPS TWKEGVAGIL MDYNLFASSY RPQDGSSSTN
LNAYGTTGIN AGAWRLRSDY QLNQTDSDDN HEQSGGISRT YLFRPLPQLG SKLTLGETDF
SSNIFDGFSY TGAALASDER MLPWELRGYA PQISGIAQTN ATVTISQSGR VIYQKKVPPG
PFIIDDLNQS VQGTLDVKVT EEDGRVNNFQ VSAASTPFLT RQGQVRYKLA AGQPRPSMSH
QTENETFFSN EVSWGMLSNT SLYGGLLLSG DDYHSAAMGI GQNMLWLGAL SFDVTWASSH
FDTQQDERGL SYRFNYSKQV DATNSTISLA AYRFSDRHFH SYANYLDHKY NDSDAQDEKQ
TISLSVGQPI TPLNLNLYAN LLHQTWWNAD ASTTANITAG FNVDIGDWRD ISISTSFNTT
HYEDKDRDNQ IYLSISLPFG NGGRVGYDMQ NSSHSTTHRM SWNDTLDERN SWGMSAGLQS
DRPDNGAQVS GNYQHLSSAG EWDISGTYAA NDYSSVSSSW SGSFTATQYG AAFHRRSSTN
EPRLMVSTDG VADIPVQGNL DYTNHFGIAV VPLISSYQPS TVAVNMNDLP DGVTVTENVI
KETWIEGAIG YKSLASRSGK DVNVIIRNAS GQFPPLGADI RQDDSGISVG MVGEEGHAWL
SGVAENQKFT VVWGDSQHCS LHLPEHMEDT ANRLILPCH