Gene EcolC_3090 details


Gene Information

Locus tag: EcolC_3090
Symbol:
ID: 6066236
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 3384094
End bp: 3386703
Gene Length: 2610 bp
Protein Length: 869 aa
Translation table: 11
GC content: 48%
IMG OID: 641602507
Product: fimbrial biogenesis outer membrane usher protein
Protein accession: YP_001726041
Protein GI: 170021087
COG category: [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3188] P pilus assembly protein, porin PapC
TIGRFAM ID:
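
The coordinate and length fields above are mutually consistent, which a few lines of Python can confirm. This is only a sketch: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon. The function name expected_protein_length is illustrative and not part of any database API.

```python
def expected_protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose length includes the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per three bases, minus the terminal stop codon.
    return gene_length_bp // 3 - 1


# Values copied from the record above.
start_bp, end_bp = 3384094, 3386703

# Assumption: coordinates are 1-based and inclusive, hence the +1.
gene_length = end_bp - start_bp + 1

print(gene_length)                           # 2610
print(expected_protein_length(gene_length))  # 869
```

Both results match the Gene Length (2610 bp) and Protein Length (869 aa) fields.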


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.832581
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAATAC CCACTACTAC GGATATTCCG CAGAGGTATA CCTGGTGTCT GGCCGGAATT 
TGTTATTCAT CTCTTGCCAT TTTACCCTCC TTTTTAAGCT ATGCGGAAAG TTATTTCAAC
CCGGCATTTT TATTAGAGAA TGGCACATCC GTTGCTGATT TATCGCGCTT TGAGAGAGGT
AATCATCAAC CTGCGGGCGT GTATCGGGTG GATCTCTGGC GTAATGATGA GTTCATTGGT
TCGCAGGATA TCGTATTTGA ATCGACAACA GAAAATACAG GTGATAAATC AGGTGGGTTA
ATGCCCTGTT TTAACCAGGT ACTTCTTGAA CGAATTGGCC TTAATAGCAG TGCATTTCCC
GAGTTAGCCC AGCAGCAAAA CAATAAATGC ATCAATTTAC TGAAAGCTGT ACCTGATGCC
ACAATTAACT TTGATTTTGC AGCGATGCGC CTGAACATCA CTATTCCTCA GATAGCGTTG
TTGAGTAGCG CTCACGGTTA CATTCCGCCT GAAGAGTGGG ATGAAGGTAT TCCTGCTTTA
CTCCTGAATT ATAATTTCAC CGGTAACAGA GGTAATGGTA ACGATAGCTA TTTTTTTAGT
GAGCTCAGCG GGATTAATAT TGGCCCGTGG CGTTTACGCA ACAATGGTTC CTGGAACTAT
TTTCGCGGAA ATGGATATCA TTCAGAACAG TGGAATAATA TTGGCACCTG GGTACAGCGC
GCCATTATTC CGCTGAAAAG TGAACTGGTA ATGGGAGACG GCAATACAGG AAGTGATATT
TTCGATGGCG TTGGATTTCG TGGTGTACGG CTTTATTCTT CTGATAATAT GTATCCTGAT
AGCCAGCAAG GGTTTGCCCC AACGGTACGT GGGATTGCCC GTACGGCGGC CCAGCTAACG
ATTCGGCAAA ATGGTTTTAT TATCTATCAA AGCTATGTTT CCCCCGGCGC TTTTGAAATT
ACAGATTTGC ACCCGACATC TTCAAATGGC GATCTGGACG TCACCATCGA CGAGCGCGAT
GGCAATCAGC AGAATTACAC AATTCCGTAT TCAACAGTGC CGATTTTACA ACGCGAAGGG
CGTTTCAAAT TTGACCTGAC GGCGGGCGAT TTTCGTAGCG GTAATAGTCA GCAATCATCA
CCTTTCTTTT TTCAGGGCAC GGCACTCGGT GGTTTACCAC AGGAATTTAC AGCCTACGGC
GGGACGCAAT TATCTGCCAA TTACACCGCC TTTTTATTAG GACTGGGGCG CAACCTCGGG
AACTGGGGCG CAGTGTCGCT GGATGTGACC CATGCGCGCA GCCAGTTAGC CGACGACAGT
CGTCATGAGG GGGATTCCAT TCGCTTCCTC TATGCGAAAT CGATGAACAC CTTCGGTACC
AATTTTCAGT TAATGGGTTA CCGCTATTCG ACACAAGGTT TTTATACCCT TGATGATGTT
GCGTATCGTC GAATGGAGGG GTACGAATAT GATTACGATT ATGACGGTGA GCATCGGGAT
GAACCGATAA TCGTGAATTA CCACAATTTA CGCTTTAGCC GTAAAGACCG TTTGCAGTTA
AATATTTCAC AATCACTTAA TGACTTTGGC TCGCTTTATA TTTCTGGTAC CCATCAAAAA
TACTGGAATA CTTCGGATTC AGATACGTGG TATCAGGTGG GGTATACCAG CAGCTGGGTT
GGCATCAGTT ATTCGCTCTC ATTTTCGTGG AATGAATCTG TAGGGATCCC CGATAACGAA
CGTATTGTCG GACTTAATGT TTCAGTGCCT TTCAATGTTT TGACCAAACG TCGCTACACC
CGGGAAAATG CGCTCGACCG CGCTTATGCC TCCTTTAACG CCAACCGTAA CAGCAACGGG
CAAAATAGCT GGCTGGCAGG TGTAGGTGGG ACCTTACTGG AAGGCCACAA CCTGAGTTAT
CACGTAAGCC AGGGTGATAC CTCGAATAAT GGGTATACGG GCAGCGCCAC GGCAAACTGG
CAGGCCGCTT ACGGTACGCT GGGAGTCGGG TATAACTACG ACCGCGATCA ACATGACGTT
AACTGGCAGC TGTCTGGCGG TGTGGTCGGA CATGAAAATG GCATAACGCT GAGCCAGCCT
TTAGGGGATA CCAATGTTTT GATTAAAGCG CCTGGCGCAG GCGGTGTACG CATTGAAAAT
CAAACTGGCA TTTTAACCGA CTGGCGCGGC TATGCGGTGA TGCCGTATGC CACGGTTTAT
CGGTATAACC GTATCGCGCT TGATACCAAT ACGATGGGGA ATTCCATCGA TGTTGAAAAA
AATATTAGCA GCGTTGTGCC GACGCAAGGC GCGTTGGTTC GTGCCAATTT TGATACCCGC
ATAGGCGTGC GGGCGCTCAT TACCGTTACC CAGGGCAGAA AACCGGTGCC GTTTGGATCA
CTGGTACGGG AAAACAGTAC CGGAATAACC AGTATGGTGG GTGATGACGG GCAAGTTTAT
TTAAGTGGTG CGCCATTGTC TGGTGAATTA CTGGTTCAGT GGGGAGACGG CGCGAACTCA
CGCTGCATTG CGCACTATGT ATTGCCGAAG CAAAGCTTAC AGCAAGCCGT CACAGTTATT
TCGGCAGTTT GCACACATCC TGGCTCATAA
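
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be cleaned before its length or GC content can be computed. The sketch below shows one way to do that; the helper names clean_sequence and gc_content are illustrative, and only the first printed line is used as input.

```python
def clean_sequence(text: str) -> str:
    """Drop all whitespace and upper-case a sequence printed in spaced blocks."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a non-empty DNA sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence as printed in this record.
first_line = "ATGAAAATAC CCACTACTAC GGATATTCCG CAGAGGTATA CCTGGTGTCT GGCCGGAATT"
seq = clean_sequence(first_line)

print(len(seq))                    # number of bases on that line
print(round(gc_content(seq), 3))   # GC fraction of that line only
```

Passing all printed lines of the gene sequence to the same functions should give values comparable to the Gene Length and GC content fields, provided the sequence was copied intact.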
 
Protein sequence
MKIPTTTDIP QRYTWCLAGI CYSSLAILPS FLSYAESYFN PAFLLENGTS VADLSRFERG 
NHQPAGVYRV DLWRNDEFIG SQDIVFESTT ENTGDKSGGL MPCFNQVLLE RIGLNSSAFP
ELAQQQNNKC INLLKAVPDA TINFDFAAMR LNITIPQIAL LSSAHGYIPP EEWDEGIPAL
LLNYNFTGNR GNGNDSYFFS ELSGINIGPW RLRNNGSWNY FRGNGYHSEQ WNNIGTWVQR
AIIPLKSELV MGDGNTGSDI FDGVGFRGVR LYSSDNMYPD SQQGFAPTVR GIARTAAQLT
IRQNGFIIYQ SYVSPGAFEI TDLHPTSSNG DLDVTIDERD GNQQNYTIPY STVPILQREG
RFKFDLTAGD FRSGNSQQSS PFFFQGTALG GLPQEFTAYG GTQLSANYTA FLLGLGRNLG
NWGAVSLDVT HARSQLADDS RHEGDSIRFL YAKSMNTFGT NFQLMGYRYS TQGFYTLDDV
AYRRMEGYEY DYDYDGEHRD EPIIVNYHNL RFSRKDRLQL NISQSLNDFG SLYISGTHQK
YWNTSDSDTW YQVGYTSSWV GISYSLSFSW NESVGIPDNE RIVGLNVSVP FNVLTKRRYT
RENALDRAYA SFNANRNSNG QNSWLAGVGG TLLEGHNLSY HVSQGDTSNN GYTGSATANW
QAAYGTLGVG YNYDRDQHDV NWQLSGGVVG HENGITLSQP LGDTNVLIKA PGAGGVRIEN
QTGILTDWRG YAVMPYATVY RYNRIALDTN TMGNSIDVEK NISSVVPTQG ALVRANFDTR
IGVRALITVT QGRKPVPFGS LVRENSTGIT SMVGDDGQVY LSGAPLSGEL LVQWGDGANS
RCIAHYVLPK QSLQQAVTVI SAVCTHPGS
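
The protein sequence can be checked against the gene sequence by translating the DNA codon by codon. Translation table 11 (bacterial, archaeal and plant plastid code) assigns the same amino acids to codons as the standard code and differs in which start codons it permits, so a standard codon table is enough for a sketch. The code below is a minimal illustration, not a replacement for a sequence-analysis library: it assumes the input is in frame, contains only A, C, G and T, and it does not handle alternative start codons such as GTG or TTG.

```python
from itertools import product

# Codon table built in TCAG order; amino-acid assignments are those of the
# standard code, which translation table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence in this record.
print(translate("ATGAAAATAC CCACTACTAC GGATATTCCG"))  # MKIPTTTDIP
print(translate("CATCCTGGCT CATAA"))                  # HPGS
```

The two outputs match the first ten and last four residues of the protein sequence listed above.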