Gene EcolC_3520 details

Gene Information

Locus tag: EcolC_3520
Symbol:
ID: 6068747
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 3844331
End bp: 3846931
Gene Length: 2601 bp
Protein Length: 866 aa
Translation table: 11
GC content: 45%
IMG OID: 641602937
Product: putative outer membrane usher protein
Protein accession: YP_001726461
Protein GI: 170021507
COG category: [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3188] P pilus assembly protein, porin PapC
TIGRFAM ID:
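
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 3844331
end_bp = 3846931

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and contributes no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")      # 2601 bp
print(protein_length, "aa")   # 866 aa
```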


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.315338
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGTATACGT TAACTCATCA GAAAAGCCGT CTACCAAAAA CAACACTTCT GGCTGCATGT 
TGTGCTTTTT TTTATAGCAG TAACGGCGCG GCAACAGAAA GCGTGGAATA TGACAGTTCC
TTTTTGATGG GTACTGGGGC ATCGAGCATC GATGTCAAAC GTTATTCACA GGGTAACCCA
ACGCCTCCTG GCGTTTATAA CGTCCGTGTG TTTATCAACC AACAATCCGT TGCCAGTTTA
GAACTTCCTT TTGTTGATAT AGGGGAAAAC AGCGCCGCGG CCTGTATAAC ACTTAAGAAT
CTGGCACAAC TGCATATTAA GCAGCCAGAA TACCCAATTA CACTGCTTGC CAGAGAAGGC
GAAGAAGGCG ATTGCCTGGA CATCAAAAAA TCGATCGAGC AAGCGGAAGT CAGCTATGAC
GGCAGTGAGC AGCATTTGGA AATCTCGGTT CCGCAGGCTT ACGTCTATAA AACATATGGC
GGTTATGTTG ACCCCTCACT TTGGGAGTCG GGTATTAATG CCGCCATGCT TTCTTACTCA
TTAAACGCCT ACCACTCCGA CTCTAAAAAT GGCAACCGGG ACAGTATATA CGGAGCCTTC
AATACCGGTC TGAATTTAGG AGCCTGGCAC CTGCGTGCGC GCGGCAACTA TAACTGGTCA
CAAGATAACG GTAACAGTTT TGACTTTCAG GATCGTTACT TACAGCGTGA CATCCCGGCA
ATACGCTCAC AAGTGATCGT GGGTGATGCC TATACCACCG GTGAAACCTT CGACTCAGTC
AATGTACGTG GTATGCGTCT TTATAGTGAT AGCCGTATGC TACCATCCGC ACTGGCCAGC
TATGCGCCAA TCATTCGTGG CGTAGCAAAT TCTAACGCCA AAGTGACAGT GACCCAGAAC
GGTTACAAAA TCTATGAGTC CACCGTTCCA CCGGGTGAAT TCGTGATTGA CGATCTCAGT
CCTTCAGGGT TTGGTAGCGA ATTGGTCATT ACTATTGAAG AAGCAGATGG TTCTAAACGC
TCATTTACCC AACCTTTCTC TTCTGTCGTG CAGTTGCAGC GCCCTGGCGT TGGCCGTTGG
GATATTAGTG CCGGACAAAT TATCGATGAC AGCTTGCGTC ATGAACCCAA TATGGCACAG
GCCTCTTTTT ACTACGGTTT CAATAATTTA TTCACCGGTT ATACCGGAGT CCAAATTACA
GATAACGACT ATATGTCTGG CCTTTTGGGG CTAGGGATCA ATACCAGTAT TGGTGCCTTT
GCGGTAGATA TTACACATGC CCGTACCGAA ATTCCTGACG ATAAAACATA TCAGGGGCAA
AGTTATCGTA TAACCTGGAA CAAATTAATC GAAGCTACAA GCACCTCATT TAACCTTGCT
GCCTATCGTT ATTCCACTGA GGATTACCTT GGTCTGCATG ATGCGTTAGC CCTCATTGAC
GACGCCAATC ATCTGTCAGT CAACGACGAC AAAGACACGA TCCGTACCTA TTCACGCATG
AAAAACCAAT TTACCGTTAG TGTAAATCAG CCGTTGAGTT TTGCTTATGA GGATTACGGT
TCACTCTTTT TATCTGGCAG CTGGACAAAC TACTGGGCTG GTAACAATAA CCGAACTGAA
TACAATGTTG GTTACAGTAA AAGTGTCTCC TGGGGGAATT TCAGCGTCAA CTTGCAACGT
AGCTGGAATG AGGACGGCAA TAAAGATGAT GCGATGTATT TGAACGTTAG CGTTCCGCTG
GAAAATATTT TCGGCGGTAA ACGTAAATCC TCAGGGTTCC GTAACCTAAG CACTCAATTC
AATACTGATT TCAACGGTTC GCATCAATTA AACGTCAGTA GTTCGGGTAA CAGCGAAGAT
AATCTCATCG GTTACAGTGT GAATACGGGT TATAACCTTG ATAAAGAATC TGAAAATGTC
GCTTCCGTTG GTGGATATCT TAGCTATGAC TCCACATGGG GAGGTTTTTC CGCTTCTGCT
TCTGCTAGCA CCGATAACAG ACAACAATAC TCTGTCTCTA CCGATGGTGG TTTTGTACTT
CACAGTGGCG GCCTGACCTT TACCAATAAC AGTTTCGGTA GCAATGACAC TTTAGTCGTC
ATTAAAGCAC CCGGAGCGAA GGGGGCACGA GTCAATAACG GTACTGATGA AATCGATCGC
TGGGGTTATG CCGTTTCCTC ATCCTTAAGT CCATACCGTG AAAACCGGGT GGGATTAAAT
ATTGAAACAT TGGAAAACGA TGTCGAACTG AAAAGTACTA GTGCCACCAC TGTACCTCGT
AGTGGTTCCA TTATTCTTGC CAGTTTTGAA ACTGACCAGG GGCGTTCTGC AGTTCTGAAT
ATCAGCACCA GTAACGGTAA ACCTATTCCT TTTGCCGCTG AAGTTTATCA GGATGAAATT
ATGATCGGCA GTATGGGTCA GGGTGGTCAG GCATTTGTAC GCGGTATTAA TGACAGCGGA
GAGTTAATTA TCCGCTGGTT CGAAGACAGT CGAACAATAA ATTGCAAATT GCATTATCAA
CTTCCTGCAC AGCCAGAAAC ACTTGGAAGT ACAAACACCT TATTATTAAA CAACCTTACC
TGTAAGTTGG TTAACCACTA A
 
Protein sequence
MYTLTHQKSR LPKTTLLAAC CAFFYSSNGA ATESVEYDSS FLMGTGASSI DVKRYSQGNP 
TPPGVYNVRV FINQQSVASL ELPFVDIGEN SAAACITLKN LAQLHIKQPE YPITLLAREG
EEGDCLDIKK SIEQAEVSYD GSEQHLEISV PQAYVYKTYG GYVDPSLWES GINAAMLSYS
LNAYHSDSKN GNRDSIYGAF NTGLNLGAWH LRARGNYNWS QDNGNSFDFQ DRYLQRDIPA
IRSQVIVGDA YTTGETFDSV NVRGMRLYSD SRMLPSALAS YAPIIRGVAN SNAKVTVTQN
GYKIYESTVP PGEFVIDDLS PSGFGSELVI TIEEADGSKR SFTQPFSSVV QLQRPGVGRW
DISAGQIIDD SLRHEPNMAQ ASFYYGFNNL FTGYTGVQIT DNDYMSGLLG LGINTSIGAF
AVDITHARTE IPDDKTYQGQ SYRITWNKLI EATSTSFNLA AYRYSTEDYL GLHDALALID
DANHLSVNDD KDTIRTYSRM KNQFTVSVNQ PLSFAYEDYG SLFLSGSWTN YWAGNNNRTE
YNVGYSKSVS WGNFSVNLQR SWNEDGNKDD AMYLNVSVPL ENIFGGKRKS SGFRNLSTQF
NTDFNGSHQL NVSSSGNSED NLIGYSVNTG YNLDKESENV ASVGGYLSYD STWGGFSASA
SASTDNRQQY SVSTDGGFVL HSGGLTFTNN SFGSNDTLVV IKAPGAKGAR VNNGTDEIDR
WGYAVSSSLS PYRENRVGLN IETLENDVEL KSTSATTVPR SGSIILASFE TDQGRSAVLN
ISTSNGKPIP FAAEVYQDEI MIGSMGQGGQ AFVRGINDSG ELIIRWFEDS RTINCKLHYQ
LPAQPETLGS TNTLLLNNLT CKLVNH
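
The gene sequence begins with TTG rather than ATG, yet the protein begins with M. This fits translation table 11 (the bacterial, archaeal and plant plastid code), where TTG is one of several alternative initiation codons that are read as methionine at the start of a CDS. The following is a minimal sketch of that rule, not the tool used to produce this record: `translate_cds` is a hypothetical helper, and the codon table is built by hand rather than taken from a library.

```python
# Codon table in the conventional T, C, A, G ordering; table 11 assigns the
# same amino acids as the standard code and differs in its start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDDDEEEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Initiation codons permitted by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a CDS (hypothetical helper), reading the start codon as M."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError("first codon is not a table 11 start codon")
    protein = ["M"]  # any valid initiation codon is translated as methionine
    for codon in codons[1:]:
        residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate_cds("TTGTATACGT TAACTCATCA GAAAAGCCGT"))  # MYTLTHQKSR
```

The output matches the first ten residues of the protein sequence listed above.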