Gene EcolC_2152 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_2152 
Symbol 
ID6065540 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp2353865 
End bp2355013 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content44% 
IMG OID641601559 
Productfimbrial biogenesis outer membrane usher protein 
Protein accessionYP_001725118 
Protein GI170020164 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3188] P pilus assembly protein, porin PapC 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.883478 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGGTT ACACCGTCAA GCCTCCTACC GGAGACACCA ATGAGCAGAC ACAATTTATT 
GATTATTTTA ATCTGTTCTA CAGTAAGCGT GGTCAGGAAC AAATAAGCAT CTCTCAGCAG
CTTGGAAATT ACGGTACGAC ATTTTTCAGT GCCAGTCGCC AAAGTTACTG GAACACGTCA
CGCAGCGACC AGCAAATATC ATTTGGATTA AATGTGCCGT TTGGTGATAT TACGACTTCG
CTGAATTACA GCTATTCCAA TAATATATGG CAAAACGATC GGGATCATTT ACTCGCTTTT
ACGCTTAATG TTCCCTTCAG TCATTGGATG CGTACAGACA GTCAGTCGGC ATTTCGTAAT
TCAAACGCCA GTTACAGTAT GTCAAACGAT TTGAAAGGCG GCATGACCAA TCTATCGGGG
GTTTATGGCA CTCTGCTGCC GGATAATAAC CTGAATTATA GCGTTCAGGT CGGTAACACC
CACGGAGGTA ATACATCGTC TGGCACCAGT GGTTACAGTT CTCTTAATTA TCGTGGAGCT
TATGGTAATA CTAATGTCGG TTACAGTCGG AGTGGTGACA GCAGCCAGAT TTATTACGGA
ATGAGTGGTG GGATTATTGC TCATGCTGAT GGCATCACCT TTGGACAGCC GCTGGGCGAC
ACAATGGTTC TGGTTAAGGC TCTTGGTGCT GATAATGTCA AAATAGAGAA CCAGACCGGA
ATTCATACCG ACTGGCGTGG CTATGCCATA TTACCATTTG CGACAGAATA TAGAGAAAAC
CGTGTTGCTC TTAACGCGAA TTCCCTTGCA GATAATGTTG AACTGGATGA AACCGTGGTC
ACTGTCATCC CAACTCACGG TGCTATTGCC AGAGCAACAT TTAATGCACA AATCGGCGGG
AAAGTATTAA TGACGTTGAA GTACGGTAAT AAGAGCGTTC CATTCGGTGC AATTGTCACA
CACGGAGAGA ATAAAAATGG CAGCATTGTC GCGGAAAATG GTCAGGTTTA TCTGACTGGA
CTTCCACAGT CAGGGCAATT ACAGGTTTCA TGGGGCAAAG ATAAAAACTC AAACTGTATT
GTCGAGTACA AGCTTCCTGA AGTTTCTCCT GGTACCTTAC TGAACCAGCA GACAGCAATC
TGTCGCTAA
 
Protein sequence
MSGYTVKPPT GDTNEQTQFI DYFNLFYSKR GQEQISISQQ LGNYGTTFFS ASRQSYWNTS 
RSDQQISFGL NVPFGDITTS LNYSYSNNIW QNDRDHLLAF TLNVPFSHWM RTDSQSAFRN
SNASYSMSND LKGGMTNLSG VYGTLLPDNN LNYSVQVGNT HGGNTSSGTS GYSSLNYRGA
YGNTNVGYSR SGDSSQIYYG MSGGIIAHAD GITFGQPLGD TMVLVKALGA DNVKIENQTG
IHTDWRGYAI LPFATEYREN RVALNANSLA DNVELDETVV TVIPTHGAIA RATFNAQIGG
KVLMTLKYGN KSVPFGAIVT HGENKNGSIV AENGQVYLTG LPQSGQLQVS WGKDKNSNCI
VEYKLPEVSP GTLLNQQTAI CR