Gene EcolC_2655 details

Gene Information

Locus tag: EcolC_2655
Symbol:
ID: 6064444
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 2907483
End bp: 2908526
Gene Length: 1044 bp
Protein Length: 347 aa
Translation table: 11
GC content: 49%
IMG OID: 641602062
Product: fimbrial protein
Protein accession: YP_001725612
Protein GI: 170020658
COG category: [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3539] P pilus assembly protein, pilin FimA
TIGRFAM ID:
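
The length fields above are consistent with each other, and the arithmetic can be checked with a short sketch. It assumes the Start bp and End bp coordinates are 1-based and inclusive (the record does not say so, but the numbers only agree under that reading) and that the CDS ends in a single stop codon. The function names are hypothetical and used only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a complete CDS: one codon per residue, minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2907483, 2908526)
print(length)                  # 1044
print(protein_length(length))  # 347
```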


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0263155
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0042534
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGTCATTAC TACGACTATT TTTTGCCGCC GTCTTAATGC TATGGTGCGC TCAAACCGCT 
GCTTATAGCG GGCAGTGTCA TACTACTCAG GGGAATCCGT ATATTGGCGT CAATTTTGGC
GTTAAAACCC TGGAGGAAGA AGCAAATACG GCAGGGGTAG TTAAAGACAA ATTTTATCAG
TGGAACGAAT CGAATGATTA TTATGTTTCC TGTGATTGCG ATAAAGACAA TGTCAGAAGT
GGCCGATGGG CATTCGCCGC GGATTCACCG TTAGTCTATT TAGGCGACAA CTGGTACAAA
ATTAATGACT ATCTTGCCGC CAAAGTTTTA TTGCAGGTTA AAGGCAGTTC TCCTACTGCG
GTTCCTTTCG AAAACGTGGG CACAGGGGGG GATACCCGAT GGCATATTTG CGACCCTGGC
GGTCAACGTT TAGGTGGGCA GGGGGCAAGC GGTAATAGCG GTAGCTTTTC CCTGAAAATA
TTGCAGCCGT TCGTTGGCTC GGTCGTCATT CCTCCTATGG CGCTGGCGCG ATTATATGAA
TGCTACAACA TACCCGCAGG TGATTCCTGC ACGACTACAG GTTCACCGGT TTTAGTGTAT
TACCTGTCTG GTACGATCAA TTCACTTGGC TCATGTTCCG TCAATGCCGG AGAGACAATT
GAAGTTGATT TAGGTGATGT CTTCGCTGCC AATTTCCGTG TTGTAGGGCA TAAACCTCTT
GGGGCCAGAA CAGCAGAACT TGCAATTCCA GTCAGGTGTA ACACGGGAAA CGCGGGATTA
GTTAATGTCA ACCTGAGTCT GACGGCAACC ACAGACCCCA GCTATCCCCA GGCGATTAAG
ACGTCACGTC CTGGCGTGGG CGTGGTGGTG ACCGATAGCC AGAACAACAT TATTTCCCCT
GCTGGTGGAA CATTACCGCT CTCTATTCCT GATGATGCAG ACAGTATCGC GCGAATGAAT
GTCTATCCAG TCAGCACGAC AGGTGTACCA CCAGAAACCG GGCGATTTGA AGCCACGGCA
ACGGTGAGAA TAAATTTTGA TTAA
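
The GC content field of a record like this can be recomputed from the gene sequence. The sketch below assumes GC content means the percentage of G and C bases over the full sequence, rounded to a whole percent; the record does not state how its value was derived. The helper name `gc_content` is hypothetical.

```python
def gc_content(dna: str) -> float:
    """Percentage of G and C bases; whitespace from 10-base blocks is ignored."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First 20 bases of the gene sequence: 7 of 20 are G or C.
print(round(gc_content("GTGTCATTAC TACGACTATT")))  # 35
```

Passing the full 1044 bp sequence, with or without the block spacing, gives the whole-gene figure to compare against the record.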
 
Protein sequence
MSLLRLFFAA VLMLWCAQTA AYSGQCHTTQ GNPYIGVNFG VKTLEEEANT AGVVKDKFYQ 
WNESNDYYVS CDCDKDNVRS GRWAFAADSP LVYLGDNWYK INDYLAAKVL LQVKGSSPTA
VPFENVGTGG DTRWHICDPG GQRLGGQGAS GNSGSFSLKI LQPFVGSVVI PPMALARLYE
CYNIPAGDSC TTTGSPVLVY YLSGTINSLG SCSVNAGETI EVDLGDVFAA NFRVVGHKPL
GARTAELAIP VRCNTGNAGL VNVNLSLTAT TDPSYPQAIK TSRPGVGVVV TDSQNNIISP
AGGTLPLSIP DDADSIARMN VYPVSTTGVP PETGRFEATA TVRINFD
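
The protein sequence can be reproduced from the gene sequence using translation table 11 (the bacterial, archaeal and plant plastid code). The sketch below is self-contained rather than tied to any library. It assumes the codon assignments of table 11, which match the standard code, and it reads an alternative start codon such as GTG as methionine in the first position only. The names `CODON_TABLE`, `START_CODONS` and `translate_cds` are hypothetical.

```python
from itertools import product

# NCBI codon ordering: bases in T, C, A, G order, first base varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons recognised by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a complete CDS; spaces and line breaks in the input are ignored."""
    seq = "".join(dna.split()).upper()
    if not seq or len(seq) % 3 != 0:
        raise ValueError("CDS length must be a non-zero multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    protein = [CODON_TABLE[codon] for codon in codons]
    if codons[0] in START_CODONS:
        protein[0] = "M"   # any valid start codon is read as methionine
    if protein[-1] == "*":
        protein.pop()      # drop the terminal stop codon
    return "".join(protein)


# First 30 bases of the gene sequence, starting with the GTG start codon.
print(translate_cds("GTGTCATTAC TACGACTATT TTTTGCCGCC"))  # MSLLRLFFAA
```

Translating the full gene sequence the same way should yield a 347-residue protein, since the final TAA codon is dropped as the stop.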