Gene EcSMS35_3386 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3386 
Symbol 
ID6143195 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3471312 
End bp3472406 
Gene Length1095 bp 
Protein Length364 aa 
Translation table11 
GC content48% 
IMG OID641618215 
Productpilus biogenesis initiator 
Protein accessionYP_001745364 
Protein GI170684132 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones62 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAAATC GATTAATTGC CGTGATTTTA TGTTTATTTG GCATGGTCGC GGGGGGGCAC 
GCTACGCCAA ATGTGACGGC TGAAATTACT TATGATTTGG CCGCTGGTAG GGCGGATTAT
TACTTCTGGA ATAGGGAAGA TCCCCCCGCA GTAAGCTACA ACACGACATG GTCAAGTTAT
AAATGCGATT TTCCTGATGT GCAGCAGACC TGTACCGCAT CAGGAAATTT ATCAACAGTG
AAAATATATT TGACAGAAAA ACGCAGTGGA ATGCGTTGGC CCGTCAAACT CAAAGGCTAT
GTTGATGCGG AGGTCTGGAA CCCGGGTGGA GTCTGTAACG GATGGTCGAC GCAAATCGCG
TTAGCTAACG GTACGGGTTA TCAATGTAAA AGTGCTTCAG ACGGTTATAT ACAACATCTT
GCGAGCGCAA AGCCGATGAC GCTCTATCTT GAACAGTCCG AAATGAAAAA TTTACCAATC
GGTGGTGTTT GGGAAGGTTC GGTCAAATTG CAGTTTACTA ATCCGTCTAC GGATTATCGC
GCCGATATTA CGCTTAATGT CCTCGATCCC AACCATATCG ACGTGTTCTT CCCGGAGTTC
GCCCACGCTA CGCCACGCGT ACAACTGGAT TTGCATCCAA CAGGCAGCGT TAACGGCAAC
AACTACGCGC AAGATCTGAC TATGTTGGAT ATGTGCCTGT ACGATGGCTT TAACGGTAAC
GGTCTTAGCT ATGAAATTTT GCTAAAAGAT GAGGGAAGAA CGGCGGCAGG ACGCAGTAAT
GGTGAGTTTT CGATTTATCG TCAGGGCGCG AGCTCAACGG ATGAAGGGGA GCGTATTGAT
TACCGTGTCA AAATGTACGA CCCGGAATCA GGTGGGCAAA TCGATGTGCG CAACAATGAG
AGCATGGTCT GGACCAACAT CAACCTTAAA CGTGTCCGCC CGGTGGTGCT TCCAGGCATT
CGTTACGCTG TGATGTGTGT TCCCACACCG CTGACGCTGG TGGTCGACAA ATTTAACGTG
ACGGCAAAAC AGGCGGGATA TTATATGGGT AAATTGTCGG TCATCTTTAC CCCGTCATTG
CCGACAATCA ATTGA
 
Protein sequence
MRNRLIAVIL CLFGMVAGGH ATPNVTAEIT YDLAAGRADY YFWNREDPPA VSYNTTWSSY 
KCDFPDVQQT CTASGNLSTV KIYLTEKRSG MRWPVKLKGY VDAEVWNPGG VCNGWSTQIA
LANGTGYQCK SASDGYIQHL ASAKPMTLYL EQSEMKNLPI GGVWEGSVKL QFTNPSTDYR
ADITLNVLDP NHIDVFFPEF AHATPRVQLD LHPTGSVNGN NYAQDLTMLD MCLYDGFNGN
GLSYEILLKD EGRTAAGRSN GEFSIYRQGA SSTDEGERID YRVKMYDPES GGQIDVRNNE
SMVWTNINLK RVRPVVLPGI RYAVMCVPTP LTLVVDKFNV TAKQAGYYMG KLSVIFTPSL
PTIN