Gene Information |
Locus tag | EcSMS35_3386 |
Symbol | |
ID | 6143195 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 3471312 |
End bp | 3472406 |
Gene Length | 1095 bp |
Protein Length | 364 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641618215 |
Product | pilus biogenesis initiator |
Protein accession | YP_001745364 |
Protein GI | 170684132 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
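The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon. Neither convention is stated in the record itself. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 3471312
end_bp = 3472406
gene_length_bp = 1095
protein_length_aa = 364

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: an unspliced CDS is made of whole codons, and the final codon is
# a stop codon that does not count toward the protein length.
codons, remainder = divmod(span, 3)
residues = codons - 1

print(span, residues, remainder)  # 1095 364 0
```

Under these assumptions the record is internally consistent: the span equals the listed gene length, and the codon count minus the stop codon equals the listed protein length.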
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 62 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGAAATC GATTAATTGC CGTGATTTTA TGTTTATTTG GCATGGTCGC GGGGGGGCAC GCTACGCCAA ATGTGACGGC TGAAATTACT TATGATTTGG CCGCTGGTAG GGCGGATTAT TACTTCTGGA ATAGGGAAGA TCCCCCCGCA GTAAGCTACA ACACGACATG GTCAAGTTAT AAATGCGATT TTCCTGATGT GCAGCAGACC TGTACCGCAT CAGGAAATTT ATCAACAGTG AAAATATATT TGACAGAAAA ACGCAGTGGA ATGCGTTGGC CCGTCAAACT CAAAGGCTAT GTTGATGCGG AGGTCTGGAA CCCGGGTGGA GTCTGTAACG GATGGTCGAC GCAAATCGCG TTAGCTAACG GTACGGGTTA TCAATGTAAA AGTGCTTCAG ACGGTTATAT ACAACATCTT GCGAGCGCAA AGCCGATGAC GCTCTATCTT GAACAGTCCG AAATGAAAAA TTTACCAATC GGTGGTGTTT GGGAAGGTTC GGTCAAATTG CAGTTTACTA ATCCGTCTAC GGATTATCGC GCCGATATTA CGCTTAATGT CCTCGATCCC AACCATATCG ACGTGTTCTT CCCGGAGTTC GCCCACGCTA CGCCACGCGT ACAACTGGAT TTGCATCCAA CAGGCAGCGT TAACGGCAAC AACTACGCGC AAGATCTGAC TATGTTGGAT ATGTGCCTGT ACGATGGCTT TAACGGTAAC GGTCTTAGCT ATGAAATTTT GCTAAAAGAT GAGGGAAGAA CGGCGGCAGG ACGCAGTAAT GGTGAGTTTT CGATTTATCG TCAGGGCGCG AGCTCAACGG ATGAAGGGGA GCGTATTGAT TACCGTGTCA AAATGTACGA CCCGGAATCA GGTGGGCAAA TCGATGTGCG CAACAATGAG AGCATGGTCT GGACCAACAT CAACCTTAAA CGTGTCCGCC CGGTGGTGCT TCCAGGCATT CGTTACGCTG TGATGTGTGT TCCCACACCG CTGACGCTGG TGGTCGACAA ATTTAACGTG ACGGCAAAAC AGGCGGGATA TTATATGGGT AAATTGTCGG TCATCTTTAC CCCGTCATTG CCGACAATCA ATTGA
|
Protein sequence | MRNRLIAVIL CLFGMVAGGH ATPNVTAEIT YDLAAGRADY YFWNREDPPA VSYNTTWSSY KCDFPDVQQT CTASGNLSTV KIYLTEKRSG MRWPVKLKGY VDAEVWNPGG VCNGWSTQIA LANGTGYQCK SASDGYIQHL ASAKPMTLYL EQSEMKNLPI GGVWEGSVKL QFTNPSTDYR ADITLNVLDP NHIDVFFPEF AHATPRVQLD LHPTGSVNGN NYAQDLTMLD MCLYDGFNGN GLSYEILLKD EGRTAAGRSN GEFSIYRQGA SSTDEGERID YRVKMYDPES GGQIDVRNNE SMVWTNINLK RVRPVVLPGI RYAVMCVPTP LTLVVDKFNV TAKQAGYYMG KLSVIFTPSL PTIN
|
| |
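The gene and protein sequences above are printed in blocks of ten characters. The following minimal sketch strips that spacing and translates short fragments copied from the record. It uses the codon-to-amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in which start codons it permits). Although the record lists the strand as "-", the gene sequence begins with ATG and ends with TGA, so it appears to be given already in coding orientation, and the sketch relies on that assumption. The helper names `clean` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate an in-frame DNA fragment, codon by codon."""
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("fragment length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First three blocks and last two blocks of the gene sequence in the record.
# Assumption: the tail fragment is in frame, which holds if every block
# before it is exactly ten bases long.
head = translate("ATGAGAAATC GATTAATTGC CGTGATTTTA")
tail = translate("CCGACAATCA ATTGA")

print(head)  # MRNRLIAVIL
print(tail)  # PTIN*
```

Both fragments agree with the listed protein sequence, which starts with MRNRLIAVIL and ends with PTIN. Passing the full gene sequence to `translate` and dropping the trailing `*` would allow the same comparison against the whole protein.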