Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | B21_00325 |
Symbol | yaiT |
ID | 8115031 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | + |
Start bp | 356037 |
End bp | 358943 |
Gene Length | 2907 bp |
Protein Length | 968 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 644846610 |
Product | hypothetical protein |
Protein accession | YP_002998183 |
Protein GI | 251783879 |
COG category | [M] Cell wall/membrane/envelope biogenesis [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3468] Type V secretory pathway, adhesin AidA |
TIGRFAM ID | [TIGR01414] outer membrane autotransporter barrel domain [TIGR03304] outer membrane insertion C-terminal signal |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCACTCCT GGAAAAAGAA ACTTGTAGTA TCACAATTAG CATTGGCTTG CACTCTGGCT ATCACCTCTC AGGCTAATGC AGCAAACTAT GATACCTGGA CTTATATCGA TAATCCCGTT ACAGCACTTG ATTGGGATCA TATGGATAAG GCAGGCACTG TAGATGGCAA CTATGTAAAC TATAGTGGTT TTGTCTATTA CAACAACACC AATGGTGATT TCGATCAGTC CTTTAACGGC GATACCGTTA ACGGCACGAT CTCAACCTAT TATTTGAACC ATGATTATGC AGACAGTACT GCTAATCAGC TTGATATCAG TAATTCAGTG ATTCACGGTT CGATTACTTC TATGCTGCCT GGCGGTTATT ATGATCGTTT TGATGCAGAT GGTAATAATC TGGGTGGATA TGATTTTTAC ACTGATGCGG TTGTTGATAC ACACTGGCGT GATGGTGATG TTTTCACTTT GAACATTGCT AACACTACTA TTGATGATGA TTATGAAGCT CTTTACTTCA CTGATTCTTA TAAAGATGGT GATGTAACCA AGCACACAAA TGAGACATTT GATACAAGTG AAGGCGTTGC TGTTAATCTT GATGTAGAAA GTAACATCAA TATTTCCAAT AACTCCCGCG TTGCAGGTAT TGCATTATCT CAAGGTAATA CTTACAACGA AACCTACACT ACCGAATCTC ATACTTGGGA TAACAATATC TCTGTAAAAG ATTCCACAGT GACTTCGGGT TCAAATTATA TCCTGGATAG CAATACTTAT GGCAAAACTG GTCACTTTGG CAATTCTGAT GAACCGAGTG ATTATGCTGG CCCGGGTGAT GTTGCAATGT CCTTTACTGC TTCAGGTTCC GACTATGCGA TGAAGAACAA TGTATTCCTC AGCAATTCAA CGCTGATGGG TGATGTTGCC TTTACCAGCA CCTGGAATAG TAATTTTGAT CCGAATGGTC ATGATTCCAA CGGTGACGGG GTGAAAGATA CCAACGGGGG TTGGACTGAT GATAGCCTCA ACGTTGATGA ACTAAATCTC ACTCTCGATA ACGGAAGCAA GTGGGTTGGT CAGGCAATTT ATAACGTTGC TGAAACGTCA GCAATGTATG ATGTTGCTAC AAACAGCCTT ACTCCTGATG CAACATATGA AAACAATGAC TGGAAACGTG TTGTTGATGA CAAGGTCTTC CAGAGCGGTG TATTTAACGT AGCGTTGAAT AACGGTTCTG AATGGGATAC TACAGGTCGT TCCATCGTTG ATACCTTGAC AGTTAATAAT GGTTCTCAGG TTAATGTTTC GGAATCTAAA TTAACTTCAG ATACTATCGA TTTAACTAAC GGTTCTTCGC TGAACATTGG TGAAGATGGC TACGTTGATA CCGATCATCT GACTATTAAC TCCTACAGTA CTGTTGCGTT GACCGAATCT ACTGGGTGGG GGGCTGATTA CAACCTGTAC GCCAATACTA TCACCGTAAC TAACGGCGGT GTATTGGATG TGAACGTTGA TCAGTTCGAT ACTGAAGCTT TCCGTACTGA CAAACTGGAA CTGACCAGCG GCAACATCGC TGACCATAAC GGTAACGTAG TATCTGGTGT GTTCGATATC CATAGCAGCG ATTACGTTCT GAACGCTGAT CTGGTGAACG ACCGTACCTG GGATACTTCC AAGTCTAACT ACGGTTACGG TATTGTTGCT ATGAACTCTG ATGGTCACCT GACTATCAAC GGTAACGGCG ACGTAGACAA CGGTACTGAA CTGGATAACA GCTCTGTAGA CAATGTTGTT GCTGCAACCG GTAACTACAA AGTTCGTATC GACAACGCAA CTGGCGCTGG CGCTATCGCT GATTACAAAG ATAAAGAAAT TATCTACGTA AACGACGTCA ACAGCAACGC GACCTTCTCT GCTGCTAACA AAGCTGACCT GGGTGCATAC ACCTATCAGG CTGAACAGCG CGGTAACACC GTTGTTCTGC AACAGATGGA GCTGACCGAC TACGCTAACA TGGCGCTGAG CATCCCGTCT GCGAACACCA ATATCTGGAA CCTGGAACAA GACACCGTTG GTACTCGTCT GACCAACTCT CGTCATGGCC TGGCTGATAA CGGCGGCGCA TGGGTAAGCT ACTTCGGTGG TAACTTCAAC GGCGACAACG GCACCATCAA CTATGATCAG GATGTTAACG GCATCATGGT CGGTGTTGAT ACCAAAATTG ACGGTAACAA CGCTAAGTGG ATCGTCGGTG CGGCTGCAGG CTTCGCTAAA GGTGACATGA ATGACCGTTC TGGTCAGGTG GATCAAGACA GCCAGACTGC CTACATCTAC TCTTCTGCTC ACTTCGCGAA CAACGTCTTT GTTGATGGTA GCTTGAGCTA CTCTCACTTC AACAACGACC TGTCTGCAAC CATGAGCAAC GGTACTTACG TTGACGGTAG CACCAACTCC GACGCTTGGG GCTTCGGTTT GAAAGCCGGT TACGACTTCA AACTGGGTGA TGCTGGTTAC GTGACTCCTT ACGGCAGCGT TTCTGGTCTG TTCCAGTCTG GTGATGACTA CCAGCTGAGC AACGACATGA AAGTTGACGG TCAGTCTTAC GACAGCATGC GTTATGAACT GGGTGTAGAT GCAGGTTATA CCTTCACCTA CAGCGAAGAT CAGGCTCTGA CTCCGTACTT CAAACTGGCT TACGTCTACG ACGACTCTAA CAACGATAAC GATGTGAACG GCGATTCCAT CGATAACGGT ACTGAAGGGT CTGCGGTACG TGTTGGTCTG GGTACTCAGT TCAGCTTCAC CAAGAACTTC AGCGCCTATA CCGATGCTAA CTACCTCGGT GGTGGTGACG TAGATCAAGA CTGGTCCGCG AACGTGGGTG TTAAATATAC CTGGTAA
|
Protein sequence | MHSWKKKLVV SQLALACTLA ITSQANAANY DTWTYIDNPV TALDWDHMDK AGTVDGNYVN YSGFVYYNNT NGDFDQSFNG DTVNGTISTY YLNHDYADST ANQLDISNSV IHGSITSMLP GGYYDRFDAD GNNLGGYDFY TDAVVDTHWR DGDVFTLNIA NTTIDDDYEA LYFTDSYKDG DVTKHTNETF DTSEGVAVNL DVESNINISN NSRVAGIALS QGNTYNETYT TESHTWDNNI SVKDSTVTSG SNYILDSNTY GKTGHFGNSD EPSDYAGPGD VAMSFTASGS DYAMKNNVFL SNSTLMGDVA FTSTWNSNFD PNGHDSNGDG VKDTNGGWTD DSLNVDELNL TLDNGSKWVG QAIYNVAETS AMYDVATNSL TPDATYENND WKRVVDDKVF QSGVFNVALN NGSEWDTTGR SIVDTLTVNN GSQVNVSESK LTSDTIDLTN GSSLNIGEDG YVDTDHLTIN SYSTVALTES TGWGADYNLY ANTITVTNGG VLDVNVDQFD TEAFRTDKLE LTSGNIADHN GNVVSGVFDI HSSDYVLNAD LVNDRTWDTS KSNYGYGIVA MNSDGHLTIN GNGDVDNGTE LDNSSVDNVV AATGNYKVRI DNATGAGAIA DYKDKEIIYV NDVNSNATFS AANKADLGAY TYQAEQRGNT VVLQQMELTD YANMALSIPS ANTNIWNLEQ DTVGTRLTNS RHGLADNGGA WVSYFGGNFN GDNGTINYDQ DVNGIMVGVD TKIDGNNAKW IVGAAAGFAK GDMNDRSGQV DQDSQTAYIY SSAHFANNVF VDGSLSYSHF NNDLSATMSN GTYVDGSTNS DAWGFGLKAG YDFKLGDAGY VTPYGSVSGL FQSGDDYQLS NDMKVDGQSY DSMRYELGVD AGYTFTYSED QALTPYFKLA YVYDDSNNDN DVNGDSIDNG TEGSAVRVGL GTQFSFTKNF SAYTDANYLG GGDVDQDWSA NVGVKYTW
|
| |