Gene B21_03411 details


Gene Information

Locus tag: B21_03411
Symbol: ybl151
ID: 8116251
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21
Kingdom: Bacteria
Replicon accession: NC_012892
Strand:
Start bp: 3639502
End bp: 3644352
Gene Length: 4851 bp
Protein Length: 1616 aa
Translation table: 11
GC content: 51%
IMG OID: 644849584
Product: hypothetical protein
Protein accession: YP_003001157
Protein GI: 251786853
COG category: [U] Intracellular trafficking, secretion, and vesicular transport; [W] Extracellular structures
COG ID: [COG5295] Autotransporter adhesin
TIGRFAM ID:
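The coordinate and length fields above are related by simple arithmetic. The following is a minimal Python sketch of that relationship, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. Both are assumptions about how the record was generated, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(3639502, 3644352)
print(length_bp)                  # 4851
print(protein_length(length_bp))  # 1616
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of this record.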


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACAAAA TATTTAAAGT TATCTGGAAC CCTGCCACAG GGAATTATAC TGTTACCAGC 
GAAACGGCAA AAAGCCGTGG CAAGAAATCT GGGCGCAGTA AGCTGTTAAT TTCTGCGCTG
GTTGCGGGTG GGTTGTTGTC GTCGTTTGGG GCATTGGCGA ATGCCGGGAA TGACAACGGT
CAGGGTGTTG ATTACGGTAG TGGATCAGCT GGCGACGGCT GGGTTGCTAT AGGCAAAGGG
GCGAAAGCAA ATACTTTTAT GAACACCAGT GGTTCCAGTA CTGCTGTGGG TTATGACGCT
ATAGCTGAAG GCCAATATAG CTCTGCCATC GGGTCAAAAC CCCATGCGAT TGGCGGTGCA
TCAATGGCCT TTGGGGTTAG TGCAATATCA GAAGGCGATA GAAGTATAGC ACTGGGTGCC
TCTTCGTATT CATTGGGCCA ATACTCAATG GCCCTCGGCC GTTATTCAAA AGCATTGGGT
AAATTGTCTA TTGCTATGGG GGACTCTTCC AAAGCGGAAG GAGCAAACGC CATTGCCCTG
GGAAATGCCA CTAAAGCTAC TGAGATTATG AGTATTGCTC TTGGCGACAC CGCCAATGCG
TCAAAAGCGT ATTCAATGGC GCTGGGAGCA AGTAGCGTCG CATCTGAAGA AAACGCAATT
GCCCTGGGGC GTAGCAGTGT AGCTAGCGGT ACTGACAGCC TCGCATTTGG CAGACAATCA
CTTGCCAGCG CAGCGAACGC TATTGCGATA GGTGCTGAGA CCGAAGCCGC TGAAAATGCA
ACTGCTATTG GCAATAATGC GAAGGCAAAA GGGACTAATA GCATGGCAAT GGGGTTCGGA
AGCCTTGCCG ATAAAGTCAA TACTATCGCA TTAGGAAATG GCAGCCAGGC TCTGGCAGAT
AATGCAATCG CCATAGGCCA GGGCAACAAA GCTGATGGCG TGGATGCCAT CGCTCTGGGT
AATGGTAGCC AGTCGAGAGG CTTAAACACC ATTGCCTTAG GCACAGCCAG TAATGCAACT
GGTGATAAGA GTCTTGCGCT TGGTAGTAAT AGCAGTGCCA ACGGTATTAA CTCTGTCGCG
CTGGGCGCAG ATTCCATTGC GGATTTAGAC AATACCGTCT CTGTCGGCAA TAGTTCATTA
AAACGCAAGA TCGTTAATGT GAAAAATGGC GCGATCAAGT CTGACAGTTA CGATGCCATT
AATGGTTCAC AGCTTTATGC CATTAGCGAC TCGGTAGCAA AAAGGCTTGG AGGAGGGGCT
GCAGTAGATG TTGATGACGG TACTGTTACA GCACCAACCT ACAATTTAAA AAATGGTAGC
AAAAATAACG TAGGGGCTGC GCTCGCTGTA CTTGATGAAA ACACCCTGCA ATGGGACCAA
ACCAAAGGCA AATACAGCGC TGCTCATGGT ACTAGTAGCC CAACTGCCAG CGTAATCACC
GATGTTGCGG ATGGCACGAT TTCAGCCTCC AGTAAGGATG CGGTTAACGG TTCCCAACTG
AAAGCTACCA ATGACGATGT CGAAGCCAAC ACCGCCAATA TCGCTACTAA TACCAGCAAC
ATTGCCACGA ATACAGCAAG TATTGCCACC AATACCACCA ATATCACCAA CCTGACGGAT
TCCGTTGGTG ACCTTCAGGC TGATGCCCTG CTCTGGAACG AAACTAAAAA GGCATTCAGT
GCAGCTCACG GCCAGGATAC CACCAGCAAA ATCACCAACG TTAAAGATGC CGACCTGACG
GCTGACAGCA CTGATGCTGT TAACGGCTCT CAGCTGAAAA CCACCAACGA TGCTGTGGCG
ACGAATACCA CCAATATCGC CAATAACACT TCCAATATTG CCACTAACAC CACCAACATC
TCTAACCTGA CTGAGACGGT GACTAATCTT GGTGAGGATG CGCTGAAATG GGATAAGGAC
AATGGTGTAT TCACGGCAGC TCATGGCACC GAGACCACCA GCAAAATCAC CAACGTTAAA
GATGGCGACC TGACGACTGG CAGCACCGAT GCCGTTAACG GCTCTCAGCT GAAAACCACC
AACGATGCCG TGGCGACGAA TACCACCAAT ATCGCCACTA ACACCACCAA CATCTCTAAT
CTGACTGAGA CGGTGACTAA TCTTGGTGAG GATGCGCTGA AATGGGATAA GGACAATGGT
GTCTTCACTG CAGCTCATGG CAACAATACC GCCAGCAAAA TCACCAATAT CCTGGACGGC
ACAGTCACTG CAACCAGTTC CGATGCCATT AACGGTAGCC AGCTTTATGA CTTAAGCAGC
AATATCGCCA CCTACTTCGG CGGCAATGCT TCTGTGAATA CTGACGGTGT GTTTACCGGT
CCAACCTACA AAATCGGTGA AACAAATTAT TATAACGTCG GCGATGCACT GGCTGCGATT
AACTCCTCAT TTAGCACGTC TCTCGGCGAT GCTCTGCTTT GGGATGCCAC CGCAGGTAAA
TTCAGTGCCA AACACGGTAC TAATGGTGAC GCAAGCGTGA TCACTGATGT CGCAGATGGT
GAAATTTCAG ACTCCAGTTC TGACGCAGTA AACGGCTCAC AACTCCACGG CGTGAGCAGT
TATGTTGTTG ATGCGCTGGG GGGTGGTGCC GAAGTCAATG CAGACGGCAC CATCACTGCG
CCGACGTACA CCATTGCTAA TGCTGATTAC GATAATGTCG GTGATGCCCT GAATGCTATC
GATACCACTC TTGACGACGC TCTGCTCTGG GATGCGGACG CCGGTGAAAA TGGTGCATTT
AGCGCCGCTC ACGGAAAAGA TAAAACTGCC AGTGTAATCA CTAACGTCGC TAACGGTGCA
ATCTCTGCTG CCAGCAGCGA CGCGATTAAC GGCTCACAAC TCTATACCAC CAATAAGTAC
ATCGCTGATG CGCTGGGTGG TGACGCAGAA GTCAACGCCG ACGGCACCAT CACCGCACCG
ACTTACACCA TTGCGAACAC CGAGTACAAC AACGTCGGTG ACGCCCTGGA TGCGCTTGAT
GATAACGCCC TGCTGTGGGA TGAGACTGCC AATGGCGGTG CTGGAGCCTA CAATGCCAGC
CATGACGGTA AAGCCAGCAT CATCACTAAT GTCGCTAATG GCAGTATTAG TGAGGACAGT
ACCGATGCAG TGAACGGTTC TCAGTTGAAT GCGACGAATA TGATGATTGA GCAGAACACC
CAAATTATCA ATCAGCTCGC TGGTAACACC GACGCAACCT ATATCCAAGA AAACGGTGCG
GGTATTAACT ATGTGCGTAC TAACGACGAC GGCTTAGCGT TCAACGACGC CAGCGCACAG
GGTGTTGGCG CTACAGCTAT AGGTTATAAC TCTGTCGCCA AAGGCGATAG CAGCGTAGCT
ATTGGTCAGG GCAGCTACAG CGACGTTGAT ACGGGTATCG CCCTGGGTAG CAGCTCTGTT
TCCAGCCGAG TGATTGCCAA AGGCTCCCGT GACACCAGCA TAACGGAAAA TGGCGTTGTT
ATTGGTTACG ACACCACGGA TGGCGAACTG CTCGGTGCAT TGTCTATCGG TGATGACGGT
AAATATCGTC AAATCATCAA CGTAGCCGAT GGTTCCGAAG CCCATGACGC CGTTACGGTT
CGTCAATTGC AGAATGCGAT TGGTGCTGTA GCAACGACAC CGTCCAAATA CTTCCATGCT
AATTCAACGG AAGAAGATTC ACTGGCTGTC GGTGAAGACT CGCTGGCAAT GGGTGCGAAA
ACCATTGTTA ATGGTGATGC AGGGATCGGT ATTGGCCTGA ACACTCTGGT GTTAACTGAT
GCAATCAACG GTATTGCTAT CGGTAGCAAC GCGAGTGCAA ATCATGCAAA CAGTATTGCG
ATGGGTAGTG GTTCCCAGAC CACCCGTGGT GCGCAGACGG ACTACACCGC CTACAACATG
GACGCGCCGC AGAATTCTGT CGGTGAATTC TCTGTCGGCA GCGAAGACGG TCAACGTCAG
ATCACCAACG TCGCGGCTGG TTCAGCGGAT ACCGATGCGG TTAACGTAGG TCAGTTGAAA
GTCACTGATG AGCGCGTAGC GCAAAATACC CAAAGCATTA CTAACCTGAA CAATCAGGTC
ACTAATCTGG ATACTCGCGT TACTAATATC GAAAACGGTA TTGGCGACAT TGTAACCACC
GGTAGCACCA AGTACTTCAA GACCAACACC GATGGCGTAG ATGCCAACGC CCAGGGTAAA
GATAGCGTTG CTATTGGTTC TGGGTCCATT GCTGCCGCTG ACAACAGCGT CGCACTGGGT
ACCGGTTCCG TTGCAGACGA AGAAAATACA ATCTCTGTAG GTTCTTCTAC TAACCAACGC
CGTATTACTA ACGTTGCCGC AGGTAAAAAT GCTACCGATG CTGTTAACGT TGCGCAGTTG
AAGTCTTCTG AGGCGGGCGG CGTGCGTTAC GACACCAAAG CTGATGGTTC TATCGACTAT
AGCAATATCA CCCTCGGTGG CGGCAACGGT GGTACGACTC GTATCAGCAA CGTCTCCGCT
GGCGTCAACA ACAACGACGC GGTGAACTAC GCGCAGTTGA AGCAAAGCGT GCAGGAAACG
AAGCAATACA CCGATCAGCG GATGGTTGAG ATGGATAACA AACTGTCTAA AACCGAAAGC
AAGTTGAGTG GTGGTATCGC TTCTGCAATG GCAATGACCG GTCTGCCGCA GGCTTATACA
CCGGGTGCCA GCATGGCTTC TATTGGTGGC GGTACTTACA ACGGTGAATC GGCAGTTGCT
TTAGGTGTAT CGATGGTGAG CGCCAATGGT CGTTGGGTCT ACAAATTACA AGGTAGTACC
AATAGCCAGG GTGAATACTC CGCCGCACTC GGTGCCGGTA TTCAGTGGTA A
 
Protein sequence
MNKIFKVIWN PATGNYTVTS ETAKSRGKKS GRSKLLISAL VAGGLLSSFG ALANAGNDNG 
QGVDYGSGSA GDGWVAIGKG AKANTFMNTS GSSTAVGYDA IAEGQYSSAI GSKPHAIGGA
SMAFGVSAIS EGDRSIALGA SSYSLGQYSM ALGRYSKALG KLSIAMGDSS KAEGANAIAL
GNATKATEIM SIALGDTANA SKAYSMALGA SSVASEENAI ALGRSSVASG TDSLAFGRQS
LASAANAIAI GAETEAAENA TAIGNNAKAK GTNSMAMGFG SLADKVNTIA LGNGSQALAD
NAIAIGQGNK ADGVDAIALG NGSQSRGLNT IALGTASNAT GDKSLALGSN SSANGINSVA
LGADSIADLD NTVSVGNSSL KRKIVNVKNG AIKSDSYDAI NGSQLYAISD SVAKRLGGGA
AVDVDDGTVT APTYNLKNGS KNNVGAALAV LDENTLQWDQ TKGKYSAAHG TSSPTASVIT
DVADGTISAS SKDAVNGSQL KATNDDVEAN TANIATNTSN IATNTASIAT NTTNITNLTD
SVGDLQADAL LWNETKKAFS AAHGQDTTSK ITNVKDADLT ADSTDAVNGS QLKTTNDAVA
TNTTNIANNT SNIATNTTNI SNLTETVTNL GEDALKWDKD NGVFTAAHGT ETTSKITNVK
DGDLTTGSTD AVNGSQLKTT NDAVATNTTN IATNTTNISN LTETVTNLGE DALKWDKDNG
VFTAAHGNNT ASKITNILDG TVTATSSDAI NGSQLYDLSS NIATYFGGNA SVNTDGVFTG
PTYKIGETNY YNVGDALAAI NSSFSTSLGD ALLWDATAGK FSAKHGTNGD ASVITDVADG
EISDSSSDAV NGSQLHGVSS YVVDALGGGA EVNADGTITA PTYTIANADY DNVGDALNAI
DTTLDDALLW DADAGENGAF SAAHGKDKTA SVITNVANGA ISAASSDAIN GSQLYTTNKY
IADALGGDAE VNADGTITAP TYTIANTEYN NVGDALDALD DNALLWDETA NGGAGAYNAS
HDGKASIITN VANGSISEDS TDAVNGSQLN ATNMMIEQNT QIINQLAGNT DATYIQENGA
GINYVRTNDD GLAFNDASAQ GVGATAIGYN SVAKGDSSVA IGQGSYSDVD TGIALGSSSV
SSRVIAKGSR DTSITENGVV IGYDTTDGEL LGALSIGDDG KYRQIINVAD GSEAHDAVTV
RQLQNAIGAV ATTPSKYFHA NSTEEDSLAV GEDSLAMGAK TIVNGDAGIG IGLNTLVLTD
AINGIAIGSN ASANHANSIA MGSGSQTTRG AQTDYTAYNM DAPQNSVGEF SVGSEDGQRQ
ITNVAAGSAD TDAVNVGQLK VTDERVAQNT QSITNLNNQV TNLDTRVTNI ENGIGDIVTT
GSTKYFKTNT DGVDANAQGK DSVAIGSGSI AAADNSVALG TGSVADEENT ISVGSSTNQR
RITNVAAGKN ATDAVNVAQL KSSEAGGVRY DTKADGSIDY SNITLGGGNG GTTRISNVSA
GVNNNDAVNY AQLKQSVQET KQYTDQRMVE MDNKLSKTES KLSGGIASAM AMTGLPQAYT
PGASMASIGG GTYNGESAVA LGVSMVSANG RWVYKLQGST NSQGEYSAAL GAGIQW
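
The sequences above are printed in blocks of ten characters, sixty per line. The following is a minimal Python sketch of how such block-formatted text could be normalized before checking it against the length and GC content fields. The helper names are hypothetical, and the example uses only the first line of the gene sequence rather than the full record.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First line of the gene sequence, copied from the record above.
first_line = "ATGAACAAAA TATTTAAAGT TATCTGGAAC CCTGCCACAG GGAATTATAC TGTTACCAGC"
dna = clean_sequence(first_line)
print(len(dna))            # 60
print(dna.startswith("ATG"))  # True
```

Applying `clean_sequence` to the whole gene sequence and passing the result to `gc_percent` would give a value to compare with the GC content field.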