Gene B21_03331 details


Gene Information

Locus tag: B21_03331
Symbol: bcsC
ID: 8112572
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21
Kingdom: Bacteria
Replicon accession: NC_012892
Strand:
Start bp: 3546099
End bp: 3549572
Gene Length: 3474 bp
Protein Length: 1157 aa
Translation table: 11
GC content: 57%
IMG OID: 644849506
Product: hypothetical protein
Protein accession: YP_003001079
Protein GI: 251786775
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG5010] Flp pilus assembly protein TadD, contains TPR repeats
TIGRFAM ID:
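
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers introduced only for this illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(3546099, 3549572)
print(length_bp)                  # 3474
print(protein_length(length_bp))  # 1157
```

Both results match the Gene Length and Protein Length fields of the record.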


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGCAAAT TCACACTAAA CATATTCACG CTTTCCCTCG GTCTGACCGT CATGCCGATG 
GTCGAGGCAG CACCAACCGC TCAGCAACAG TTGCTGGAGC AAGTTCGGTT AGGCGAAGCG
ACCCATCGTG AAGATCTGGT GCAACAGTCG TTATATCGGC TGGAACTTAT TGATCCGAAT
AATCCGGACG TCGTTGCCGC CCGTTTCCGT TCTTTGTTAC GTCAGGGCGA TATTGATGGT
GCGCAAAAAC AGCTTGATCG GCTGTCGCAG TTAGCGCCGA GTTCAAATGC GTATAAATCG
TCGCGGACTA CGATGCTACT TTCCACGCCG GATGGTCGTC AGGCACTGCA ACAGGCAAGA
TTGCAGGCGA CTACCGGTCA TGCGGAAGAA GCTGTGGCGA GTTACAACAA ACTGTTCAAC
GGTGCGCCGC CGGAAGGTGA CATTGCTGTC GAGTACTGGA GTACGGTGGC GAAAATTCCG
GCTCGCCGTG GCGAAGCGAT TAATCAGCTA AAACGCATTA ATGCGGATAC GCCGGGCAAT
ACGGGCCTGC AAAACAATCT GGCGCTATTG CTGTTTAGTA GCGATCGCCG TGACGAAGGT
TTTGCCGTCC TGGAACAGAT GGCGAAATCG AACGCCGGGC GCGAAGGGGC CTCTAAAATC
TGGTACGGGC AGATTAAAGA CATGCCCGTC AGCGATGCCA GTGTGTCGGC GCTGAAAAAA
TATCTTTCGA TCTTTAGTGA TGGCGATAGC GTGGCGGCTG CGCAATCGCA ACTGGCAGAA
CAGCAAAAAC AGCTGGCCGA TCCTGCTTTC CGCGCTCGTG CGCAAGGTTT AGCGGCGGTG
GACTCTGGTA TGGCGGGTAA AGCCATTCCC GAACTACAAC AGGCAGTGCG GGCGAACCCG
AAAGACAGTG AGGCTCTGGG GGCGCTGGGC CAGGCATATT CACAGAAAGG CGATCGCGCC
AATGCAGTGG CGAATCTGGA AAAAGCCCTC GCACTGGACC CGCACAGCAG CAACAACGAC
AAATGGAATA GTCTGCTGAA AGTGAACCGC TACTGGCTGG CGATCCAGCA GGGCGATGCT
GCGCTGAAAG CCAATAATCC TGACCGGGCA GAACGCCTGT TCCAGCAGGC GCGTAATGTC
GATAACACCG ACAGTTATGC AGTGCTGGGG CTGGGCGATG TGGCGATGGC GCGAAAAGAT
TATCCCGCCG CCGAACGTTA TTATCAGCAG ACCTTGCGTA TGGACAGCGG CAACACTAAC
GCCGTGCGCG GGCTGGCAAA TATTTACCGC CAGCAATCGC CAGAAAAAGC TGAAGCGTTT
ATCGCCTCGC TCTCTGCCAG TCAGCGGCGT AGCATTGATG ATATCGAACG CAGCCTGCAA
AACGACCGTC TGGCACAGCA GGCAGAGGCA CTGGAAAACC AGGGCAAATG GGCGCAGGCG
GCAGCACTTC AGCGGCAACG ACTGGCGCTG GACCCCGGCA GTGTATGGAT TACTTACCGA
CTTTCGCAGG ATCTCTGGCA GGCCGGACAA CGCAGCCAGG CCGATACGTT AATGCGCAAT
CTGGCGCAGC AGAAGCCGAA TGACCCGGAG CAGGTTTACG CTTACGGGCT GTATCTCTCT
GGTCATGACC AGGACAGAGC GGCGCTGGCG CATATCAACA GCCTGCCGCG TGCGCAGTGG
AACAGCAATA TTCAGGAGCT GGTTAATCGA CTGCAAAGCG ATCAGGTGCT GGAAACCGCT
AACCGCCTGC GAGAAAGCGG CAAAGAGGCA GAAGCGGAAG CGATGCTGCG CCAGCAACCA
CCTTCCACGC GTATTGACCT CACGCTGGCT GACTGGGCGC AACAACGACG TGATTACACC
GCCGCCCGCG CTGCATATCA GAATGTCCTG ACGCGGGAGC CAGCTAACGC CGACGCCATT
CTTGGTCTGA CGGAAGTGGA TATTGCTGCC GGTGACAAAG CGGCGGCACG TAGCCAGCTG
GCGAAACTGC CCGCTACCGA TAACGCCTCG CTGAACACAC AGCGGCGCGT GGCGCTGGCA
CAGGCGCAGC TTGGCGATAC CGCAGCAGCG CAGCGGACGT TTAATAAGTT GATCCCGCAG
GCAAAATCTC AGCCACCGTC GATGGAAAGC GCGATGGTGC TGCGTGATGG TGCGAAGTTT
GAAGCGCAGG CGGGCGATCC AACGCAGGCG CTGGAAACCT ACAAAGACGC CATGGTCGCA
TCCGGTGTGA CTACGACGCG TCCGCAGGAT AACGACACCT TTACCCGACT GACCCGTAAC
GACGAGAAAG ATGACTGGCT GAAACGTGGC GTGCGCAGCG ATGCGGCGGA CCTCTATCGC
CAGCAGGATC TTAACGTCAC CCTTGAGCAC GATTACTGGG GTTCGAGCGG CACCGGTGGT
TACTCCGATC TGAAAGCGCA CACTACCATG TTGCAGGTGG ATGCGCCGTA TTCTGACGGG
CGGATGTTCT TTCGCAGTGA TTTCGTCAAT ATGAACGTCG GCAGTTTCTC CACTAATGCC
GATGGCAAAT GGGATGACAA CTGGGGCACC TGTACATTAC AGGACTGTAG CGGCAACCGC
AGCCAGTCGG ACTCCGGTGC CAGCGTGGCG GTCGGCTGGC GAAATGACGT CTGGAGCTGG
GATATCGGTA CCACGCCGAT GGGCTTCAAC GTGGTGGATG TGGTCGGCGG CATCAGTTAC
AGCGATGATA TCGGGCCGCT GGGTTACACC GTTAACGCCC ACCGTCGGCC CATCTCCAGT
TCTTTGCTGG CCTTTGGTGG GCAAAAAGAC TCCCCGAGCA ATACCGGGAA AAAATGGGGT
GGCGTACGTG CCGACGGTGT GGGGCTAAGT CTGAGCTACG ATAAAGGTGA AGCAAACGGC
GTCTGGGCAT CGCTTAGTGG CGACCAGTTA ACCGGTAAAA ATGTCGAAGA TAACTGGCGC
GTGCGCTGGA TGACGGGCTA TTACTATAAG GTCATTAACC AGAACAATCG CCGCGTCACA
ATCGGCCTGA ACAACATGAT CTGGCATTAC GACAAAGATC TGAGTGGCTA CTCACTCGGT
CAGGGCGGTT ACTACAGTCC GCAGGAATAC CTGTCGTTTG CCATACCGGT GATGTGGCGG
GAGCGCACGG AAAACTGGTC GTGGGAGCTG GGTGCGTCTG GCTCGTGGTC GCATTCACGC
ACCAAAACCA TGCCGCGTTA TCCGCTGATG AATCTGATCC CGACCGACTG GCAGGAAGAA
GCTGCGCGGC AATCCAACGA TGGCGGCAGC AGTCAGGGCT TCGGCTACAC GGCGCGGGCA
TTACTTGAAC GACGTGTTAC TTCCAACTGG TTTGTTGGCA CGGCAATTGA TATCCAGCAG
GCGAAAGATT ACGCACCCAG CCATTTCCTG CTCTACGTAC GTTATTCCGC CGCCGGATGG
CAGGGTGACA TGGATTTACC GCCGCAGCCG CTGATACCTT ACGCCGACTG GTAA
 
Protein sequence
MRKFTLNIFT LSLGLTVMPM VEAAPTAQQQ LLEQVRLGEA THREDLVQQS LYRLELIDPN 
NPDVVAARFR SLLRQGDIDG AQKQLDRLSQ LAPSSNAYKS SRTTMLLSTP DGRQALQQAR
LQATTGHAEE AVASYNKLFN GAPPEGDIAV EYWSTVAKIP ARRGEAINQL KRINADTPGN
TGLQNNLALL LFSSDRRDEG FAVLEQMAKS NAGREGASKI WYGQIKDMPV SDASVSALKK
YLSIFSDGDS VAAAQSQLAE QQKQLADPAF RARAQGLAAV DSGMAGKAIP ELQQAVRANP
KDSEALGALG QAYSQKGDRA NAVANLEKAL ALDPHSSNND KWNSLLKVNR YWLAIQQGDA
ALKANNPDRA ERLFQQARNV DNTDSYAVLG LGDVAMARKD YPAAERYYQQ TLRMDSGNTN
AVRGLANIYR QQSPEKAEAF IASLSASQRR SIDDIERSLQ NDRLAQQAEA LENQGKWAQA
AALQRQRLAL DPGSVWITYR LSQDLWQAGQ RSQADTLMRN LAQQKPNDPE QVYAYGLYLS
GHDQDRAALA HINSLPRAQW NSNIQELVNR LQSDQVLETA NRLRESGKEA EAEAMLRQQP
PSTRIDLTLA DWAQQRRDYT AARAAYQNVL TREPANADAI LGLTEVDIAA GDKAAARSQL
AKLPATDNAS LNTQRRVALA QAQLGDTAAA QRTFNKLIPQ AKSQPPSMES AMVLRDGAKF
EAQAGDPTQA LETYKDAMVA SGVTTTRPQD NDTFTRLTRN DEKDDWLKRG VRSDAADLYR
QQDLNVTLEH DYWGSSGTGG YSDLKAHTTM LQVDAPYSDG RMFFRSDFVN MNVGSFSTNA
DGKWDDNWGT CTLQDCSGNR SQSDSGASVA VGWRNDVWSW DIGTTPMGFN VVDVVGGISY
SDDIGPLGYT VNAHRRPISS SLLAFGGQKD SPSNTGKKWG GVRADGVGLS LSYDKGEANG
VWASLSGDQL TGKNVEDNWR VRWMTGYYYK VINQNNRRVT IGLNNMIWHY DKDLSGYSLG
QGGYYSPQEY LSFAIPVMWR ERTENWSWEL GASGSWSHSR TKTMPRYPLM NLIPTDWQEE
AARQSNDGGS SQGFGYTARA LLERRVTSNW FVGTAIDIQQ AKDYAPSHFL LYVRYSAAGW
QGDMDLPPQP LIPYADW
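
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. As a minimal sketch of how that relationship and the GC content field could be recomputed, the code below builds a codon table and translates up to the first stop codon. It assumes the sequence is pasted as plain text with whitespace between 10-base blocks, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in permitted start codons). The names `clean`, `gc_content`, and `translate` are hypothetical helpers, not part of any database tool.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to block the sequence; uppercase it."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence listed above.
first_line = "ATGCGCAAAT TCACACTAAA CATATTCACG CTTTCCCTCG GTCTGACCGT CATGCCGATG"
print(translate(first_line))  # MRKFTLNIFTLSLGLTVMPM
```

The output matches the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_content` would allow the same comparison against the complete protein sequence and the GC content field.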