Gene B21_02857 details

Gene Information

Locus tag: B21_02857
Symbol: tolC
ID: 8113021
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21
Kingdom: Bacteria
Replicon accession: NC_012892
Strand:
Start bp: 3044756
End bp: 3046237
Gene Length: 1482 bp
Protein Length: 493 aa
Translation table: 11
GC content: 51%
IMG OID: 644849045
Product: hypothetical protein
Protein accession: YP_003000618
Protein GI: 251786314
COG category: [M] Cell wall/membrane/envelope biogenesis
              [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG1538] Outer membrane protein
TIGRFAM ID: [TIGR01844] type I secretion outer membrane protein, TolC family
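
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only and is not part of the record. It assumes that the start and end coordinates are 1-based and inclusive, and that the coding sequence ends in a single untranslated stop codon; the variable names are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 3044756
end_bp = 3046237
reported_gene_length = 1482     # bp
reported_protein_length = 493   # aa

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, gene_length == reported_gene_length)          # 1482 True
print(protein_length, protein_length == reported_protein_length)  # 493 True
```

Under these assumptions the reported gene length and protein length agree with the coordinates.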


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGAAAT TGCTCCCCAT TCTTATCGGC CTGAGCCTTT CTGGGTTCAG TTCGTTGAGC 
CAGGCCGAGA ACCTGATGCA AGTTTATCAG CAAGCACGCC TTAGTAACCC GGAATTGCGT
AAGTCTGCCG CCGATCGTGA TGCTGCCTTT GAAAAAATTA ATGAAGCGCG CAGTCCATTA
CTGCCACAGC TAGGTTTAGG TGCAGATTAC ACCTATAGCA ACGGCTACCG CGACGCGAAC
GGCATCAACT CTAACGCGAC CAGTGCGTCC CTGCAGTTAA CTCAATCCAT TTTTGATATG
TCGAAATGGC GTGCGTTAAC GCTGCAGGAA AAAGCAGCAG GGATTCAGGA CGTCACGTAT
CAGACCGATC AGCAAACCTT GATCCTCAAC ACCGCGACCG CTTATTTCAA CGTGTTGAAT
GCTATTGACG TTCTTTCCTA TACACAGGCA CAAAAAGAAG CGATCTACCG TCAATTAGAT
CAAACCACCC AACGTTTTAA CGTGGGCCTG GTAGCGATCA CCGACGTGCA GAACGCCCGC
GCACAGTACG ATACCGTGCT GGCGAACGAA GTGACCGCAC GTAATAACCT TGATAACGCG
GTAGAGCAGC TGCGCCAGAT CACCGGTAAC TACTATCCGG AACTGGCTGC GCTGAATGTC
GAAAACTTTA AAACCGACAA ACCACAGCCG GTTAACGCGC TGCTGAAAGA AGCCGAAAAA
CGCAACCTGT CGCTGTTACA GGCACGCTTG AGCCAGGACC TGGCGCGCGA GCAAATTCGC
CAGGCGCAGG ATGGTCACTT ACCGACTCTG GATTTAACGG CTTCTACCGG GATTTCTGAC
ACCTCTTATA GCGGTTCGAA AACCCGTGGT GCCGCTGGTA CCCAGTATGA CGATAGCAAT
ATGGGCCAGA ACAAAGTTGG CCTGAGCTTC TCGCTGCCGA TTTATCAGGG CGGAATGGTT
AACTCGCAGG TGAAACAGGC ACAGTACAAC TTTGTCGGTG CCAGCGAGCA ACTGGAAAGT
GCCCATCGTA GCGTCGTGCA GACCGTGCGT TCCTCCTTCA ACAACATTAA TGCATCTATC
AGTAGCATTA ACGCCTACAA ACAAGCCGTA GTTTCCGCTC AAAGCTCATT AGACGCGATG
GAAGCGGGCT ACTCGGTCGG TACGCGTACC ATTGTTGATG TGTTGGATGC GACCACCACG
TTGTACAACG CCAAGCAAGA GCTGGCGAAT GCGCGTTATA ACTACCTGAT TAATCAGCTG
AATATTAAGT CAGCTCTGGG TACGTTGAAC GAGCAGGATC TGCTGGCACT GAACAATGCG
CTGAGCAAAC CGGTTTCCAC TAATCCGGAA AACGTTGCAC CGCAAACGCC GGAACAGAAT
GCTATTGCTG ATGGTTATGC GCCTGATAGC CCGGCACCAG TCGTTCAGCA AACATCCGCA
CGCACTACCA CCAGTAACGG TCATAACCCT TTCCGTAACT GA
 
Protein sequence
MKKLLPILIG LSLSGFSSLS QAENLMQVYQ QARLSNPELR KSAADRDAAF EKINEARSPL 
LPQLGLGADY TYSNGYRDAN GINSNATSAS LQLTQSIFDM SKWRALTLQE KAAGIQDVTY
QTDQQTLILN TATAYFNVLN AIDVLSYTQA QKEAIYRQLD QTTQRFNVGL VAITDVQNAR
AQYDTVLANE VTARNNLDNA VEQLRQITGN YYPELAALNV ENFKTDKPQP VNALLKEAEK
RNLSLLQARL SQDLAREQIR QAQDGHLPTL DLTASTGISD TSYSGSKTRG AAGTQYDDSN
MGQNKVGLSF SLPIYQGGMV NSQVKQAQYN FVGASEQLES AHRSVVQTVR SSFNNINASI
SSINAYKQAV VSAQSSLDAM EAGYSVGTRT IVDVLDATTT LYNAKQELAN ARYNYLINQL
NIKSALGTLN EQDLLALNNA LSKPVSTNPE NVAPQTPEQN AIADGYAPDS PAPVVQQTSA
RTTTSNGHNP FRN
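
The relationship between the gene sequence and the protein sequence above can be reproduced with a short script. This is a minimal sketch, not the method used to generate the record. It assumes the sequence is read in frame from its first base and relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard table. The helper names `clean`, `translate`, and `gc_content` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order. NCBI table 11 uses the same
# codon-to-amino-acid assignments as the standard table ("*" marks a stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq):
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three blocks of the gene sequence listed above.
first_block = "ATGAAGAAAT TGCTCCCCAT TCTTATCGGC"
print(translate(first_block))  # MKKLLPILIG
```

The ten residues printed match the start of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_content` would allow the same comparison against the complete protein sequence and the reported GC content.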