Gene B21_03403 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_03403 
SymbolyibH 
ID8112600 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp3631447 
End bp3632583 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content53% 
IMG OID644849576 
Producthypothetical protein 
Protein accessionYP_003001149 
Protein GI251786845 
COG category[V] Defense mechanisms 
COG ID[COG1566] Multidrug resistance efflux pump 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATCTAT TGATTGTTTT AACTTACGTG GCGCTGGCGT GGGCGGTCTT TAAAATCTTT 
CGTATTCCGG TGAATCAGTG GACGCTGGCG ACGGCGGCGC TGGGTGGCGT GTTTCTGGTG
AGTGGTTTGA TTTTGTTGAT GAACTACAAC CACCCTTACA CTTTTACCGC GCAAAAGGCA
GTGATAGCGA TCCCCATCAC GCCACAGGTG ACGGGAATTG TTACTGAAGT CACTGACAAG
AATAATCAGC TTATTCAAAA GGGCGAGGTG CTTTTTAAGC TCGACCCGGT TCGTTACCAG
GCGCGAGTTG ACAGGCTTCA GGCTGACCTG ATGACGGCGA CGCATAATAT AAAGACTCTG
CGCGCGCAGC TCACAGAAGC GCAGGCCAAC ACCACCCAGG TTTCAGCGGA GCGCGACCGT
CTGTTTAAAA ATTATCAACG TTATCTGAAA GGCAGCCAGG CGGCGGTGAA TCCGTTCTCG
GAACGTGACA TCGACGATGC GCGGCAAAAT TTCCTCGCGC AGGATGCGCT GGTGAAAGGC
TCGGTGGCGG AGCAGGCGCA GATCCAGAGC CAGCTCGACA GTATGGTTAA CGGCGAGCAA
TCGCAGATTG TGAGCTTAAG AGCGCAACTT ACTGAAGCAA AATATAATCT TGAGCAGACT
GTCATTCGCG CACCAAGCAA TGGCTACGTC ACTCAGGTAC TGATCCGCCC AGGCACATAC
GCAGCTGCCT TGCCGTTGCG TCCGGTGATG GTTTTCATCC CCGAGCAAAA ACGGCAAATT
GTCGCCCAAT TTCGGCAAAA CTCGCTGTTA CGTCTGAAAC CTGGTGATGA TGCAGAAGTG
GTGTTTAACG CGCTACCTGG GCAGGTGTTC CACGGCAAAC TGACCAGTAT TTTACCTGTC
GTGCCAGGCG GTTCTTATCA GGCGCAGGGG GTATTGCAAT CATTAACGGT CGTGCCCGGC
ACGGACGGTG TGCTGGGAAC CATTGAACTG GACCCTAACG ATGATATCGA TGCCTTACCC
GACGGCATCT ACGCCCAGGT GGCGGTTTAC TCCGACCATT TCAGCCATGT TTCGGTGATG
CGGAAAGTGC TGCTAAGAAT GACCAGCTGG ATGCATTATC TTTATTTGGA TCATTGA
 
Protein sequence
MDLLIVLTYV ALAWAVFKIF RIPVNQWTLA TAALGGVFLV SGLILLMNYN HPYTFTAQKA 
VIAIPITPQV TGIVTEVTDK NNQLIQKGEV LFKLDPVRYQ ARVDRLQADL MTATHNIKTL
RAQLTEAQAN TTQVSAERDR LFKNYQRYLK GSQAAVNPFS ERDIDDARQN FLAQDALVKG
SVAEQAQIQS QLDSMVNGEQ SQIVSLRAQL TEAKYNLEQT VIRAPSNGYV TQVLIRPGTY
AAALPLRPVM VFIPEQKRQI VAQFRQNSLL RLKPGDDAEV VFNALPGQVF HGKLTSILPV
VPGGSYQAQG VLQSLTVVPG TDGVLGTIEL DPNDDIDALP DGIYAQVAVY SDHFSHVSVM
RKVLLRMTSW MHYLYLDH