Gene B21_02004 details


Gene Information

Locus tag: B21_02004
Symbol: yehI
ID: 8116160
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21
Kingdom: Bacteria
Replicon accession: NC_012892
Strand:
Start bp: 2094454
End bp: 2098086
Gene Length: 3633 bp
Protein Length: 1210 aa
Translation table: 11
GC content: 50%
IMG OID: 644848216
Product: hypothetical protein
Protein accession: YP_002999789
Protein GI: 251785485
COG category:
COG ID:
TIGRFAM ID:
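
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the reported protein length excludes the terminal stop codon. The function names are hypothetical and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
START_BP = 2094454
END_BP = 2098086

length = gene_length_bp(START_BP, END_BP)
print(length)                     # 3633
print(protein_length_aa(length))  # 1210
```

Under these assumptions both results agree with the Gene Length and Protein Length fields of the record.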


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGACAAGG AATTACCGTG GCTGGCGGAT AACGCCCAAC TGGAACTGAA ATATAAAAAA 
GGCAAAACGC CGCTCAGTCA TCGTCGCTGG CCGGGCGAAC CAGTGTCCGT TATCACTGGA
AGTCTCATCC AGACATTGGG TGATGAATTG CTACAAAAAG CTGAGAAGAA AAAAAACATT
GTCTGGCGTT ATGAGAATTT TTCACTGGAG TGGCAGTCCG CCATCACGCA GGCCATCAAC
TTGATCGGCG AACACAAACC CTCAGTCCCG GCCCAGACAA TGGCGGCGCT AGCCTGTATC
GCGCAAAATG ACAGCCAACA GTTGCTCGAC GAAATCGTCC AACAAGAGGG GCTGGAATAT
GCGACTGAGG TGGTGATTGC ACGCCAGTTT ATTGCGCGGT GTTATGAGAG TGATCCTCTG
GTAGTGACAT TGCAGTATCA GGACGAGGAT TATGGCTATG GTTATCGCTC AGAAACCTAT
AACGAATTCG ATCTCCGACT GCGTAAGCAT CTCTCTCTGG CAGAGGAAAG CTGCTGGCAG
CGTTGCGCCG ACAAACTCAT TGCCGCACTA CCAGGAATAA CCAAAGTTCG CCGCCCTTTT
ATTGCGCTGA TCCTCCCGGA AAAACCAGAA ATAGCCAATG AGTTGGTAGG CCTTGAATGC
CCGCGAACTC ATTTTCATTC TAAGGAGTGG TTAAAAGTTG TTGCTAATGA CCCCACAGCG
GTGAGAAAAC TCGAACACTA CTGGAGCCAG GATATATTTA GCGATCGAGA AGCCAGCTAC
ATGTCGCATG AAAACCACTT CGGCTACGCG GCCTGCGCCG CCCTTTTGCG CGAACAAGGA
CTGGCAGCCA TTCCGCGCCT CGCGATGTAT GCCCATAAAG AAGATTGCGG CAGTCTGCTG
GTACAAATTA ACCATCCGCA AGTCATCCGC ACCTTGCTAC TGGTGGCTGA TAAAAACAAA
CCCAGCCTGC AACGTGTAGC TAAATACCAT AAAAACTTCC CCCATGCGAC GCTCGCCGCA
CTGGCAGAAC TGCTGGCGTT AACAGAACCA CCAGCCCGCC CTGGTTATCC AATCATCGAA
GACAAAAAGC TGCCTGCACA GCAAAAAGCA CGCGATGAAT ACTGGCGTAC GCTGTTACAG
ACGCTGATGG CATCGCAGCC ACAACTGGCA GAAGAGGTGA TGCAGTGGTT AAGTACTCAA
GCCAGGGCAG TGCTGAATAG TTATTTATCG GCACCGCCCA AACCGGTTAT TGATAGTACC
GATAACAGCA ATCTGCCTGA AATCCTCGTT TCGCCACCGT GGCGTAGTAA GAAAAAAATG
ACAGCTCCAC GTCTTGATTT GGCACCGCTC GAATTAACTC CGCAAGTTTA CTGGCAACCA
GGCGAACAAG AGAGGCTTGC CGCCACTGAG TCTGCCCGTT ATTTCAGCAC GGAATCTCTT
GCGCAACGCA TGGAACAAAA AAGTGGACGA GTTGTATTAC AGGAACTGGG TTTTGGGGAT
GATGTATGGC TGTTTCTGAA TTATATACTC CCCGGAAAAC TGGATGCTGC ACGCAATTCA
CTCATTGTTC AGTGGCATTA CTACCAGGGG CGGGTTGAAG AGATCCTGAA TGGCTGGAAC
TCCCCGGAAG CACAATTAGC AGAACAGGCG CTCCGCAGCG GTCACATAGA AGCGTTAATT
AACATATGGG AAAATGACAA CTACTCACGT TATCGTCCGG AAAAGAGTGT CTGGAACCTG
TATTTATTGG CACAGTTGCC GCGTGAGATG GCTTTGACCT TCTGGCTGCG TATCAATGAG
AAAAAGCATC TGTTCGCGGG TGAGGACTAT TTTCTCAGTA TCCTCGGATT GGATGCGCTA
CCAGGTCTGC TGTTGGCTTT TTCACATCGT CCAAAAGAAA CATTTCCGTT AATTTTAAAT
TTCGGCGCAA CAGAACTGGC CCTGCCCGTT GCCCGCGTCT GGCACCGTTT TGCGGGCCAG
CGTAATCTGG CTCGCCAGTG GATTTTACAA TGGCCGGAAC ATACGGCTAC TGCACTTATT
CCACTCGTCT TTGTTAAACC CTGCGACAAC AGCGAAGCGG CATTATTTGC CCTTCGTTTA
CTGTATGAAC AAGGACATAG TGAATTACTG CAAACGGTTG CAAACCGCTG GGATCGCGCT
GATATGTGGC CAGCCCTGGA AAAAATACTT ACCCAGAACC CGATGGAAAT TTACCCGGCA
CGCATTCCAA AAGCCCCTGA TTTCTGGCAT CCGCAAATGT GGTCCAGGCC GCGCCTTATC
ACTAATAATC AAACTGTTAC CAATGACGCT CTGGAAATTA TCGGCGAAAT GCTGCGCTTT
ACCCAGGGGG GACGTTTTTA TAGCGGGCTG GAACAACTGA AAACGTTCTG CCAGCCACAA
ACGCTGGCAG CTTTTGCTTG GGATCTCTTC ACTGCGTGGC AACAAGCTGG TGCCCCCGCA
AAAGACAACT GGACATTTCT GGCGTTGAGT CTCTTTGGTG ACGAAAGCAC GGCACGGGAT
CTAACGACAC AGATCCTCGC CTGGCCACAA GAAGGCAAAT CTGCCCGTGC GGTCAGTGGC
CTGAACATCC TTACCCTGAT GAATAATGAT ATGGCGCTGA TACAGCTGCA TCATATATCG
CAACGGGCGA AATCCTCTTC ATTACGTGAA AACGCAGCGG AATTTCTTCA AGTGGTCGCA
GAAAATCGCG GGCTAAGCCA GGAAGAGTTA GCGGACAGAT TAGTCCCAAC CCTGGGCCTT
GATGATCCGC AGGCGTTGAT TTTTGATTTT GGTCCCCGGC AGTTTACCGT TCGCTTCGAT
GAAAACCTCA ATCCGGTTAT CTTTGATCAG CAAAACGTTC GCCAGAAAAG CGTTCCCCGG
TTGCGCGCCG ATGACGATCA ACTGAAAGCG CCCGAGGCAC TGGCCCGACT AAAAGGGTTA
AAAAAAGATG CTACTCAGGT GAGCAAAAAC CTGCTCCCGC GTCTTGAAGC TGCCCTACGT
ACCATCCGAC GCTGGTCGCT GGCAGATTTT CATACTCTGT TTGTTAATCA TCCCTTTACC
CGCCTGGTTA CCCAGCGATT AATATGGGGC GTGTATCCGG CAAATGAACC GCGTTGTTTA
CTCAACGCCT TTCGTGTGGC CGCAGAGGGG GAGTTCTGCA ATGCGCAAGA TGAGCCAATT
GGCCTGCCTG CGGATGCTCT GATTGGCATT GCCCACCCGT TAGAAATGAC AGCAGAAATG
CGCAGTGAAT TTGCACAGCT TTTTGCCGAT TACGAAATTA TGCCGCCTTT TCGCCAGTTG
TCGCGCCGCA CGGTGCTGCT CACACCTGAC GAGTCAACCA GTAACAGCCT GACTCGCTGG
GAAGGTAAAT CCGCTACCGT TGGGCAACTT ATGGGAATGC GATACAAAGG CTGGGAGTCA
GGCTATGAGG ACGCATTTGT CTATAACCTG GGTGAGTACC GGCTGGTCCT TAAGTTTTCA
CCCGGTTTTA ACCACTACAA TGTTGATAGC AAAGCGCTAA TGAGCTTCCG TTCTCTTCGA
GTGTACCGTG ACAATAAATC CGTCACTTTT GCCGAACTTG ATGTGTTTGA TTTGAGTGAG
GCGTTAAGCG CACCTGACGT CATTTTCCAT TAA
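
The gene sequence is printed in blocks of 10 bases, so the spaces and line breaks need to be removed before any computation. The following is a minimal sketch of that cleanup plus a GC-fraction calculation, the quantity behind the record's GC content field. The helper names are hypothetical, and only the first row of the sequence is used here as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First row of the gene sequence, copied from the record.
first_row = "ATGGACAAGG AATTACCGTG GCTGGCGGAT AACGCCCAAC TGGAACTGAA ATATAAAAAA"

seq = clean_sequence(first_row)
print(len(seq))                    # number of bases in the sample row
print(round(gc_fraction(seq), 3))  # GC fraction of the sample row only
```

Passing the full gene sequence through the same two functions gives a value to compare against the GC content reported in the record.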
 
Protein sequence
MDKELPWLAD NAQLELKYKK GKTPLSHRRW PGEPVSVITG SLIQTLGDEL LQKAEKKKNI 
VWRYENFSLE WQSAITQAIN LIGEHKPSVP AQTMAALACI AQNDSQQLLD EIVQQEGLEY
ATEVVIARQF IARCYESDPL VVTLQYQDED YGYGYRSETY NEFDLRLRKH LSLAEESCWQ
RCADKLIAAL PGITKVRRPF IALILPEKPE IANELVGLEC PRTHFHSKEW LKVVANDPTA
VRKLEHYWSQ DIFSDREASY MSHENHFGYA ACAALLREQG LAAIPRLAMY AHKEDCGSLL
VQINHPQVIR TLLLVADKNK PSLQRVAKYH KNFPHATLAA LAELLALTEP PARPGYPIIE
DKKLPAQQKA RDEYWRTLLQ TLMASQPQLA EEVMQWLSTQ ARAVLNSYLS APPKPVIDST
DNSNLPEILV SPPWRSKKKM TAPRLDLAPL ELTPQVYWQP GEQERLAATE SARYFSTESL
AQRMEQKSGR VVLQELGFGD DVWLFLNYIL PGKLDAARNS LIVQWHYYQG RVEEILNGWN
SPEAQLAEQA LRSGHIEALI NIWENDNYSR YRPEKSVWNL YLLAQLPREM ALTFWLRINE
KKHLFAGEDY FLSILGLDAL PGLLLAFSHR PKETFPLILN FGATELALPV ARVWHRFAGQ
RNLARQWILQ WPEHTATALI PLVFVKPCDN SEAALFALRL LYEQGHSELL QTVANRWDRA
DMWPALEKIL TQNPMEIYPA RIPKAPDFWH PQMWSRPRLI TNNQTVTNDA LEIIGEMLRF
TQGGRFYSGL EQLKTFCQPQ TLAAFAWDLF TAWQQAGAPA KDNWTFLALS LFGDESTARD
LTTQILAWPQ EGKSARAVSG LNILTLMNND MALIQLHHIS QRAKSSSLRE NAAEFLQVVA
ENRGLSQEEL ADRLVPTLGL DDPQALIFDF GPRQFTVRFD ENLNPVIFDQ QNVRQKSVPR
LRADDDQLKA PEALARLKGL KKDATQVSKN LLPRLEAALR TIRRWSLADF HTLFVNHPFT
RLVTQRLIWG VYPANEPRCL LNAFRVAAEG EFCNAQDEPI GLPADALIGI AHPLEMTAEM
RSEFAQLFAD YEIMPPFRQL SRRTVLLTPD ESTSNSLTRW EGKSATVGQL MGMRYKGWES
GYEDAFVYNL GEYRLVLKFS PGFNHYNVDS KALMSFRSLR VYRDNKSVTF AELDVFDLSE
ALSAPDVIFH
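
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon table by hand; translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which start codons are permitted), so a standard table is enough for this check. The function name `translate` is hypothetical, and only short fragments from the two ends of the gene are used as input.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 15 bases of the gene sequence above.
print(translate("ATGGACAAGG AATTACCGTG GCTGGCGGAT"))  # MDKELPWLAD
print(translate("GTCATTTTCCATTAA"))                    # VIFH
```

The two fragments translate to the first ten and last four residues of the protein sequence listed above.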