Gene B21_02238 details


Gene Information

Locus tag: B21_02238
Symbol: emrY
ID: 8114560
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21
Kingdom: Bacteria
Replicon accession: NC_012892
Strand:
Start bp: 2357836
End bp: 2359374
Gene Length: 1539 bp
Protein Length: 512 aa
Translation table: 11
GC content: 42%
IMG OID: 644848443
Product: hypothetical protein
Protein accession: YP_003000016
Protein GI: 251785712
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID: [TIGR00711] drug resistance transporter, EmrB/QacA subfamily
            [TIGR01168] Gram-positive signal peptide, YSIRK family
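
The start, end, gene length, and protein length listed above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (an assumption, though it agrees with the listed 1539 bp) and that the last codon of the CDS is a stop codon that is not translated. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2357836
end_bp = 2359374

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
gene_length = end_bp - start_bp + 1

# A CDS is made of whole codons; the final codon is assumed to be a stop codon
# and therefore contributes no amino acid.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1539 0 512
```

Both results match the Gene Length and Protein Length fields of the record.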


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.945599
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCAATCA CTAAATCAAC TCCGGCACCA TTAACCGGTG GGACGTTATG GTGCGTCACT 
ATTGCATTGT CATTAGCGAC ATTTATGCAA ATGTTGGATT CCACTATTTC TAACGTCGCA
ATACCGACAA TATCTGGCTT TCTGGGAGCA TCAACAGACG AAGGCACCTG GGTTATCACC
TCGTTTGGTG TAGCAAATGC CATTGCGATC CCTGTTACTG GCAGGTTGGC ACAAAGAATA
GGCGAATTAA GATTATTTTT ACTTTCAGTA ACCTTTTTTT CGCTGTCTTC ATTAATGTGT
AGCCTATCGA CCAATCTTGA TGTGCTGATA TTTTTTAGAG TCGTTCAGGG GTTAATGGCG
GGGCCGTTAA TTCCACTGTC ACAGAGTTTA TTATTAAGGA ATTACCCACC AGAAAAAAGG
ACATTTGCTC TGGCATTATG GTCAATGACC GTGATTATCG CTCCGATATG TGGGCCGATA
TTGGGCGGCT ATATTTGTGA TAACTTTAGC TGGGGTTGGA TATTTTTAAT CAATGTCCCT
ATGGGGATTG TCGTCCTGAC ATTATGCTTA ACCTTACTTA AAGGAAGAGA AACTGAGACT
TCACCGGTCA AAATGAATCT ACCAGGACTG ACCCTGTTAG TGCTCGGTGT TGGTGGCTTG
CAAATTATGC TTGATAAAGG GCGCGATCTG GATTGGTTCA ACTCGAGTAC GATAATAATA
TTAACAGTAG TATCAGTTAT TTCTCTGATC TCTTTAGTCA TTTGGGAGTC GACCTCAGAG
AACCCGATTC TTGATCTCAG TTTGTTTAAG TCCCGTAACT TCACCATTGG TATTGTGAGT
ATCACATGCG CGTATTTATT TTACTCTGGA GCGATCGTCC TTATGCCGCA GTTACTCCAG
GAAACGATGG GGTATAATGC GATATGGGCC GGACTTGCTT ATGCGCCCAT CGGCATCATG
CCACTATTAA TTTCACCTTT GATAGGACGT TATGGCAACA AAATAGACAT GCGGGTGTTG
GTGACATTCA GTTTTTTGAT GTATGCGGTT TGCTATTACT GGCGTTCTGT GACATTTATG
CCAACGATTG ATTTTACAGG TATCATTATG CCGCAGTTTT TTCAGGGATT CGCCGTTGCC
TGTTTCTTTT TACCCTTAAC AACGATTTCG TTTTCAGGCT TGCCAGATAA TAAATTTGCC
AATGCCTCGA GTATGAGTAA TTTTTTTCGT ACCTTGTCAG GATCAGTTGG TACGTCGTTG
ACAATGACGC TGTGGGGACG ACGCGAATCG TTACACCATA GTCAGTTGAC AGCAACCATC
GATCAATTTA ACCCCGTGTT TAATTCATCG TCACAAATTA TGGATAAATA CTATGGTTCG
CTTTCAGGAG TTCTTAATGA AATTAATAAT GAAATAACCC AGCAGTCACT TTCTATTTCT
GCAAATGAGA TTTTCCGTAT GGCGGCTATT GCTTTTATCT TACTTACGGT TTTGGTTTGG
TTTGCGAAAC CGCCGTTTAC AGCGAAAGGC GTTGGGTGA
 
Protein sequence
MAITKSTPAP LTGGTLWCVT IALSLATFMQ MLDSTISNVA IPTISGFLGA STDEGTWVIT 
SFGVANAIAI PVTGRLAQRI GELRLFLLSV TFFSLSSLMC SLSTNLDVLI FFRVVQGLMA
GPLIPLSQSL LLRNYPPEKR TFALALWSMT VIIAPICGPI LGGYICDNFS WGWIFLINVP
MGIVVLTLCL TLLKGRETET SPVKMNLPGL TLLVLGVGGL QIMLDKGRDL DWFNSSTIII
LTVVSVISLI SLVIWESTSE NPILDLSLFK SRNFTIGIVS ITCAYLFYSG AIVLMPQLLQ
ETMGYNAIWA GLAYAPIGIM PLLISPLIGR YGNKIDMRVL VTFSFLMYAV CYYWRSVTFM
PTIDFTGIIM PQFFQGFAVA CFFLPLTTIS FSGLPDNKFA NASSMSNFFR TLSGSVGTSL
TMTLWGRRES LHHSQLTATI DQFNPVFNSS SQIMDKYYGS LSGVLNEINN EITQQSLSIS
ANEIFRMAAI AFILLTVLVW FAKPPFTAKG VG
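
The sequences above are printed in space-separated groups of ten, so they need cleaning before use. The following is a minimal sketch, not the database's own tooling: it strips the whitespace, computes a GC fraction, and translates codons using the amino acid assignments of translation table 11. The function names are hypothetical, and the sketch does not handle the alternative start codons that table 11 permits (this gene starts with ATG, so that does not matter here).

```python
import re

BASES = "TCAG"
# Amino acid assignments for NCBI translation table 11, listed in TCAG codon
# order ("*" marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text):
    """Remove spaces and line breaks from a grouped sequence block."""
    return re.sub(r"\s+", "", text).upper()


def gc_fraction(dna):
    """Return the fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three groups of the gene sequence from this record.
fragment = clean_sequence("ATGGCAATCA CTAAATCAAC TCCGGCACCA")
print(translate(fragment))  # MAITKSTPAP
```

The translated fragment agrees with the first ten residues of the protein sequence. Passing the full cleaned gene sequence to `gc_fraction` and `translate` would allow the GC content and Protein Length fields to be checked in the same way.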