Gene B21_02058 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_02058 
SymbolsetB 
ID8113148 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp2158024 
End bp2159205 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content53% 
IMG OID644848268 
Producthypothetical protein 
Protein accessionYP_002999841 
Protein GI251785537 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00899] sugar efflux transporter 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.175397 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCATAACT CCCCCGCAGT CTCCAGCGCG AAATCGTTTG ACCTGACCTC GACGGCGTTT 
TTAATCGTTG CCTTTCTCAC CGGTATTGCG GGCGCTCTGC AAACCCCGAC ACTCAGTATT
TTTCTTACCG ATGAAGTACA TGCCCGTCCG GCGATGGTGG GATTCTTCTT TACCGGCAGC
GCTGTCATTG GGATTCTGGT AAGTCAGTTT CTCGCCGGGC GCTCTGATAA GCGCGGCGAT
CGCAAATCGC TGATTGTCTT TTGCTGCCTG TTAGGCGTGC TGGCCTGCAC CCTTTTTGCC
TGGAATCGCA ACTACTTTGT TTTGCTATTC GTTGGCGTCT TTCTTAGCAG CTTTGGCTCG
ACCGCTAACC CGCAAATGTT TGCCCTTGCC CGTGAACATG CCGACAAAAC CGGACGTGAG
GCGGTGATGT TCAGCTCTTT TTTACGCGCT CAGGTTTCAC TGGCATGGGT CATTGGCCCA
CCGCTGGCTT ATGCCTTAGC GATGGGTTTC AGCTTTACGG TAATGTATCT GAGCGCAGCG
GTAGCGTTTA TTGTTTGCGG TGTGATGGTG TGGCTGTTTT TACCGTCGAT GCAAAAAGAG
CTTCCGCTGG CGACCGGCAC GATCGAAGCG CCGCGCCGTA ACCGTCGCGA TACGCTGCTG
CTGTTTGTCA TTTGTACATT GATGTGGGGC TCGAACAGCC TGTACATCAT CAACATGCCG
CTATTTATTA TCAACGAACT GCATCTTCCC GAGAAACTGG CCGGTGTGAT GATGGGGACC
GCCGCCGGGC TGGAAATCCC GACGATGTTG ATTGCCGGAT ATTTCGCCAA ACGTCTGGGT
AAGCGTTTCT TAATGCGCGT TGCTGCCGTG GGTGGCGTCT GTTTTTACGC AGGAATGCTG
ATGGCGCATT CACCTGTCAT TCTGTTGGGC TTGCAGCTGC TAAATGCTAT TTTTATTGGC
ATTCTGGGCG GCATCGGGAT GCTCTATTTT CAGGATCTGA TGCCCGGTCA GGCGGGTTCA
GCCACCACGC TCTATACCAA CACTTCGCGC GTGGGCTGGA TCATCGCAGG ATCAGTGGCG
GGCATCGTCG CCGAGATCTG GAATTATCAC GCTGTGTTCT GGTTTGCGAT GGTGATGATT
ATCGCCACTC TGTTTTGCTT ACTGCGGATT AAAGATGTTT AA
 
Protein sequence
MHNSPAVSSA KSFDLTSTAF LIVAFLTGIA GALQTPTLSI FLTDEVHARP AMVGFFFTGS 
AVIGILVSQF LAGRSDKRGD RKSLIVFCCL LGVLACTLFA WNRNYFVLLF VGVFLSSFGS
TANPQMFALA REHADKTGRE AVMFSSFLRA QVSLAWVIGP PLAYALAMGF SFTVMYLSAA
VAFIVCGVMV WLFLPSMQKE LPLATGTIEA PRRNRRDTLL LFVICTLMWG SNSLYIINMP
LFIINELHLP EKLAGVMMGT AAGLEIPTML IAGYFAKRLG KRFLMRVAAV GGVCFYAGML
MAHSPVILLG LQLLNAIFIG ILGGIGMLYF QDLMPGQAGS ATTLYTNTSR VGWIIAGSVA
GIVAEIWNYH AVFWFAMVMI IATLFCLLRI KDV