Gene B21_00826 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_00826 
Symbolcmr 
ID8112745 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp867003 
End bp868235 
Gene Length1233 bp 
Protein Length410 aa 
Translation table11 
GC content52% 
IMG OID644847091 
Producthypothetical protein 
Protein accessionYP_002998664 
Protein GI251784360 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAAATA AATTAGCTTC CGGTGCCAGG CTTGGACGTC AGGCGTTACT TTTCCCTCTC 
TGTCTGGTGC TTTACGAATT TTCAACCTAT ATCGGCAACG ATATGATTCA ACCCGGTATG
TTGGCCGTGG TGGAACAATA TCAGGCGGGC ATTGATTGGG TTCCTACTTC GATGACCGCG
TATCTGGCGG GCGGGATGTT TTTACAATGG CTGCTGGGGC CGCTGTCGGA TCGTATTGGT
CGCCGTCCGG TGATGCTGGC GGGAGTGGTG TGGTTTATCG TCACCTGTCT GGCAATATTG
CTGGCGCAAA ACATTGAACA ATTCACCCTG TTGCGCTTCT TGCAGGGCAT AAGCCTCTGT
TTCATTGGCG CTGTGGGATA CGCCGCAATT CAGGAATCCT TCGAAGAGGC GGTTTGTATC
AAGATCACCG CGCTGATGGC GAACGTGGCG CTGATTGCTC CGCTACTTGG TCCGCTGGTG
GGCGCGGCGT GGATCCATGT GCTGCCCTGG GAGGGGATGT TTGTTTTGTT TGCCGCATTG
GCAGCGATCT CCTTTTTCGG TCTGCAACGA GCCATGCCTG AAACCGCCAC GCGTATAGGC
GAGAAACTGT CACTGAAAGA ACTCGGTCGT GACTATAAGC TGGTGCTGAA GAACGGCCGC
TTTGTGGCGG GGGCGCTGGC GCTGGGATTC GTTAGTCTGC CGTTGCTGGC GTGGATCGCC
CAGTCGCCGA TTATCATCAT TACCGGCGAG CAGTTGAGCA GCTATGAATA TGGCTTGCTG
CAAGTGCCTA TTTTCGGGGC GTTAATTGCG GGTAACTTGC TGTTAGCGCG TCTGACCTCG
CGCCGCACCG TACGTTCGCT GATTATTATG GGCGGCTGGC CGATTATGAT TGGTCTATTG
GTCGCTGCTG CGGCAACGGT TATCTCATCG CACGCGTATT TATGGATGAC TGCCGGGTTA
AGTATTTATG CTTTCGGTAT TGGTCTGGCG AATGCGGGAC TGGTGCGATT AACCCTGTTT
GCCAGCGATA TGAGTAAAGG TACGGTTTCT GCCGCGATGG GAATGCTGCA AATGCTGATC
TTTACCGTTG GTATTGAAAT CAGCAAACAT GCCTGGCTGA ACGGGGGCAA CGGACTGTTT
AATCTCTTCA ACCTTGTCAA CGGAATTTTG TGGCTGTCGC TGATGGTTAT CTTTTTAAAA
GATAAACAGA TGGGAAATTC TCACGAAGGG TAA
 
Protein sequence
MQNKLASGAR LGRQALLFPL CLVLYEFSTY IGNDMIQPGM LAVVEQYQAG IDWVPTSMTA 
YLAGGMFLQW LLGPLSDRIG RRPVMLAGVV WFIVTCLAIL LAQNIEQFTL LRFLQGISLC
FIGAVGYAAI QESFEEAVCI KITALMANVA LIAPLLGPLV GAAWIHVLPW EGMFVLFAAL
AAISFFGLQR AMPETATRIG EKLSLKELGR DYKLVLKNGR FVAGALALGF VSLPLLAWIA
QSPIIIITGE QLSSYEYGLL QVPIFGALIA GNLLLARLTS RRTVRSLIIM GGWPIMIGLL
VAAAATVISS HAYLWMTAGL SIYAFGIGLA NAGLVRLTLF ASDMSKGTVS AAMGMLQMLI
FTVGIEISKH AWLNGGNGLF NLFNLVNGIL WLSLMVIFLK DKQMGNSHEG