Gene B21_03538 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_03538 
SymbolmdtL 
ID8112816 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp3778489 
End bp3779664 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content53% 
IMG OID644849708 
Producthypothetical protein 
Protein accessionYP_003001281 
Protein GI251786977 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCCGCT TTTTGATTTG TAGTTTTGCC CTGGTTTTAC TTTATCCCGC CGGGATTGAT 
ATGTACCTCG TTGGTTTACC GCGCATCGCC GCCGATCTCA ATGCCAGCGA AGCGCAGTTG
CATATTGCGT TCTCCGTATA TCTGGCGGGG ATGGCAGCTG CGATGTTATT TGCCGGTAAA
GTGGCCGATC GTTCAGGGAG AAAGCCGGTC GCCATACCCG GCGCGGCGCT ATTTATTATT
GCCTCGGTGT TCTGTTCACT GGCTGAAACC AGCACGTTAT TTCTTGCAGG CCGATTTCTA
CAGGGGTTGG GCGCAGGCTG TTGTTACGTA GTGGCGTTCG CCATTTTGCG CGACACGCTG
GATGATCGAC GTCGGGCTAA AGTGCTGTCA TTACTCAACG GTATTACCTG CATCATTCCG
GTGTTAGCGC CAGTGCTCGG ACATCTGATT ATGCTTAAAT TCCCGTGGCA GAGTCTGTTC
TGGGCGATGG CAATGATGGG CATCGCGGTA CTGATGTTGT CTTTGTTTAT TTTAAAAGAA
ACGCGCCCAG CGTCCCCCGC TGCTTCGGAC AAACCACGAG AAAATAGCGA GTCGCTGCTT
AATCGGTTTT TCCTCAGCCG TGTTGTTATC ACCACCCTCA GCGTTTCGGT GATCCTCACT
TTCGTCAATA CATCGCCGGT ATTGCTGATG GAAATCATGG GTTTTGAGCG CGGAGAATAC
GCCACCATTA TGGCGTTGAC TGCTGGCGTC AGCATGACCG TTTCATTCTC CACGCCATTT
GCGCTGGGAA TTTTTAAGCC ACGTACGTTG ATGATCACCT CGCAGGTGTT ATTCCTTGCA
GCGGGGATCA CCCTTGCCGT TTCACCTTCC CATGCGGTTT CTCTGTTTGG TATCACGCTG
ATTTGCGCCG GTTTCTCGGT AGGTTTTGGC GTAGCGATGA GTCAGGCGTT AGGACCATTT
TCATTACGCG CGGGCGTAGC CAGCTCGACC TTAGGTATTG CGCAGGTTTG CGGTTCGTCA
CTGTGGATTT GGCTGGCAGC GGTGGTTGGT ATCGGCGCAT GGAATATGCT GATCGGGATT
CTGATTGCCT GTAGCATAGT GAGCCTGTTG CTGATTATGT TCGTCGCGCC TGGACGCCCC
GTTGCCGCTC ATGAAGAAAT CCATCACCAC GCTTGA
 
Protein sequence
MSRFLICSFA LVLLYPAGID MYLVGLPRIA ADLNASEAQL HIAFSVYLAG MAAAMLFAGK 
VADRSGRKPV AIPGAALFII ASVFCSLAET STLFLAGRFL QGLGAGCCYV VAFAILRDTL
DDRRRAKVLS LLNGITCIIP VLAPVLGHLI MLKFPWQSLF WAMAMMGIAV LMLSLFILKE
TRPASPAASD KPRENSESLL NRFFLSRVVI TTLSVSVILT FVNTSPVLLM EIMGFERGEY
ATIMALTAGV SMTVSFSTPF ALGIFKPRTL MITSQVLFLA AGITLAVSPS HAVSLFGITL
ICAGFSVGFG VAMSQALGPF SLRAGVASST LGIAQVCGSS LWIWLAAVVG IGAWNMLIGI
LIACSIVSLL LIMFVAPGRP VAAHEEIHHH A