Gene B21_01974 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_01974 
SymbolmdtD 
ID8114177 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp2059699 
End bp2061114 
Gene Length1416 bp 
Protein Length471 aa 
Translation table11 
GC content56% 
IMG OID644848188 
Producthypothetical protein 
Protein accessionYP_002999761 
Protein GI251785457 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00711] drug resistance transporter, EmrB/QacA subfamily 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAGATC TTCCCGACAG CACCCGTTGG CAATTGTGGA TTGTGGCTTT CGGCTTCTTT 
ATGCAGTCGC TGGACACCAC CATCGTAAAC ACCGCCCTTC CCTCAATGGC GCAAAGCCTC
GGGGAAAGTC CGTTGCATAT GCACATGGTC ATTGTCTCTT ATGTGCTGAC CGTGGCGGTG
ATGCTGCCCG CCAGCGGCTG GCTGGCGGAC AAAGTCGGCG TGCGCAATAT TTTCTTTACC
GCCATCGTGC TGTTTACTCT CGGTTCACTG TTTTGCGCGC TTTCCGGCAC GCTGAACGAA
CTGTTGCTGG CACGCGCGTT ACAGGGCGTT GGCGGCGCGA TGATGGTGCC GGTCGGCAGA
TTGACGGTGA TGAAAATCGT ACCGCGCGAG CAATATATGG CGGCGATGAC CTTTGTCACG
TTACCCGGTC AGGTCGGTCC GCTGCTCGGT CCGGCGCTCG GCGGTCTGCT GGTGGAGTAC
GCATCGTGGC ACTGGATCTT TTTGATCAAC ATTCCGGTGG GGATTATCGG TGCGATCGCC
ACATTGCTGT TAATGCCAAA CTACACCATG CAGACGCGGC GCTTTGATCT CTCCGGATTT
TTATTGCTGG CGGTTGGCAT GGCGGTATTG ACCCTGGCGC TGGACGGCAG TAAAGGTACA
GGTTTATCGC CGCTGACGAT TGCAGGCCTG GTCGCAGTTG GCGTGGTGGC ACTGGTGCTT
TATCTGCTGC ACGCCAGAAA TAACAACCGT GCCCTGTTCA GTCTGAAACT GTTCCGTACT
CGTACCTTTT CGCTGGGCCT GGCGGGGAGC TTTGCCGGAC GTATTGGCAG TGGCATGTTG
CCCTTTATGA CACCGGTTTT CCTGCAAATT GGCCTCGGTT TCTCGCCGTT TCATGCCGGA
CTGATGATGA TCCCGATGGT GCTTGGCAGC ATGGGAATGA AGCGAATTGT GGTACAGGTG
GTGAATCGCT TTGGTTATCG TCGGGTACTG GTAGCGACCA CGCTGGGTCT GTCGCTGGTC
ACCCTGTTGT TTATGACTAC CGCCCTGCTG GGCTGGTACT ACGTTTTGCC GTTCGTCCTG
TTTTTACAAG GGATGGTCAA CTCGACGCGT TTCTCCTCCA TGAACACCCT GACGCTGAAA
GATCTCCCGG ACAATCTGGC GAGCAGCGGC AACAGCCTGC TGTCGATGAT TATGCAATTG
TCGATGAGTA TCGGCGTCAC TATCGCCGGG CTGTTGCTGG GACTTTTTGG TTCACAGCAT
GTCAGCGTCG ACAGCGGCAC CACACAAACC GTCTTTATGT ACACCTGGCT TAGCATGGCG
TTGATCATCG CCCTTCCGGC GTTCATCTTT GCCAGAGTGC CGAACGATAC GCATCAAAAT
GTAGCTATTT CGCGGCGAAA AAGGAGCGCG CAATGA
 
Protein sequence
MTDLPDSTRW QLWIVAFGFF MQSLDTTIVN TALPSMAQSL GESPLHMHMV IVSYVLTVAV 
MLPASGWLAD KVGVRNIFFT AIVLFTLGSL FCALSGTLNE LLLARALQGV GGAMMVPVGR
LTVMKIVPRE QYMAAMTFVT LPGQVGPLLG PALGGLLVEY ASWHWIFLIN IPVGIIGAIA
TLLLMPNYTM QTRRFDLSGF LLLAVGMAVL TLALDGSKGT GLSPLTIAGL VAVGVVALVL
YLLHARNNNR ALFSLKLFRT RTFSLGLAGS FAGRIGSGML PFMTPVFLQI GLGFSPFHAG
LMMIPMVLGS MGMKRIVVQV VNRFGYRRVL VATTLGLSLV TLLFMTTALL GWYYVLPFVL
FLQGMVNSTR FSSMNTLTLK DLPDNLASSG NSLLSMIMQL SMSIGVTIAG LLLGLFGSQH
VSVDSGTTQT VFMYTWLSMA LIIALPAFIF ARVPNDTHQN VAISRRKRSA Q