Gene B21_01971 details


Gene Information

Locus tag: B21_01971
Symbol: mdtA
ID: 8112764
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21
Kingdom: Bacteria
Replicon accession: NC_012892
Strand:
Start bp: 2052251
End bp: 2053498
Gene Length: 1248 bp
Protein Length: 415 aa
Translation table: 11
GC content: 55%
IMG OID: 644848185
Product: hypothetical protein
Protein accession: YP_002999758
Protein GI: 251785454
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0845] Membrane-fusion protein
TIGRFAM ID: [TIGR01730] RND family efflux transporter, MFP subunit
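
The Start bp, End bp, Gene Length and Protein Length fields can be cross-checked with simple arithmetic. The sketch below is illustrative only and rests on two assumptions that the record does not state: that the coordinates are 1-based and inclusive, and that the protein length excludes the terminal stop codon.

```python
# Values copied from the Gene Information fields.
start_bp = 2052251
end_bp = 2053498

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends in a stop codon that is not counted as a residue.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)  # 1248 415
```

Under those assumptions the computed values agree with the listed Gene Length (1248 bp) and Protein Length (415 aa).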


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAGGCA GTTATAAATC CCGTTGGGTA ATCGTAATCG TGGTGGTTAT CGCCGCCATC 
GCCGCATTCT GGTTCTGGCA AGGCCGCAAT GACTCCCAGA GTGCAGCCCC AGGGGCGACG
AAACAAGCGC AGCAATCGCC AGCGGGTGGT CGCCGTGGTA TGCGTTCCGG CCCATTAGCC
CCGGTTCAGG CGGCGACCGC CGTAGAACAG GCAGTTCCGC GTTACCTCAC CGGGCTTGGC
ACCATTACCG CTGCTAACAC CGTTACGGTG CGCAGCCGCG TAGATGGTCA GTTGATGGCG
TTACATTTCC AGGAAGGCCA GCAGGTCAAA GCAGGCGATT TACTGGCAGA AATTGACCCC
AGCCAGTTCA AAGTTGCATT AGCACAAGCC CAGGGCCAAC TGGCAAAAGA TAAAGCCACG
CTTGCCAACG CCCGCCGTGA CCTGGCGCGT TATCAACAAC TGGTAAAAAC CAATCTCGTA
TCTCGTCAGG AACTGGATGC CCAACAAGCG CTGGTCAGTG AAACCGAAGG CACCATTAAG
GCTGATGAAG CAAGCGTCGC CAGCGCACAG CTGCAACTCG ACTGGAGCCG CATCACCGCA
CCAGTCGATG GTCGCGTTGG TCTCAAGCAG GTTGATGTTG GTAACCAAAT CTCCAGTGGT
GATACCACCG GAATTGTGGT GATCACCCAG ACGCATCCTA TCGATTTGCT CTTTACCCTG
CCGGAAAGCG ATATCGCTAC CGTTGTGCAG GCGCAAAAAG CCGGAAAACC GCTGGTGGTA
GAAGCCTGGG ATCGCACCAA CTCGAAGAAA TTAAGTGAAG GCACGCTGTT AAGTCTCGAT
AACCAAATCG ATGCCACTAC CGGTACGATT AAAGTGAAAG CACGCTTTAA TAATCAGGAT
GATGCGCTGT TTCCCAATCA GTTTGTTAAC GCGCGCATGT TAGTCGACAC CGAACAAAAC
GCCGTAGTGA TCCCAACAGC CGCCCTGCAA ATGGGCAATG AAGGCCATTT TGTCTGGGTG
CTGAATAGCG AAAACAAGGT CAGCAAACAT CTGGTGACGC CGGGCATTCA GGACAGTCAG
AAAGTGGTGA TCCGCGCAGG TATTTCTGCG GGCGATCGCG TGGTGACAGA CGGCATTGAT
CGCCTGACCG AAGGGGCGAA AGTGGAAGTG GTGGAAGCCC AGAGCGCCAC CACTCCGGAA
GAGAAAGCCA CCAGCCGCGA ATACGCGAAA AAAGGAGCAC GCTCCTGA
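
The GC content field can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming GC content is defined as the percentage of G and C bases over the full sequence; `gc_content` is a hypothetical helper name, and the record does not say how its 55% figure was rounded.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace and case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Only the first row of the gene sequence is used here for brevity;
# pass the full 1248 bp sequence to compare against the listed GC content.
first_row = "ATGAAAGGCA GTTATAAATC CCGTTGGGTA ATCGTAATCG TGGTGGTTAT CGCCGCCATC"
print(round(gc_content(first_row), 1))
```

The helper strips the spaces that separate the 10-base blocks, so the sequence can be pasted in exactly as displayed.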
 
Protein sequence
MKGSYKSRWV IVIVVVIAAI AAFWFWQGRN DSQSAAPGAT KQAQQSPAGG RRGMRSGPLA 
PVQAATAVEQ AVPRYLTGLG TITAANTVTV RSRVDGQLMA LHFQEGQQVK AGDLLAEIDP
SQFKVALAQA QGQLAKDKAT LANARRDLAR YQQLVKTNLV SRQELDAQQA LVSETEGTIK
ADEASVASAQ LQLDWSRITA PVDGRVGLKQ VDVGNQISSG DTTGIVVITQ THPIDLLFTL
PESDIATVVQ AQKAGKPLVV EAWDRTNSKK LSEGTLLSLD NQIDATTGTI KVKARFNNQD
DALFPNQFVN ARMLVDTEQN AVVIPTAALQ MGNEGHFVWV LNSENKVSKH LVTPGIQDSQ
KVVIRAGISA GDRVVTDGID RLTEGAKVEV VEAQSATTPE EKATSREYAK KGARS