Gene Smed_2033 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_2033 
Symbol 
ID5322892 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp2080945 
End bp2084079 
Gene Length3135 bp 
Protein Length1044 aa 
Translation table11 
GC content63% 
IMG OID640790970 
Productacriflavin resistance protein 
Protein accessionYP_001327701 
Protein GI150397234 
COG category[V] Defense mechanisms 
COG ID[COG0841] Cation/multidrug efflux pump 
TIGRFAM ID[TIGR00915] The (Largely Gram-negative Bacterial) Hydrophobe/Amphiphile Efflux-1 (HAE1) Family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACATCG GCATGGGAGG TCATCATCCA GGCGGTGGGC AGGGCGGAAA GACGGGCTTC 
ACAGCGCTCT TCATCCGCCG GCCGATCTTC GCACTGGTGG TCAATACATT GATCGTCGTG
GCTGGCCTTG CCGCGTGGAA CGGCGTGGAG ATTCGCGAGC TGCCGCAGGT GGACCAGCCG
GTGGTTTCCG TCACCACCGA ATTCGACGGC GCCTCGCCGG AAACGATCGA CCGCGAGGTG
ACGTCGGTCG TCGAAGGGGC CGTCTCGCGC GTGCAGGGCA TCAAGGACAT CTCGTCCACC
TCGTCCTTCG GGCGCAGCCG GGTAACGCTC GAATTCTCCG ACACGACCGA TATCGGCCAG
GCGGCCAACG ACGTTCGCGA TGCGCTCGGC CGGATCACCG GTCAACTGCC TGATGACGCC
GACGAGCCTC GTATCGTGAA GGCGGATTCC GACAGCCAGC CGATCATGCG TCTGGCGCTG
ACGTCGGACA CGATGAGCAT GGACGACATG ACGCTGCTCG TCGAGAACGA GATCAGCGAC
CGGCTGGCGG CGGTCGAAGG CGTTGCCGAC GTCACCGTCT ACGGCGATCA GGAGAAGATT
TTCCGTATCG ACGTGAACCA GGCGAAACTT GCCGGGCGCG GTGTCACCGT GGCCAACCTT
CGCGAGGCGC TCGCGAATGC CTCCTACGAT GTGCCGGCGG GGTCGCTGAC GAGTGCAAAC
CAGGATATTT CCGTTCGGGC GACGGCTGAT CTCCAGACGC CCGAGCAGTT CGAAAACCTG
ATGCTCGGCA ATAATGTCAG GCTTCGAGAC GTCGCGACGG TGACGCTCGG CCCGGATACC
GGCACTTCGG CGCTACGCTC CAACGGCCGG GAGGGAATCG GGCTCGGCAT CATCCGTCAG
GCGCAGTCGA ACACGGTCGA TATCTCCCAG GGCGTGCGCA GCGTGGTGAA CGCGATTTCC
TCGGATATCC TGCCTCCCGG AACCGAACTC AAGGTCACCA GCGACGATGC GGTGTTCATC
AACGGCGCCA TTCATGAGGT CGAGATCGCA CTTTTCGTTG CCGTTTTCAT CGTCACCCTG
GTCATCTACC TGTTCCTGCT CGACTGGCGC GCGACGCTGA TCCCGACGAT CACAATGCCG
ATCGCGCTGA TCGGAACGGT AGCCGCGATC TATCTTGCCG GGTTCTCGAT CAACATCCTG
ACCCTTCTGG CGATCGTGCT CGCGACCGGA CTCGTCGTCG ACGACGCGAT CGTGGTGCTC
GAAAACATCG TGCGCCGCCG GGCCGAAGGG CTCGGTCCGC GCGCCGCCGC CGTTCACGGT
ACCCTCGAGG TCTTTTTCGC CGTGCTCGCG ACAACCGCGA CGCTTGCAGC CGTGTTCGTG
CCGCTCTCCT TCCTGCCGGG GCAGACCGGC GGCCTTTTCC GTGAATTCGG CTTCGTGCTC
GCCTTCTCGA TCCTCCTGTC CTCCTTCATC TCGCTGACGC TTTGCCCGAT GCTGGCTTCG
CGCATGCTGA CGAAAGAGAG GCAAGAGCAT CAGGGTGTGA TGCAGCGGTT TGGCCAGCGC
GCCTCCGCCT TTTACCGGGT GACGCTCGGT GCCTGCCTCA ATGCACCGCT CATCGTCTTC
GCCGTTGCGG CGTTCTTCAC CGCCGCCGGC GCCCTGGGGT TCCTGACTCT CAAGTCGGAA
CTGACACCGA ATGAGGACCG CTCCCAGGTC ATGTTGCGCA TCAACGCGCC GCAGGGCGTC
TCGCTGGAAT ACACACAGGC ACAGATACGT CGCATCGAGG AGGGCCTGCA GCCGTTGCTC
AAATCCGGAG AGATCAGCAA TGTGTTCTCG ATTTCCGGAC AGGGCGGATC GGCCAATAGC
GGCTTCATGG TCCTGACGCT CGCGCAATGG GAAGAGCGCG ATCGCACCCA AAGTCAGATC
GTCGCCGACA TCAACAGAAT GACCGCGAAG ATCCCGTCCT TGCGCGCATT CACCATCCAG
GCGAACAGCC TGGGTATCCG CGGCGCCGGC AGCGGCCTTC AGGTCGCGCT CGTCGGCAAC
GACTACACGA AGCTCGGCGA TGCGGCGGCC AAGCTCCTCA GGGCCATGGA GGACAGTGGC
CGTTTCGAGA ATGTAAGGCT GAATTACGAG GCCAACCAGG CGCAATTGTC GGTGACGATC
GACCGCGAAC GCACCTCCGA CCTCGGCGTG GACATAGGCG GACTTTCCTG GGCGCTGCAG
GCGATGCTCG ACGGCAGCAG CGTCGTCGAC ATCTTCGTCG AAGGGGAAGC CTACCCGGTC
AAGCTGCTTT CTTCCACCGA GCCGCTCAAC GACCCGACGG ATCTGCAGAA TATCTTCATC
AAGGCCGGGG ACGGCCGGAT CGTGCCGATG TCGTCGATCG CAACGATCGA GGAGAAGGCG
GTGGCGCCGC AGCTTTCGCG CGAGTCCCAG CTGCGCGCCG TTTCGCTGTC GGCGGGCCTG
AAATCCGACC TCGCCCTTGG CGAGGCGCTG TCGATGGTCG AAGAAATGGC CGAACCCATT
CTGCCGCCGG GATCGCGGAT AATGCCTCTT GCCGAGGCCG CGACGCTGGA CGAGAACGCG
AACGGTCTGT TCGTCACCTT CGGCTTCGCG ATCGTCATCA TCTTCCTGGT GCTTGCGGCG
CAATTCGAAA GTTTCGTGAG CGGCCTCATC ATCATGTCGA CCGTGCCGCT GGGTCTGGCC
TGCGCCATTT TCGCGATGAT CCTGACGGGC AACACGCTGA ACATCTACAG CCAGATAGGC
CTCGTCATGC TTGTCGGCAT CATGGCCAAG AACGGCATTC TGATCGTCGA ATTCGCCAAC
CAGCTCCGCG ACCGCGGACA GGACGTTCGC AGCGCCATAG AGAACGCAGC CAATATCCGT
CTCCGGCCGG TCATGATGAC GATGATCGCG ACTGTCGTCG GCGCGGTACC GCTGGTGCTG
GCAAGCGGTG CCGGTGCCGA AGCGCGCATC GCTCTCGGCT GGGTGCTCGT CGGCGGGTTG
GGCCTGGCGG TGATCGTGAC CCTGTATCTG ACGCCGGTCG CCTACCTGGT CATCGCCGGT
TTCACCAAAC CGCATGCCGA CGAGGAGCGG AAACTGGAGG AAGAGATGAA GGCGGCCGAA
GTGCTGAAGA TCTAG
 
Protein sequence
MNIGMGGHHP GGGQGGKTGF TALFIRRPIF ALVVNTLIVV AGLAAWNGVE IRELPQVDQP 
VVSVTTEFDG ASPETIDREV TSVVEGAVSR VQGIKDISST SSFGRSRVTL EFSDTTDIGQ
AANDVRDALG RITGQLPDDA DEPRIVKADS DSQPIMRLAL TSDTMSMDDM TLLVENEISD
RLAAVEGVAD VTVYGDQEKI FRIDVNQAKL AGRGVTVANL REALANASYD VPAGSLTSAN
QDISVRATAD LQTPEQFENL MLGNNVRLRD VATVTLGPDT GTSALRSNGR EGIGLGIIRQ
AQSNTVDISQ GVRSVVNAIS SDILPPGTEL KVTSDDAVFI NGAIHEVEIA LFVAVFIVTL
VIYLFLLDWR ATLIPTITMP IALIGTVAAI YLAGFSINIL TLLAIVLATG LVVDDAIVVL
ENIVRRRAEG LGPRAAAVHG TLEVFFAVLA TTATLAAVFV PLSFLPGQTG GLFREFGFVL
AFSILLSSFI SLTLCPMLAS RMLTKERQEH QGVMQRFGQR ASAFYRVTLG ACLNAPLIVF
AVAAFFTAAG ALGFLTLKSE LTPNEDRSQV MLRINAPQGV SLEYTQAQIR RIEEGLQPLL
KSGEISNVFS ISGQGGSANS GFMVLTLAQW EERDRTQSQI VADINRMTAK IPSLRAFTIQ
ANSLGIRGAG SGLQVALVGN DYTKLGDAAA KLLRAMEDSG RFENVRLNYE ANQAQLSVTI
DRERTSDLGV DIGGLSWALQ AMLDGSSVVD IFVEGEAYPV KLLSSTEPLN DPTDLQNIFI
KAGDGRIVPM SSIATIEEKA VAPQLSRESQ LRAVSLSAGL KSDLALGEAL SMVEEMAEPI
LPPGSRIMPL AEAATLDENA NGLFVTFGFA IVIIFLVLAA QFESFVSGLI IMSTVPLGLA
CAIFAMILTG NTLNIYSQIG LVMLVGIMAK NGILIVEFAN QLRDRGQDVR SAIENAANIR
LRPVMMTMIA TVVGAVPLVL ASGAGAEARI ALGWVLVGGL GLAVIVTLYL TPVAYLVIAG
FTKPHADEER KLEEEMKAAE VLKI