Gene GM21_1604 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1604 
Symbol 
ID8136935 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp1869376 
End bp1870923 
Gene Length1548 bp 
Protein Length515 aa 
Translation table11 
GC content61% 
IMG OID644869217 
Productdrug resistance transporter, EmrB/QacA subfamily 
Protein accessionYP_003021417 
Protein GI253700228 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00711] drug resistance transporter, EmrB/QacA subfamily 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones104 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAGCG CGGAAGAAAA GAACGTAAAC AAGTGGCTCA TCACCATTAC GGTGATGCTG 
CCTGCCATCA TGGAGATCGT CGACACCTCG GTCGCCAACG TGGCGCTCCC GCACATGCAG
GGGAGCCTCA ACGCCGGGAC CGACGAGGTC ACCTGGGTCC TCACCTCCTA CCTGGTCAGT
AACGCCGTCG TCCTCCCCAT GACCGGGTGG CTCGCCCGCA TATTCGGCCG CAAAAGGTTC
CTCATCACCT GCATCACCCT CTTCACGATC GCCTCGCTTC TTTGCGGCGC CGCCCCTTCA
CTGGGCATGC TGATCTTCTT CCGCGTCCTG CAGGGGGCCG CCGGCGGCGC GCTCATCCCC
ATGAGCCAAG CGATCATGAT GGAGACCTTC CCCCCCTACC AGCAGGGAAT GGCGATGGCC
ATCTTCGGCG TAGGCGCCAT GTTCGGCCCC ATCATCGGCC CTGCTTTGGG CGGCTGGATC
ACCGACAACA TGAGCTGGCG CTGGATCTTC TACATCAACA TCCCGATCGG GGTCATCGCG
GTCATCATGG CTTCGTTCTT CATCTACGAC CCGAGCTACC TGAAACGTAC CAAGGTCGCC
ATCGACTACT GGGGGCTCGC CCTTCTCACC GTGGGGCTTG GGGCTCTGCA GATAGTGCTC
GACAAAGGGC AGCAGGACGA CTGGTTCAAC TCCCCTTTCA TCGTCGGCTG CGCCGTGGTC
ACCGCCATCG CGCTTTCCGC GCTCGTCTAC GTTGAGCTGA CCCATCCCCA TCCCATCGTG
AACCTGAGGC TCTTCAAGAA CGTCTCCTTT TCCTCGGGGA ACCTGATCAT GTTCGCGGTG
GGCTTTTGCC TTTACAGCTC GATCATGTTG ATCCCGCTGT TCCTGCAGAC CCTCATGGGG
TACAACGCGA CCATGGCAGG TATGGTGCTC GCTCCCGGCG GCGTCGCCAC CTTGGTGTGC
ATGCCCTTCG TGGGCGCTGT GATCCAGCGT TACGACGGCA GGAAGGTCGT CTTCATAGGG
CTCATCATCG GCGCCATTTC CATGTTCATC ATGCAGCGCT TCACCCTGCA GGCGGCCTAC
GTCGACTTCG TCTGGCCTCG CGTGGTGCTG GGGGTAGGCC TTGCCATGAT CTTCGTCCCT
CTGACCACGG TCACCCTGGC AACCATTTCC AAGGAGGAAA TGGGGAACGC GACCGGCATC
TTCAGCCTGC TGAGGAACGT CGGCGGGAGC GTGGGCATAG CCATCGCGGC CACCATGCTG
GCGCGTTACT CGCAGTTTTA CCAGACCAGC CTAGTTGCAC ACGTGAACCC GTACAACCCG
CTGTTCCAAT CCCAGTTCGG GACGCTGAAG GGGGCGCTCA TGGGGCGCGG CCTAGACGCC
GTCGCTGCCG ACAAGGGTGC CATGGCGGTC ATCTACGGGA CCGTGAGCCG GCAGGCCTAC
ATGCTCTCCT ACAACAGGAT CTTCTTCATC GTCGGCCTCG CCTTCCTCGT CATCATTCCG
CTTTTGTTTC TGCTGAAAAA GCCCGTAAAG CACCTGCCGC CGGCGTAA
 
Protein sequence
MKSAEEKNVN KWLITITVML PAIMEIVDTS VANVALPHMQ GSLNAGTDEV TWVLTSYLVS 
NAVVLPMTGW LARIFGRKRF LITCITLFTI ASLLCGAAPS LGMLIFFRVL QGAAGGALIP
MSQAIMMETF PPYQQGMAMA IFGVGAMFGP IIGPALGGWI TDNMSWRWIF YINIPIGVIA
VIMASFFIYD PSYLKRTKVA IDYWGLALLT VGLGALQIVL DKGQQDDWFN SPFIVGCAVV
TAIALSALVY VELTHPHPIV NLRLFKNVSF SSGNLIMFAV GFCLYSSIML IPLFLQTLMG
YNATMAGMVL APGGVATLVC MPFVGAVIQR YDGRKVVFIG LIIGAISMFI MQRFTLQAAY
VDFVWPRVVL GVGLAMIFVP LTTVTLATIS KEEMGNATGI FSLLRNVGGS VGIAIAATML
ARYSQFYQTS LVAHVNPYNP LFQSQFGTLK GALMGRGLDA VAADKGAMAV IYGTVSRQAY
MLSYNRIFFI VGLAFLVIIP LLFLLKKPVK HLPPA