Gene Cmaq_0049 details


Gene Information

Locus tag: Cmaq_0049
Symbol:
ID: 5709077
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caldivirga maquilingensis IC-167
Kingdom: Archaea
Replicon accession: NC_009954
Strand:
Start bp: 64773
End bp: 66455
Gene Length: 1683 bp
Protein Length: 560 aa
Translation table: 11
GC content: 44%
IMG OID: 641274552
Product: major facilitator transporter
Protein accession: YP_001539893
Protein GI: 159040641
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID: [TIGR00711] drug resistance transporter, EmrB/QacA subfamily
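
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon (the gene sequence in this record ends in TAG). The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 64773
end_bp = 66455

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A coding sequence should be a whole number of codons.
codon_count, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed values agree with the listed Gene Length (1683 bp) and Protein Length (560 aa).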


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.384329
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 54
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAGTACA AGTGGGTTGT ATTAACTAAT ACAACTCTAG GAGTCATAAT GTCATCAATA 
AACATGTACA TAGTCCTCAT CTCCCTACCA ACAATCTTCA GGGGGTTGAA CATTAATCCA
TTCCTACCCG GGGAGTTTGA TTATCTGCTT TGGGTGTTAA TGGGGTATAG TATAGTGCTG
GCAAGTGTCC TCGTAACCTT TGGAAGAATC TCAGACCTAT ATGGTAGGAC TAGACTTTAC
ACGCTTGGGT TCATAATATT CACCATAGCT TCTATTTTAC TCTCAGTGAT ACCGAGTGGT
TCAGGTAATT TAGGTGCATT ACTACTTATA GTATTTAGGA TGATTCAGGC AGTGGGTGGT
GGCTTCCTAA TGGTTAATAG TACCGCATTA TTAACTGATG CCTTCCCACC AGGTGAGAGG
GGTAAGGCAT TAGGGTTGAA TCAAACAGCC TTCATAATAG GGTCGTTCCT AGGTATAATA
CTTGGCGGCT TATTATCGAA CTATGACTGG CACCTACTCT TCCTAGTTAA TGTACCATTC
GCAGTGGCTG GGGCACTTTG GTCAGTGTTT AAGTTAAGGA GGGTGAGCAG TAGTGGTATT
AAAGTGAAGA TTGATTACTG GGGTAACGTA ACCCTAGCAG CGGGCCTTGT GTTAATATCC
CTTGGCTTCA CCTATGCATT AATGCCCTAT GGTAATTCTG AAATGGGGTG GCTTAACCCA
TACGTAGTAT TATCATTCAT AGTCGGGGTA ATTATGCTGG TTGCCTTCAT ACCAATAGAG
CTTAGGCAGG AGGCGCCCCT CTTTAATCTA TCATTATTTA AGGTGAGGCC ATTCACCTAC
GGCATAATGG CATTATTCCT CTCATCACTG GCGAGGGGCG CCTTAATGTT CCTATTAACA
ATCTGGCTTC AAGGCATATA CCTACCGCTA CATGGTTTCA GTTACGCTGA AACACCATTT
TGGGCAGGCA TATACATGCT TTCAATGCTC ATTGGAATGG TAATAATGGC GCCAATAGGA
GGTGCCTTAA CTGATAAGTA TGGTGCCAGG ATAGTGGCTA CAGTGGGTAT GATAATAATT
GCAGCATCAC TCTACCTACT CACTCTCCTG CCTTATAATT TTAACTTAAC ATCATTTGAA
TTAATACTAT TCCTAAATGG CCTCGGTAAT GGTTTATTCA GTTCCCCTAA CACAACATCA
ATAATGAATG CACTTCACCC TAAGGATAGG GGTGCTGGTA ATGGTATGAG GCAAACCTTC
AGTAACGTAG GCTCCACGAT AAGTATGGCA ATGTTTTTCT CAATAGCAAT GAGCATCTTC
TCCCAATACG TACCCGTTAG GATTCATGAA ATGGCGTTAA GCTACGGTTT ACCCGCAGAC
ATAGCATCCT CCCTCTCAAA GATACCTGCA TCAAGCCTAC TCTTCGCCGC ATTCCTGGGA
ATTGACCCGG CATCAGTATT ACCAAGCACC CTAACGGCTA ACCTGCCGGC AAGCATAATG
AAGGTCCTGG ATTCAAGCAC CTTCCTACCG AATGTACTTG GATCACCATT CATGATGGGT
TTAAGGATTT CCCTATACAT ATCCATAGTA CTGGTGGTGA TAGGAGCAGT ACTCTCATAC
ATGAGGGGAG GCAGGTATGT TTACGAGGAA GCTAAGGGGA AGGAAGAAGC CTATACGGCG
TAG
 
Protein sequence
MEYKWVVLTN TTLGVIMSSI NMYIVLISLP TIFRGLNINP FLPGEFDYLL WVLMGYSIVL 
ASVLVTFGRI SDLYGRTRLY TLGFIIFTIA SILLSVIPSG SGNLGALLLI VFRMIQAVGG
GFLMVNSTAL LTDAFPPGER GKALGLNQTA FIIGSFLGII LGGLLSNYDW HLLFLVNVPF
AVAGALWSVF KLRRVSSSGI KVKIDYWGNV TLAAGLVLIS LGFTYALMPY GNSEMGWLNP
YVVLSFIVGV IMLVAFIPIE LRQEAPLFNL SLFKVRPFTY GIMALFLSSL ARGALMFLLT
IWLQGIYLPL HGFSYAETPF WAGIYMLSML IGMVIMAPIG GALTDKYGAR IVATVGMIII
AASLYLLTLL PYNFNLTSFE LILFLNGLGN GLFSSPNTTS IMNALHPKDR GAGNGMRQTF
SNVGSTISMA MFFSIAMSIF SQYVPVRIHE MALSYGLPAD IASSLSKIPA SSLLFAAFLG
IDPASVLPST LTANLPASIM KVLDSSTFLP NVLGSPFMMG LRISLYISIV LVVIGAVLSY
MRGGRYVYEE AKGKEEAYTA
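
The relationship between the two sequences above can be illustrated by translating the first line of the gene sequence. This is a minimal sketch, not the database's own method. It assumes the standard amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in the start codons it allows, not in these assignments). The names `CODON_TABLE` and `translate` are hypothetical helpers introduced for the example.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# Assumption: standard amino-acid assignments ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna_blocks: str) -> str:
    """Translate a DNA string written in whitespace-separated blocks."""
    # Remove the spaces and line breaks used for the 10-character blocks.
    dna = "".join(dna_blocks.split()).upper()
    # Ignore any trailing bases that do not form a complete codon.
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First line of the gene sequence in this record (60 nt, 20 codons).
first_line = "ATGGAGTACA AGTGGGTTGT ATTAACTAAT ACAACTCTAG GAGTCATAAT GTCATCAATA"
peptide = translate(first_line)
print(peptide)  # MEYKWVVLTNTTLGVIMSSI
```

The result corresponds to the first two blocks of the protein sequence (MEYKWVVLTN TTLGVIMSSI). The final codon of the gene, TAG, maps to a stop and is not represented in the protein.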