Gene Cmaq_0084 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCmaq_0084 
Symbol 
ID5710357 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaldivirga maquilingensis IC-167 
KingdomArchaea 
Replicon accessionNC_009954 
Strand
Start bp99770 
End bp101539 
Gene Length1770 bp 
Protein Length589 aa 
Translation table11 
GC content44% 
IMG OID641274587 
Productmajor facilitator transporter 
Protein accessionYP_001539928 
Protein GI159040676 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.0483176 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.0703431 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAGGGAAG GGTTTTTAAG CGATAGGGTG TTAAGCGGTA ATTATATGTC TAATGAGCCA 
TCGGAGCTAA TTAGTGAGGA GAGGAGGAGG GCAATCCTGG TTAACTCATT CCTGGGATCA
TTAATGGCTT CAATGACTAT GTCAGCAATA ATAATAGCGT TACCTGATGT ATTAAGGGGT
ATTGGTGTTG ATCCAATGTC ACCACTTGGC TTTACATCAA TGCTTTGGTT AATGTTCTCA
TACCCATTAA TGGTTGCTGT GGCTGTACCA ATAGTGGGTA GGTTATCCGA CATGTATGGT
AGGGGTAGAA TGTTCACGAT AGGTGATGCA GTATTCACAA TACTCTCAAC GCTACTAGGC
TTAGTGCCAG GATATGGATT AGTGGCAGCA TTACAGATGA TTGCTTACAG GTTTATTCAA
GGCTTAGGTG GATCCATGAT GTTTACGAAT AGTGCCGCAA TAATAACCGA CGTCTACCCA
CCCCACAGGA GGGGTGTCGC TATGGGTATT GTCAGCATAG CCTTCAGTGC AGGTAGCATA
ATAGGCCTAG TTATAGGCGG TGTATTAGCT GTAATTAACT GGAGGCTGGT TTTCCTAATT
AATACGCCAA TAGGCATAAT CAGTACCATA TGGGCTTACT TAACGGTATA TAAGTTACCG
TTAGGCATTA AGAAGGTTAA GGTTGATTAC ATAGGTGCAT CAATGCTTGC TGCATCACTT
GTCCTTCTCC TTCTAGGCAT AACATTCGGT ATGCTGCCTT CGGGGAACTC ATCAATGAGT
TGGGGGAATC CAACCGTATG GGGACTAATA GGTGGTGGAT TACTGCTCCT GGCTTTACTG
ATACCGATTG AAATGAGGAT TAAGGAGCCT ATACTTAGGA TTAACTTATT TAAGATAAGG
CCATTCACGT TCGGTGTATT AAGTGCATTA TTCCTGTTCC TAGCTCAAGG TGCAAACGTA
TTCGTTTTAT CACTACTACT GCAGGCAATA TACCTCCCAA TGCATGGAGT ACCTTACTCT
GAAACGCCAC TATTGGCTGG CATATACCTA ATACCGAGTA GTGTGGCTAA TGCCATATTT
GCCCCATTGG GTGGTAGATT AATTAATAGG TTTGGAGCCA GGGTTGTTTC AACAATCGGT
GCAATACTAC TGGGGATTAG CTTCGAGCTG CTGACTACGC TTTCAATGAA CTTTAATTAC
ACTCTATTCG CAGCCGACTT ACTCCTAATG GGTGCTGGTT CAGGCTTATT CCAGTCCCCT
AACTTAGTCT CAATAATGAG TTCAGTACCC CAGGAGGATA GGTCAGCGGC ATCTGGGTTA
AGGGCAAGCA TGCAGAACAT AGGGTTATTA ATGAGTTTCG CAGTATTCCT AACACTCATA
TTAGCTGGAT CAGCGGCATC ATTATCATTA TCACTAAGTA AGGCGTTAAT TAACGCTGGT
GTTCCTCAAA GCGACGTAGC GGCATTATCA AGAATACCCC CAGCCTATGC CTTATTCGCA
GCATTCATGG GTTATGACCC AATAAAAGTC ATGCTTAGTG AAGCTGGTAT TCAATTACCT
AGTAGCATTT ACGCCGCTGT GACTCACCCA TCATTCTTCC CAAGCGCCAT AGCCCCAGCT
ATGGCTATGG GTTTCGAGTA CGCCTACCAC ATAGCCGCTG TAATGGCGTT TGCGGCGGCG
GTGTTCTCGT ACTTAAGGGG TAGGGAGCAT ATTGTTCATC AAGTTAAGTT ACTGGAGAGT
GAAAACGGTA AGAGACCTTT CACTGAGTAG
 
Protein sequence
MREGFLSDRV LSGNYMSNEP SELISEERRR AILVNSFLGS LMASMTMSAI IIALPDVLRG 
IGVDPMSPLG FTSMLWLMFS YPLMVAVAVP IVGRLSDMYG RGRMFTIGDA VFTILSTLLG
LVPGYGLVAA LQMIAYRFIQ GLGGSMMFTN SAAIITDVYP PHRRGVAMGI VSIAFSAGSI
IGLVIGGVLA VINWRLVFLI NTPIGIISTI WAYLTVYKLP LGIKKVKVDY IGASMLAASL
VLLLLGITFG MLPSGNSSMS WGNPTVWGLI GGGLLLLALL IPIEMRIKEP ILRINLFKIR
PFTFGVLSAL FLFLAQGANV FVLSLLLQAI YLPMHGVPYS ETPLLAGIYL IPSSVANAIF
APLGGRLINR FGARVVSTIG AILLGISFEL LTTLSMNFNY TLFAADLLLM GAGSGLFQSP
NLVSIMSSVP QEDRSAASGL RASMQNIGLL MSFAVFLTLI LAGSAASLSL SLSKALINAG
VPQSDVAALS RIPPAYALFA AFMGYDPIKV MLSEAGIQLP SSIYAAVTHP SFFPSAIAPA
MAMGFEYAYH IAAVMAFAAA VFSYLRGREH IVHQVKLLES ENGKRPFTE