Gene Information |
Locus tag | Cmaq_0084 |
Symbol | |
ID | 5710357 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caldivirga maquilingensis IC-167 |
Kingdom | Archaea |
Replicon accession | NC_009954 |
Strand | - |
Start bp | 99770 |
End bp | 101539 |
Gene Length | 1770 bp |
Protein Length | 589 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 641274587 |
Product | major facilitator transporter |
Protein accession | YP_001539928 |
Protein GI | 159040676 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
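The coordinate and length fields above can be cross-checked against each other. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes the terminal stop codon (the listed gene sequence ends in TAG). The function names are hypothetical and are not part of any database tooling.

```python
# Values copied from the Gene Information table.
start_bp = 99770
end_bp = 101539
gene_length_bp = 1770
protein_length_aa = 589


def span_length(start: int, end: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumption)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = expected_protein_length(computed_gene_length)

print(computed_gene_length == gene_length_bp)
print(computed_protein_length == protein_length_aa)
```

Under these assumptions, both comparisons agree with the record: 101539 - 99770 + 1 = 1770 bp, and 1770 / 3 - 1 = 589 aa.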
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0483176 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 0.0703431 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAGGGAAG GGTTTTTAAG CGATAGGGTG TTAAGCGGTA ATTATATGTC TAATGAGCCA TCGGAGCTAA TTAGTGAGGA GAGGAGGAGG GCAATCCTGG TTAACTCATT CCTGGGATCA TTAATGGCTT CAATGACTAT GTCAGCAATA ATAATAGCGT TACCTGATGT ATTAAGGGGT ATTGGTGTTG ATCCAATGTC ACCACTTGGC TTTACATCAA TGCTTTGGTT AATGTTCTCA TACCCATTAA TGGTTGCTGT GGCTGTACCA ATAGTGGGTA GGTTATCCGA CATGTATGGT AGGGGTAGAA TGTTCACGAT AGGTGATGCA GTATTCACAA TACTCTCAAC GCTACTAGGC TTAGTGCCAG GATATGGATT AGTGGCAGCA TTACAGATGA TTGCTTACAG GTTTATTCAA GGCTTAGGTG GATCCATGAT GTTTACGAAT AGTGCCGCAA TAATAACCGA CGTCTACCCA CCCCACAGGA GGGGTGTCGC TATGGGTATT GTCAGCATAG CCTTCAGTGC AGGTAGCATA ATAGGCCTAG TTATAGGCGG TGTATTAGCT GTAATTAACT GGAGGCTGGT TTTCCTAATT AATACGCCAA TAGGCATAAT CAGTACCATA TGGGCTTACT TAACGGTATA TAAGTTACCG TTAGGCATTA AGAAGGTTAA GGTTGATTAC ATAGGTGCAT CAATGCTTGC TGCATCACTT GTCCTTCTCC TTCTAGGCAT AACATTCGGT ATGCTGCCTT CGGGGAACTC ATCAATGAGT TGGGGGAATC CAACCGTATG GGGACTAATA GGTGGTGGAT TACTGCTCCT GGCTTTACTG ATACCGATTG AAATGAGGAT TAAGGAGCCT ATACTTAGGA TTAACTTATT TAAGATAAGG CCATTCACGT TCGGTGTATT AAGTGCATTA TTCCTGTTCC TAGCTCAAGG TGCAAACGTA TTCGTTTTAT CACTACTACT GCAGGCAATA TACCTCCCAA TGCATGGAGT ACCTTACTCT GAAACGCCAC TATTGGCTGG CATATACCTA ATACCGAGTA GTGTGGCTAA TGCCATATTT GCCCCATTGG GTGGTAGATT AATTAATAGG TTTGGAGCCA GGGTTGTTTC AACAATCGGT GCAATACTAC TGGGGATTAG CTTCGAGCTG CTGACTACGC TTTCAATGAA CTTTAATTAC ACTCTATTCG CAGCCGACTT ACTCCTAATG GGTGCTGGTT CAGGCTTATT CCAGTCCCCT AACTTAGTCT CAATAATGAG TTCAGTACCC CAGGAGGATA GGTCAGCGGC ATCTGGGTTA AGGGCAAGCA TGCAGAACAT AGGGTTATTA ATGAGTTTCG CAGTATTCCT AACACTCATA TTAGCTGGAT CAGCGGCATC ATTATCATTA TCACTAAGTA AGGCGTTAAT TAACGCTGGT GTTCCTCAAA GCGACGTAGC GGCATTATCA AGAATACCCC CAGCCTATGC CTTATTCGCA GCATTCATGG GTTATGACCC AATAAAAGTC ATGCTTAGTG AAGCTGGTAT TCAATTACCT AGTAGCATTT ACGCCGCTGT GACTCACCCA TCATTCTTCC CAAGCGCCAT AGCCCCAGCT ATGGCTATGG GTTTCGAGTA CGCCTACCAC ATAGCCGCTG TAATGGCGTT TGCGGCGGCG GTGTTCTCGT ACTTAAGGGG TAGGGAGCAT ATTGTTCATC AAGTTAAGTT ACTGGAGAGT GAAAACGGTA AGAGACCTTT CACTGAGTAG
|
Protein sequence | MREGFLSDRV LSGNYMSNEP SELISEERRR AILVNSFLGS LMASMTMSAI IIALPDVLRG IGVDPMSPLG FTSMLWLMFS YPLMVAVAVP IVGRLSDMYG RGRMFTIGDA VFTILSTLLG LVPGYGLVAA LQMIAYRFIQ GLGGSMMFTN SAAIITDVYP PHRRGVAMGI VSIAFSAGSI IGLVIGGVLA VINWRLVFLI NTPIGIISTI WAYLTVYKLP LGIKKVKVDY IGASMLAASL VLLLLGITFG MLPSGNSSMS WGNPTVWGLI GGGLLLLALL IPIEMRIKEP ILRINLFKIR PFTFGVLSAL FLFLAQGANV FVLSLLLQAI YLPMHGVPYS ETPLLAGIYL IPSSVANAIF APLGGRLINR FGARVVSTIG AILLGISFEL LTTLSMNFNY TLFAADLLLM GAGSGLFQSP NLVSIMSSVP QEDRSAASGL RASMQNIGLL MSFAVFLTLI LAGSAASLSL SLSKALINAG VPQSDVAALS RIPPAYALFA AFMGYDPIKV MLSEAGIQLP SSIYAAVTHP SFFPSAIAPA MAMGFEYAYH IAAVMAFAAA VFSYLRGREH IVHQVKLLES ENGKRPFTE
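The GC content field reports the share of G and C bases in the gene sequence. A minimal sketch of that calculation follows, assuming the sequence is supplied as shown in this record (uppercase bases in space-separated blocks of ten). The helper name `gc_percent` is hypothetical, and the database may round or compute the value differently.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace between blocks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First two blocks of the gene sequence listed above, used as a short example.
first_blocks = "TTGAGGGAAG GGTTTTTAAG"
print(gc_percent(first_blocks))
```

Passing the full gene sequence string to the same function gives a value that can be compared with the GC content field in the Gene Information table.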