Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dgeo_2203 |
Symbol | |
ID | 4056878 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Deinococcus geothermalis DSM 11300 |
Kingdom | Bacteria |
Replicon accession | NC_008025 |
Strand | - |
Start bp | 2324380 |
End bp | 2325486 |
Gene Length | 1107 bp |
Protein Length | 368 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641231246 |
Product | major facilitator transporter |
Protein accession | YP_605666 |
Protein GI | 94986302 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGGATTG GGGTGGTGGA CCCGATTCTG CCCGAGATCG GGCACCAGCT GGGCGCGACG CCAACCCAGG TCGAACTGCT CTTCACCGCC TACTTGGGTG TGATGGCCGT GATGACGCTG TTTGCAGGAA ACATCGGCGC GCGGCTGGGA CGCCGCCGGG TGGCGGTGAT CGGCCTAGCC CTGATCGCCC TGTTTGCGCT GGCCTGTGGG CTGAGCGGCA GTATTCCCGC GCTGGCGGTG TTTCGAGGTG GCTGGGGGCT GGGCAGCGCT CTCTTCACGC CAACCGCGCT GGTGCTGTTG CTCGCGCTGA TTGGCCACGC TGAAAAAGCC ATTATGCGCT ACGAGGCTGC CATCGGTCTG GGCATGAGCA TGGGGCCGTT GCTGGGCGGC GTGCTGGGCA GTCACGGCTG GCGTTTTCCC TTTCTGGGCG CGGCCACACT GATGCTGCTG GCGCTGGTCG CGGTCGCCGC CTTCGTGCGC GTGCCGGAGA GCCGGGAGCC GGTTCGTCCT GTGGGAGACG TGTTCCGCGC GTACCGCCAC CCGGCCTTTC TGGCGGTGGG TCTGACCGGG CTGCTGTACT ACTTCGGCTT CTTCTTGTTG CTGGGCTACA CGCCGCTCTT CCTGCACCTG GGTACCCTGG GGCTGGGCCT GACCTTCTTC GGCTGGGGCG TGCTGCTGGG GTTGGGGAGC ACCGTGCTGG TGGAACGCTT GCTGCGCCGT CTGCGGGCCA GCTGGATCGT CATCCTGGCG CTGGCCGGAC TGACCCTGCT GTTTGTGCTG CTGGGCTTCG CACCCCTGGC GGGCGGCGTC AAGATCGCGC TGGTCGTCCT GAGCGGCACC CTCTTCGGCC TGATGAATGC CCTGCTGACC ACGCTCAGCG TGGAAGTCGC CCACCTGCCG CGCGCCACCG CCACGAGCGC CTACAACTTC CTGCGCTGGT TGGGGGCGAC GGTGGCACCG GTGGGGAGCG GCTTTGTGGC CGAACACCTC GGCGCCCCTG TGCCCTACGC GTTCGGGGCT GCGGCGGTGG CCCTGGCCGT GCTGATCATG GCCTTTGCGG CCCGCTCCAT CGATGCGGCA CGAGCCAGCG ATCCGCACGT CCACTAA
|
Protein sequence | MGIGVVDPIL PEIGHQLGAT PTQVELLFTA YLGVMAVMTL FAGNIGARLG RRRVAVIGLA LIALFALACG LSGSIPALAV FRGGWGLGSA LFTPTALVLL LALIGHAEKA IMRYEAAIGL GMSMGPLLGG VLGSHGWRFP FLGAATLMLL ALVAVAAFVR VPESREPVRP VGDVFRAYRH PAFLAVGLTG LLYYFGFFLL LGYTPLFLHL GTLGLGLTFF GWGVLLGLGS TVLVERLLRR LRASWIVILA LAGLTLLFVL LGFAPLAGGV KIALVVLSGT LFGLMNALLT TLSVEVAHLP RATATSAYNF LRWLGATVAP VGSGFVAEHL GAPVPYAFGA AAVALAVLIM AFAARSIDAA RASDPHVH
|
| |