Gene Information |
Locus tag | Dshi_3442 |
Symbol | |
ID | 5712500 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | + |
Start bp | 3626001 |
End bp | 3627206 |
Gene Length | 1206 bp |
Protein Length | 401 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 641269371 |
Product | major facilitator superfamily (MFS) transporter |
Protein accession | YP_001534776 |
Protein GI | 159045982 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 53 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTACTT CCGGATTCCT TCGTCAAAAC GCCCGTTGGC TAGGCGCCGG GGGGCTACTG GCGTTCTCCG GCAGCTATGG GCAGACCTTT TTCATCGCCA TCTTCGCCGG CGACATCCAA GAGAGTTTCG GGCTGAGCCA CGGCGCGTGG GGCGCCATCT ATGCACTGGG CACACTGGGC TCTGCTGCGG TCATGGTCTG GCTCGGCACC CTGACCGACC ATTATCGCGT CCGCCGCCTT GGGCCCTGGG CGCTTGCAGG CCTTGCTGCG GCCTGCCTGG CCATGGCGGT CAACCCTTGG GTCTGGGCCC TTCCACTGGT GGTGTTCGGC TTGCGTCTCG GCGGTCAGGG TATGGTGACC CATGTGGGCC GCGTGGCTAT GGCGCGCTGG TTCTCAGCCA ATCGCGGAAA GGCGATATCC GTGTCTTCGC TTGGGTATTC GGTGGGCGAA GCGTTTCTTC CGATCCTGTT CGTGGCGCTT TTGACCGTCG TCGAGTGGCG CGCGCTCTGG GGTCTTGCAG CCTTGCTGAG CCTCGCGGCC ATCCCGGTGC TTCTGGCCCT TTTGCACCGC GAACGCCAAC CGCAAAGCTA TGCCGAAGAC AGTCAGAGTA CTGGTTCGGG CGGCCGCCAC TGGACCCGGG CAGAGGTCGT GCGAGCACCT TTCTTCTGGG CGATTGTACC CGCACTTCTG TCGACGCCCG CCTTCGTGAC CGCATTCTTT TTTCAACAAG TCCATCTGGC GGCGGAAAAG GATATCGCCC ATCTCGATCT TGTCGCGCTT TTTCCGCTCT ATTCGGTGGT CTCGGTTCTC GCGATGCTGG GCACGGGATT TCTCGTGGAC CGGTTCGGCA CACGTCTCCT GATGTGCCTG TTCACCCTAC CCTTGGCACT GAGCTTCATC GTCATATCAG GCACCGAAAC CCTCTTGGGC CTCGCGGTCG GGCTAAGCCT TCTGGCGGTT ACGGTCGGCG CGAATAACAC ACTGCCCGCG GCCTTTTGGG CGGACATGTT CGGGACCCGC CATCTCGGCG CGATCAAATC GGTCGCCATG GCCCTTATGG TTTTGAGTTC CGCGATAGGG CCGCTGATTA CCGGTCTGGC CATCGATGCA GGGGCGTCCT TCCCTGCGCA AATGCCCTTG ATCAGTCTCT ATATCGCCCT CTCAGCCGGG CTGCTCGCCG CCGCTCTGTG GCGTACGGGA GCCTAA
|
Protein sequence | MTTSGFLRQN ARWLGAGGLL AFSGSYGQTF FIAIFAGDIQ ESFGLSHGAW GAIYALGTLG SAAVMVWLGT LTDHYRVRRL GPWALAGLAA ACLAMAVNPW VWALPLVVFG LRLGGQGMVT HVGRVAMARW FSANRGKAIS VSSLGYSVGE AFLPILFVAL LTVVEWRALW GLAALLSLAA IPVLLALLHR ERQPQSYAED SQSTGSGGRH WTRAEVVRAP FFWAIVPALL STPAFVTAFF FQQVHLAAEK DIAHLDLVAL FPLYSVVSVL AMLGTGFLVD RFGTRLLMCL FTLPLALSFI VISGTETLLG LAVGLSLLAV TVGANNTLPA AFWADMFGTR HLGAIKSVAM ALMVLSSAIG PLITGLAIDA GASFPAQMPL ISLYIALSAG LLAAALWRTG A
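The coordinate and length fields in this record can be cross-checked against each other. The sketch below is an illustration, not part of the source database. It assumes the Start bp and End bp values are 1-based and inclusive, and that the gene is an unspliced, complete CDS whose final codon is a stop codon (the gene sequence above ends in TAA). The function name `cds_lengths` is hypothetical.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene_length_bp, protein_length_aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon
    that is counted in the gene length but not in the protein length.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon.
    protein_length = gene_length // 3 - 1
    return gene_length, protein_length


# Values taken from the Start bp and End bp fields of this record.
gene_len, prot_len = cds_lengths(3626001, 3627206)
print(gene_len, prot_len)  # 1206 401
```

Both results agree with the Gene Length (1206 bp) and Protein Length (401 aa) fields listed above.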