Gene Information |
Locus tag | Dshi_0531 |
Symbol | araF1 |
ID | 5711984 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | - |
Start bp | 515852 |
End bp | 516841 |
Gene Length | 990 bp |
Protein Length | 329 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 641266433 |
Product | ribose/xylose/arabinose/galactoside ABC-type transport system protein |
Protein accession | YP_001531878 |
Protein GI | 159043084 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1879] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
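The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of the database.

```python
# Values copied from the Gene Information rows above.
START_BP = 515852
END_BP = 516841
GENE_LENGTH_BP = 990
PROTEIN_LENGTH_AA = 329


def span_length(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp):
    """Residue count for a complete CDS: number of codons minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(START_BP, END_BP)
computed_protein_length = protein_length(computed_gene_length)

print("gene length matches record:", computed_gene_length == GENE_LENGTH_BP)
print("protein length matches record:", computed_protein_length == PROTEIN_LENGTH_AA)
```

Under those assumptions both comparisons agree with the record: 516841 - 515852 + 1 = 990 bp, and 990 / 3 - 1 = 329 aa.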
| |
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGACCA AATTCACACT TCTGGCCAGC GTGGCAATGG CGTCGTCGGC ACTTTTTGCG ACACAGGCGG TGGCAGACGG CCACTCCAAG GATATCGCAA CCGTGGTCAA GATTGCCGGC ATCCAGTGGT TCAACCGCAT GGAAGAAGGC GTCAAGAAGT TTGCCGAGGA AACGGGCATG AACGCGTTTC AGGTCGGCCC CGCCCAAGCA GATCCGCAGC AGCAGGTCGC GCTGATCGAG GACATGATTG CCCAGGGCGT CGACGCGCTT GCCGTCGTGC CGATGTCCCC CGAAGCACTT GAGCCGGTCC TGGGCCGCGC GATGGAAGCA GGCATAACCG TCATCACTCA CGAGGCGGCG GCCCAGCAGA ACACGACCTA TGACCTTGAG GCATTCGTCA ACGAAGACTT CGGCGCGAAC CTGATGGAAC AGCTTGCCAC CTGCATGGGC GGCGAGGGCG AATACGCGGT GTTCGTCGGA TCGCTGACCT CCCAGACGCA CAACCAGTGG GTAGATGGTG CCATCGCCTA CCAAGAGGCT AACTATCCGA ACATGACGCT GGTCGGCGAC AAGAACGAGA CCTTCGACGA CGCCGAGCAG GCCTACACCA AGACGCAGGA GGTCCTGCGC GCGTTTCCGA ACATCAAGGG CATGCAAGGC TCTGCTTCGA CGGATGTCGC GGGCATCGGG CGCGCCATCG AAGAGCGCGG CATGGAAGAC GCCACCTGCG TTTTCGGCAC CTCCCTGCCC TCCATCGCCG GTCAGTATCT TGAAACCGGC GCGGTGGACG GCATCGGCTT CTGGGATCCG GCGGTTGCAG GCGAGGCCAT GAACAAGCTC GCGGTGATGG TGATGAACGG CGAGGAAGTC ACCGACGGCA TGGACCTTGG ACTGCCGGGC TATGAGAGCG TTTCGCTCGA TGGCAAGGTC ATCTACGGCC AGGCATGGGT AAATGTCGAT GCGGAAAACA TGAGCGAATA CCCGTTCTGA
|
Protein sequence | MKTKFTLLAS VAMASSALFA TQAVADGHSK DIATVVKIAG IQWFNRMEEG VKKFAEETGM NAFQVGPAQA DPQQQVALIE DMIAQGVDAL AVVPMSPEAL EPVLGRAMEA GITVITHEAA AQQNTTYDLE AFVNEDFGAN LMEQLATCMG GEGEYAVFVG SLTSQTHNQW VDGAIAYQEA NYPNMTLVGD KNETFDDAEQ AYTKTQEVLR AFPNIKGMQG SASTDVAGIG RAIEERGMED ATCVFGTSLP SIAGQYLETG AVDGIGFWDP AVAGEAMNKL AVMVMNGEEV TDGMDLGLPG YESVSLDGKV IYGQAWVNVD AENMSEYPF
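The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the database's own pipeline. It assumes the gene sequence is shown on its coding strand (it begins with ATG and ends with TGA), uses the standard codon assignments (which translation table 11 shares for internal codons, differing mainly in the start codons it permits), and ignores whitespace between the 10-base blocks. The helper names clean, translate and gc_percent are hypothetical.

```python
from itertools import product

# Standard codon assignments, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces between sequence blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a coding-strand DNA string, dropping one trailing stop codon."""
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = [CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3)]
    if residues and residues[-1] == "*":
        residues.pop()
    return "".join(residues)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks and last four codons of the gene sequence above.
first_blocks = "ATGAAGACCA AATTCACACT TCTGGCCAGC"
last_codons = "TACCCGTTCTGA"

print(translate(first_blocks))  # MKTKFTLLAS
print(translate(last_codons))   # YPF
```

The two printed fragments correspond to the first ten and last three residues of the protein sequence in the record. Passing the full gene sequence to translate and gc_percent would allow the whole protein sequence and the GC content field to be compared in the same way.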