Gene Information |
Locus tag | Smed_3601 |
Symbol | araG |
ID | 5318435 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | + |
Start bp | 28238 |
End bp | 29764 |
Gene Length | 1527 bp |
Protein Length | 508 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640775415 |
Product | L-arabinose transporter ATP-binding protein |
Protein accession | YP_001312348 |
Protein GI | 150375752 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1129] ABC-type sugar transport system, ATPase component |
TIGRFAM ID | |
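The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single untranslated stop codon. Both are assumptions about this record's conventions and are not stated in the record itself. The variable names are illustrative.

```python
# Values copied from the record above.
start_bp = 28238
end_bp = 29764

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1527, matching "Gene Length"
print(protein_length)  # 508, matching "Protein Length"
```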
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.611963 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCAGGATT TTCTAGAATT CACATCAATT TCAAAGGGCT ATCCCGGCGT TCAGGCGCTT TCGGATGTCT CCTTTTCCGT GCGCAAGGGT GCCGTTCACG GACTGATGGG CGAAAACGGT GCGGGGAAAT CCACCCTTAT CCGGGTGCTC TCCGGTGACC AGGCAGCCGA CACCGGTGAA ATTCGCATCG ACGGCAGGCC GCAGCACTAT CGCTCGGTAC GCGACGCCTT CGACGCCGGC GTCATCGTCA TTCATCAGGA ACTGCAACTC GTGCCGGAGC TGACGGTTGC CGAGAACCTC TGGCTCGGCC GCTTTCCCGG CAGGGCCGGT GTCATCGATC GCCGGCAGCT TATCCGCGTC GTCGGTGAAA GGCTCGCGGA GATCGGTATC GACGTCGATC CCGGGGCGAA GGTTGCATCC TTGTCGATCG GCGAGCGCCA GATGGTAGAG ATTGCCAAGG CGGTGATGCT CGACGCCCGC GTCATTGCTC TCGACGAGCC GACATCGTCG CTCTCCTCGA GAGAAAGCGA GATCCTGTTC TCGCTGATCG ACAGGTTGCG GTCGAACGGT ACCGTCATTC TGTACGTTTC GCATCGCCTC GACGAGATTT TCCGCCTGTG CGACAGTCTG ACCGTTCTCA GGGACGGGCG GCTTGCGGCC CATCATCCGG ACGTTTCGAA AGTGACCCGC GATCAGATCA TCGCCGAGAT GGTCGGGCGC GAGATTTCCA ACATCTGGGG TTGGCGCGCC CGTCCTCTCG GAGACGCAAG GCTGACGGTC GAAGGCGTGT CCGGCGCCGC CCTGCCACAC CCGATCAGTT TTACCGCGCG CAGCGGCGAG ATTCTCGGCT TTTTCGGGCT GATCGGCGCG GGCCGCAGCG AAATGGCCCG CCTGGTCTAT GGCGCCGACA GCCGACGGCA GGGCACAGTC CTGGTGGACG GCGTTGCCGT ACGGGCCGAC AGTCCCCCGC ATTCGATCCG CGCCGGCATC GTTCTCTGCC CGGAAGACCG GAAATTCGAC GGCATCGTCC ACGGGAGATC CATCGAAGAG AACATGGCGA TCTCCTCCCG TCGCCATTTC TCTCGCTTCG GCATTCTCGA CCGCGGCAAG GAAGCGGAAC TTGCGGAGCG GTTCATCGCG AGGCTGCGCG TGCGCACGCC CTCCCGCCAC CAGGACATCG TCAATCTCTC CGGCGGAAAC CAGCAGAAGG TGATTCTTGG ACGCTGGTTG TCGGAAGAGG GCGTCAAGGT GCTGCTGATC GACGAGCCGA CCCGCGGCAT CGACGTTGGC GCGAAGTCGG AAATCTACGA AATCCTTTAC GAGCTTGCAG CGCAAGGCAT GGCGATCGTC GTCATCTCCA GCGAATTGCC GGAAGTGATG GGCATTGCGG ATCGCATTCT GGTGATGTGC GAAGGCCGCA TCGCGGCAGA AATTGCAAGG GAAGACTTCG ACGAGCACCG GATTCTCACA GCCGCGCTTC CGGATGCCTC CGCCACAAAA GCTCCCATTT CCGAACAGGT ACGCTAA
Protein sequence | MQDFLEFTSI SKGYPGVQAL SDVSFSVRKG AVHGLMGENG AGKSTLIRVL SGDQAADTGE IRIDGRPQHY RSVRDAFDAG VIVIHQELQL VPELTVAENL WLGRFPGRAG VIDRRQLIRV VGERLAEIGI DVDPGAKVAS LSIGERQMVE IAKAVMLDAR VIALDEPTSS LSSRESEILF SLIDRLRSNG TVILYVSHRL DEIFRLCDSL TVLRDGRLAA HHPDVSKVTR DQIIAEMVGR EISNIWGWRA RPLGDARLTV EGVSGAALPH PISFTARSGE ILGFFGLIGA GRSEMARLVY GADSRRQGTV LVDGVAVRAD SPPHSIRAGI VLCPEDRKFD GIVHGRSIEE NMAISSRRHF SRFGILDRGK EAELAERFIA RLRVRTPSRH QDIVNLSGGN QQKVILGRWL SEEGVKVLLI DEPTRGIDVG AKSEIYEILY ELAAQGMAIV VISSELPEVM GIADRILVMC EGRIAAEIAR EDFDEHRILT AALPDASATK APISEQVR
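Both sequences are printed in space-separated blocks of ten residues, so they need cleaning before any analysis. The sketch below is one hedged way to do that in Python: it strips the grouping spaces, computes a GC fraction, and translates the first 30 bases of the gene sequence. The helper names (`clean_sequence`, `gc_fraction`, `translate`) and the partial codon table are illustrative assumptions, not part of the record. The table holds only the codons that occur in the fragment; a full analysis would need the complete translation table 11.

```python
def clean_sequence(text: str) -> str:
    """Remove the whitespace used to group residues into blocks of ten."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a DNA string."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# Partial codon table (illustrative): only the codons present in the fragment
# below. These assignments are identical in the standard genetic code and in
# translation table 11.
PARTIAL_CODONS = {
    "ATG": "M", "CAG": "Q", "GAT": "D", "TTT": "F", "CTA": "L",
    "GAA": "E", "TTC": "F", "ACA": "T", "TCA": "S", "ATT": "I",
}


def translate(dna: str, table: dict) -> str:
    """Translate complete codons in reading frame 1; raises KeyError on unknown codons."""
    usable = len(dna) - len(dna) % 3
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))


# First three blocks of the gene sequence, copied from the record above.
fragment = clean_sequence("ATGCAGGATT TTCTAGAATT CACATCAATT")

print(len(fragment))
print(gc_fraction(fragment))
print(translate(fragment, PARTIAL_CODONS))  # MQDFLEFTSI, the first ten residues of the protein
```

The GC fraction of this short fragment is not expected to match the gene-wide GC content reported in the record, which is computed over all 1527 bases.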