Gene Smed_3601 details


Gene Information

Locus tag: Smed_3601
Symbol: araG
ID: 5318435
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009620
Strand:
Start bp: 28238
End bp: 29764
Gene Length: 1527 bp
Protein Length: 508 aa
Translation table: 11
GC content: 62%
IMG OID: 640775415
Product: L-arabinose transporter ATP-binding protein
Protein accession: YP_001312348
Protein GI: 150375752
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1129] ABC-type sugar transport system, ATPase component
TIGRFAM ID:
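
The coordinates and lengths in this record are mutually consistent. The short sketch below checks that arithmetic, assuming the start and end positions are 1-based and inclusive (the record does not state the convention) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 28238
end_bp = 29764

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be the stop codon.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, codons, protein_length)
```

Under these assumptions the computed gene length (1527 bp) and protein length (508 aa) agree with the values listed in the record.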


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.611963
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGGATT TTCTAGAATT CACATCAATT TCAAAGGGCT ATCCCGGCGT TCAGGCGCTT 
TCGGATGTCT CCTTTTCCGT GCGCAAGGGT GCCGTTCACG GACTGATGGG CGAAAACGGT
GCGGGGAAAT CCACCCTTAT CCGGGTGCTC TCCGGTGACC AGGCAGCCGA CACCGGTGAA
ATTCGCATCG ACGGCAGGCC GCAGCACTAT CGCTCGGTAC GCGACGCCTT CGACGCCGGC
GTCATCGTCA TTCATCAGGA ACTGCAACTC GTGCCGGAGC TGACGGTTGC CGAGAACCTC
TGGCTCGGCC GCTTTCCCGG CAGGGCCGGT GTCATCGATC GCCGGCAGCT TATCCGCGTC
GTCGGTGAAA GGCTCGCGGA GATCGGTATC GACGTCGATC CCGGGGCGAA GGTTGCATCC
TTGTCGATCG GCGAGCGCCA GATGGTAGAG ATTGCCAAGG CGGTGATGCT CGACGCCCGC
GTCATTGCTC TCGACGAGCC GACATCGTCG CTCTCCTCGA GAGAAAGCGA GATCCTGTTC
TCGCTGATCG ACAGGTTGCG GTCGAACGGT ACCGTCATTC TGTACGTTTC GCATCGCCTC
GACGAGATTT TCCGCCTGTG CGACAGTCTG ACCGTTCTCA GGGACGGGCG GCTTGCGGCC
CATCATCCGG ACGTTTCGAA AGTGACCCGC GATCAGATCA TCGCCGAGAT GGTCGGGCGC
GAGATTTCCA ACATCTGGGG TTGGCGCGCC CGTCCTCTCG GAGACGCAAG GCTGACGGTC
GAAGGCGTGT CCGGCGCCGC CCTGCCACAC CCGATCAGTT TTACCGCGCG CAGCGGCGAG
ATTCTCGGCT TTTTCGGGCT GATCGGCGCG GGCCGCAGCG AAATGGCCCG CCTGGTCTAT
GGCGCCGACA GCCGACGGCA GGGCACAGTC CTGGTGGACG GCGTTGCCGT ACGGGCCGAC
AGTCCCCCGC ATTCGATCCG CGCCGGCATC GTTCTCTGCC CGGAAGACCG GAAATTCGAC
GGCATCGTCC ACGGGAGATC CATCGAAGAG AACATGGCGA TCTCCTCCCG TCGCCATTTC
TCTCGCTTCG GCATTCTCGA CCGCGGCAAG GAAGCGGAAC TTGCGGAGCG GTTCATCGCG
AGGCTGCGCG TGCGCACGCC CTCCCGCCAC CAGGACATCG TCAATCTCTC CGGCGGAAAC
CAGCAGAAGG TGATTCTTGG ACGCTGGTTG TCGGAAGAGG GCGTCAAGGT GCTGCTGATC
GACGAGCCGA CCCGCGGCAT CGACGTTGGC GCGAAGTCGG AAATCTACGA AATCCTTTAC
GAGCTTGCAG CGCAAGGCAT GGCGATCGTC GTCATCTCCA GCGAATTGCC GGAAGTGATG
GGCATTGCGG ATCGCATTCT GGTGATGTGC GAAGGCCGCA TCGCGGCAGA AATTGCAAGG
GAAGACTTCG ACGAGCACCG GATTCTCACA GCCGCGCTTC CGGATGCCTC CGCCACAAAA
GCTCCCATTT CCGAACAGGT ACGCTAA
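
The record reports a GC content of 62% for this gene. As a minimal sketch of how such a figure can be derived from a sequence printed in blocks of ten, the helper below strips the block spacing and counts G and C bases. The function names `clean_sequence` and `gc_percent` are hypothetical, and the example input is only the first line of the gene sequence, not the whole gene, so its value is not expected to match the reported figure.

```python
def clean_sequence(raw: str) -> str:
    """Remove block spacing and line breaks, and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line (60 bases) of the gene sequence listed above.
first_line = "ATGCAGGATT TTCTAGAATT CACATCAATT TCAAAGGGCT ATCCCGGCGT TCAGGCGCTT"
first_line_gc = gc_percent(first_line)
print(f"{first_line_gc:.1f}% GC over {len(clean_sequence(first_line))} bases")
```

Passing the complete gene sequence to `gc_percent` gives the value for the whole gene.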
 
Protein sequence
MQDFLEFTSI SKGYPGVQAL SDVSFSVRKG AVHGLMGENG AGKSTLIRVL SGDQAADTGE 
IRIDGRPQHY RSVRDAFDAG VIVIHQELQL VPELTVAENL WLGRFPGRAG VIDRRQLIRV
VGERLAEIGI DVDPGAKVAS LSIGERQMVE IAKAVMLDAR VIALDEPTSS LSSRESEILF
SLIDRLRSNG TVILYVSHRL DEIFRLCDSL TVLRDGRLAA HHPDVSKVTR DQIIAEMVGR
EISNIWGWRA RPLGDARLTV EGVSGAALPH PISFTARSGE ILGFFGLIGA GRSEMARLVY
GADSRRQGTV LVDGVAVRAD SPPHSIRAGI VLCPEDRKFD GIVHGRSIEE NMAISSRRHF
SRFGILDRGK EAELAERFIA RLRVRTPSRH QDIVNLSGGN QQKVILGRWL SEEGVKVLLI
DEPTRGIDVG AKSEIYEILY ELAAQGMAIV VISSELPEVM GIADRILVMC EGRIAAEIAR
EDFDEHRILT AALPDASATK APISEQVR
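
The protein sequence follows from the gene sequence under translation table 11. The sketch below shows one way to check this. It assumes the amino-acid assignments of NCBI table 11, which are the same as those of the standard code, and it does not handle alternative start codons such as GTG or TTG; that is not a problem here because the gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon table laid out in NCBI order: bases T, C, A, G, third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw_dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(raw_dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence listed above.
print(translate("ATGCAGGATT TTCTAGAATT CACATCAATT TCAAAGGGCT ATCCCGGCGT TCAGGCGCTT"))
print(translate("GCTCCCATTT CCGAACAGGT ACGCTAA"))
```

The two example calls reproduce the first twenty residues (MQDFLEFTSI SKGYPGVQAL) and the last eight residues (APISEQVR) of the protein sequence above.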