Gene Rleg2_3476 details

Gene Information

Locus tag: Rleg2_3476
Symbol: araG
ID: 6982230
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011369
Strand:
Start bp: 3591167
End bp: 3592672
Gene Length: 1506 bp
Protein Length: 501 aa
Translation table: 11
GC content: 62%
IMG OID: 643398194
Product: L-arabinose transporter ATP-binding protein
Protein accession: YP_002282969
Protein GI: 209551052
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1129] ABC-type sugar transport system, ATPase component
TIGRFAM ID:
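
The start, end, and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the protein length excludes a single terminal stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3591167
end_bp = 3592672

# Assumption: coordinates are 1-based and inclusive on the replicon.
gene_length = end_bp - start_bp + 1

# Assumption: one terminal stop codon, not counted in the protein length.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1506
print(protein_length)  # 501
```

Under these assumptions the computed values agree with the reported gene length of 1506 bp and protein length of 501 aa.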


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTTTCC TCGAATTCAA CAATATCTCC AAGGGTTATC CCGGCGTGCA GGCGCTGGCG 
GATGTTTCAT TCTCAGTCGA GAAGGGCGCC GTGCACGGCC TGATGGGCGA GAACGGCGCG
GGCAAATCGA CGCTGATCCG CGTGCTATCA GGTGATCAGG CCGCCGATAC CGGCAGCATC
CTGATCGGGG CGGAGGAGCA GAAATACGGA TCCGTGCGTG ACGCCTTTCA TGCTGGTATC
GTCGTCATCC ATCAGGAATT GCAGCTCGTT CCGGAGCTGA CGGTGGCCGA AAATCTCTGG
CTCGGGCGTT TTCCGGCCAA GGGCGGCATG ATCCATTCGA GCAGGCTGAT CGAAACGGTG
CGGGGAAAGC TCGAAGAGAT CGGCATCGAC GTCGATCCGG CGGCCAAGGT CGCCACGCTT
TCGATCGGCG CGCGGCAGAT GGTCGAGATC GCCAAGGCCG TCATGCTCGA CGCACGGGTG
ATCGCGCTCG ATGAGCCGAC CTCCTCGCTT TCATCGCGCG AGAGCGAGAT CCTGTTTTCC
CTGATCGAGA GGCTGAAGGC GAAGGGAACG GTCATTCTCT ACGTCTCGCA TCGTCTCGAC
GAGATTTTTC GGCTTTGCGA CAGCCTAAGC GTGTTGCGCG ACGGCAAGCT TGCCGCCCAC
CATCCCGACA TCGCCGAGAC GACACGCGAG CAGATCATCT CGGAAATGGT CGGGCGCGAG
ATCAGTAATG TCTGGGGATG GCGCGAACGT CCGCTCGGCG ACATCAGGCT GGAGGTCAAG
GGCCTGTCGG GGCCGAGGCT GCGCAATCCC ATCGGTTTCT CCGTCCGCCA GGGCGAGATC
CTCGGCTTCT TCGGCCTGAT CGGCGCCGGC CGCAGCGAGA TGGCGCGGCT GCTCTACGGC
GCCGATGTCA GGCATCAGGG TCAGGTCGCG ATCGATAGCG TTGTCGTCTT GCCGAACAGT
CCGAAGGCGG CGATCAAGGC CGGCATGGTG CTCTGCCCGG AGGACCGCAA ATTCGACGGC
ATCGTCCAGG GCCGGTCGAT CGAAGAGAAT ATCGCGATTT CGTCGCGCCG GCATTTCTCG
CCCTTCGGCA TTCTGAGCCC GAAAAAAGAG GCGGCGCTGG CCGATCGGTT CATCGCCCGG
CTTCGGGTGC GAACCCCGTC GCGCAAGCAG GACATCATCA ATCTCTCCGG CGGCAACCAG
CAGAAGGTCA TTCTCGGCCG CTGGCTTTCC GAGCAGGGCA TCAAGGTCCT CGTCATAGAC
GAACCGACGC GCGGCATCGA CGTCGGGGCG AAATCGGAAA TCTACGAGAT CCTTTACGAA
CTTGCGGCCG GCGGCATGGC GATCGTGGTC ATATCAAGCG AATTGCCCGA GGTCATGGGC
ATCTGCGATC GCATCATGGT GATGTGTCAG GGCAAGGTGG CGGCCAATGT CGCCCGCCAG
GATTTCGACG AGCGCGCCAT CCTCACCGCT GCGCTCCCCG ATAAGAATGC CGCAGGCAGC
ATTTAG
 
Protein sequence
MAFLEFNNIS KGYPGVQALA DVSFSVEKGA VHGLMGENGA GKSTLIRVLS GDQAADTGSI 
LIGAEEQKYG SVRDAFHAGI VVIHQELQLV PELTVAENLW LGRFPAKGGM IHSSRLIETV
RGKLEEIGID VDPAAKVATL SIGARQMVEI AKAVMLDARV IALDEPTSSL SSRESEILFS
LIERLKAKGT VILYVSHRLD EIFRLCDSLS VLRDGKLAAH HPDIAETTRE QIISEMVGRE
ISNVWGWRER PLGDIRLEVK GLSGPRLRNP IGFSVRQGEI LGFFGLIGAG RSEMARLLYG
ADVRHQGQVA IDSVVVLPNS PKAAIKAGMV LCPEDRKFDG IVQGRSIEEN IAISSRRHFS
PFGILSPKKE AALADRFIAR LRVRTPSRKQ DIINLSGGNQ QKVILGRWLS EQGIKVLVID
EPTRGIDVGA KSEIYEILYE LAAGGMAIVV ISSELPEVMG ICDRIMVMCQ GKVAANVARQ
DFDERAILTA ALPDKNAAGS I
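
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the codon-to-amino-acid assignments of translation table 11 (the same as the standard code; the tables differ in which start codons are permitted) and, to stay short, translates only the first 60-base line of the gene sequence. The helper names `clean`, `gc_percent`, and `translate` are hypothetical and not part of any database tooling.

```python
from itertools import product

# Amino acids for the 64 codons enumerated in TCAG order. These assignments
# are shared by translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_block):
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq_block.split()).upper()


def gc_percent(dna):
    """Return the percentage of G and C bases in a cleaned DNA string."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence listed above.
first_line = "ATGGCTTTCC TCGAATTCAA CAATATCTCC AAGGGTTATC CCGGCGTGCA GGCGCTGGCG"
dna = clean(first_line)
print(translate(dna))  # MAFLEFNNISKGYPGVQALA
```

The output matches the first 20 residues of the protein sequence. Passing the full gene sequence through `clean` and `translate` should, under the same assumptions, reproduce the complete protein, and `gc_percent` can be used to compare against the reported GC content.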