Gene Rleg_3764 details


Gene Information

Locus tag: Rleg_3764
Symbol: araG
ID: 8014594
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 3818175
End bp: 3819680
Gene Length: 1506 bp
Protein Length: 501 aa
Translation table: 11
GC content: 62%
IMG OID: 644826327
Product: L-arabinose transporter ATP-binding protein
Protein accession: YP_002977546
Protein GI: 241206450
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1129] ABC-type sugar transport system, ATPase component
TIGRFAM ID:
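
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3818175
END_BP = 3819680
GENE_LENGTH_BP = 1506
PROTEIN_LENGTH_AA = 501


def span_length(start: int, end: int) -> int:
    """Feature length for 1-based inclusive coordinates (assumed convention)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1506
print(expected_protein_length(GENE_LENGTH_BP))  # 501
```

Under these assumptions both derived values agree with the recorded gene length (1506 bp) and protein length (501 aa).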


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.639409
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTTTCC TCGAATTCAG CAATATCTCC AAGGGTTATC CCGGCGTGCA GGCGCTGGCG 
AATGTGTCCT TCACTGTCGA GAAGGGTGCC GTCCACGGCT TGATGGGTGA AAACGGCGCC
GGTAAATCGA CGCTGATCCG GGTACTGTCC GGCGATCAGG CCGCCGATGC CGGCAACATC
CTCATCGACG GCGAGGAGCA GAGATACGGG TCCGTCCGCG ACGCCTTCCA TGCCGGCGTC
ATCGTCATCC ATCAGGAACT GCAACTCGTT CCGGAGTTGA CCGTCGCCGA AAATCTCTGG
CTCGGACGCT TTCCGGCCAA GGGCGGCGTC ATCCATACGA AAGTGCTGAT CGAGACGGTG
CGGTCGAAGC TCGAGGAGAT AGGCATCGAT GTCGATCCGT CGGCCAAGGT CGCCTCGCTT
TCGATCGGTG CGCGGCAGAT GGTCGAGATC GCCAAGGCCG TCATGCTCGA TGCACGGGTG
ATCGCGCTCG ACGAGCCGAC CTCCTCGCTT TCCTCGCGCG AAAGCGAAAT CCTGTTTTCG
CTCATCGACA GGCTGAAGGC GCAGGGAACG GTCATTCTCT ACGTCTCGCA TCGTCTCGAT
GAGATCTTTC GTCTTTGCGA CAGCCTGACG GTGCTGCGCG ACGGCAAGCT TGCCGCCCAC
CATCCTCAGA TCGCCGAAAC CACGCGCGAG CAGATCATCT CGGAAATGGT CGGACGCGAG
ATCAGCAATG TCTGGGGATG GCGCGAACGT CCGTTCGGCG GCATTCGGCT GGAGGTCAAC
GGCCTGTCGG GGCCGAGGCT GCGCCATCCG ATCAGCTTTT CCGTCCGCGA GGGCGAAATC
CTCGGTTTCT TCGGCCTGAT CGGCGCTGGC CGTAGCGAGA TGGCGCGGCT GCTTTACGGC
GCCGATGCCA GGCATCAGGG CCAGGTGACC ATCGACGGCG TTGCCGTTTC GCCGAACAAT
CCGAAAGCGG CGATCAATGC CGGCATGGTG CTTTGCCCCG AGGACCGCAA GTTCGACGGC
ATCGTCCAGG GCCGATCGAT CGAAGAAAAT ATCGCGATCT CGTCACGCCG GCACTTTTCG
CCCTTCGGCA TTCTGAGCCC GAGACAAGAG GCGGCGCTGG CAGATCGGTT CATCGCCAAG
CTTCGGGTGC GAACACCGTC GCGCAAGCAG GACATCATCA ATCTATCGGG CGGCAACCAG
CAGAAGGTCA TTCTCGGCCG CTGGCTGTCC GAGCAGGGGA TCAAGGTGCT GGTCATCGAC
GAGCCGACGC GCGGCATCGA TGTCGGGGCG AAATCGGAAA TCTACGAAAT TCTCTATGAG
CTTGCGGCCG GCGGCATGGC GATCGTGGTG ATCTCCAGCG AGTTGCCCGA AGTCATGGGC
ATCTCCGATC GCATCATGGT GATGTGCCAG GGCAGGGTGG CGGCCAACGT CGCCCGTCCG
GATTTCGACG AGCGCAGCAT CCTGACGGCA GCGCTTCCCG ACAAGAATGC CGCAGGCACC
CTTTAG
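
A GC content figure such as the one in the Gene Information section can be recomputed from a sequence printed in this 10-base block layout. The sketch below assumes GC content is the fraction of G and C among the A/C/G/T characters, with whitespace and any other symbols ignored; how the database itself derives and rounds its value is not stated in this record, and `gc_content` is a hypothetical helper name.

```python
def gc_content(seq: str) -> float:
    """Fraction of G/C among A/C/G/T characters; spaces and newlines are skipped."""
    bases = [c for c in seq.upper() if c in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc_count = sum(1 for c in bases if c in "GC")
    return gc_count / len(bases)


# Usage: paste the blocked sequence as-is; here only the first row is used.
first_row = "ATGGCTTTCC TCGAATTCAG CAATATCTCC AAGGGTTATC CCGGCGTGCA GGCGCTGGCG"
print(f"{gc_content(first_row):.1%}")
```

Passing the full gene sequence instead of the first row gives the value to compare with the recorded percentage.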
 
Protein sequence
MAFLEFSNIS KGYPGVQALA NVSFTVEKGA VHGLMGENGA GKSTLIRVLS GDQAADAGNI 
LIDGEEQRYG SVRDAFHAGV IVIHQELQLV PELTVAENLW LGRFPAKGGV IHTKVLIETV
RSKLEEIGID VDPSAKVASL SIGARQMVEI AKAVMLDARV IALDEPTSSL SSRESEILFS
LIDRLKAQGT VILYVSHRLD EIFRLCDSLT VLRDGKLAAH HPQIAETTRE QIISEMVGRE
ISNVWGWRER PFGGIRLEVN GLSGPRLRHP ISFSVREGEI LGFFGLIGAG RSEMARLLYG
ADARHQGQVT IDGVAVSPNN PKAAINAGMV LCPEDRKFDG IVQGRSIEEN IAISSRRHFS
PFGILSPRQE AALADRFIAK LRVRTPSRKQ DIINLSGGNQ QKVILGRWLS EQGIKVLVID
EPTRGIDVGA KSEIYEILYE LAAGGMAIVV ISSELPEVMG ISDRIMVMCQ GRVAANVARP
DFDERSILTA ALPDKNAAGT L
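
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a sketch, not the database's own method: it uses the standard codon-to-amino-acid assignments, which translation table 11 shares for internal codons (table 11 differs in which codons may act as start codons, and this gene begins with ATG). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon table in TCAG order; amino-acid assignments are those of the
# standard code, which translation table 11 also uses for elongation.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA string (whitespace allowed) until the first stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGGCTTTCC TCGAATTCAG CAATATCTCC"))  # MAFLEFSNIS
# Last 6 bases: one leucine codon followed by the TAG stop codon.
print(translate("CTTTAG"))  # L
```

The two outputs match the first ten residues and the final residue of the protein sequence listed above.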