Gene Rleg2_5951 details

Gene Information

Locus tag: Rleg2_5951
Symbol:
ID: 6977337
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011366
Strand:
Start bp: 364626
End bp: 365456
Gene Length: 831 bp
Protein Length: 276 aa
Translation table: 11
GC content: 61%
IMG OID: 643393403
Product: extracellular solute-binding protein family 3
Protein accession: YP_002278221
Protein GI: 209546331
COG category: [E] Amino acid transport and metabolism; [T] Signal transduction mechanisms
COG ID: [COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
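
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the reported protein length excludes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 364626
end_bp = 365456

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")      # compare with "Gene Length"
print(protein_length, "aa")   # compare with "Protein Length"
```

Under these assumptions the computed values agree with the reported 831 bp and 276 aa.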


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.0245803
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.0107841
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTCAAAT CTATGCTGAC GCGCCGAAAC GCGATGCTCG GTGCCGCGGC CCTGGTGGCT 
GCCGTCACCT TGGCGCAACC GGCCGCCGCC ATCACGCCTG ACGAAATCAA GGCTCGCGGC
AAGATCATCG TCGGAATTCA GGGCGACAAC CCGCCTTGGG GCTTTGTGAC CAGCGGCGGC
AAGCAGGACG GCCTCGACGC CGACATTGCG ACGCTGTTCG CCAAGGAACT GGGCGTTTCC
GTCGAGTTCG TGCCGCTCGA AGTCAACAAC CGCATTCCGG CGCTGACGGC CGGCCGCGTC
GATGTTCTGT TTGCAACCAT GGCAATGCTG CCGGATCGCG CCAAGGCGGT GCAGTTCAGC
AAGCCCTATG TTGCCAATGC CATCGTTCTG ATCGGCCCCA AATCGGCGGA GATCAAGACC
AACGCCGACA TGGCCAAGTT CACGGTCGGC GTCGCCAAGG GCGCTGCGCA GGACACGCAG
GTGACGAAGA ACGCGCCTGA GGGCACGACG ATCCGCCGCT ATGACGGAGA CGCTGCGAGC
GTCCAGGCCC TGGTGTCCGG CCAGGTCGAC ACGCTGGGCG GCAACATTTT CTATATGGAC
CGGGTGAACA AGGCGCGCCC GGGCGAATTC GAAAACAAGC TTGAATTCCA GAAGCTCTAC
AACGGTGCTT GCACGCGTCT CGGGGAGAAG GAAATCAATG CGGCGCTGAA CACCTTCATC
GACAAGATCA AGACAAACGG CGATCTCAAG GCCGTCTACG ACAAGTGGAT GAAGGTCCCG
GTTCCGGAGT TCCCGGAAAA GCTGGAAGGC ATTCCGTTCG CGGCGAACTG A
 
Protein sequence
MFKSMLTRRN AMLGAAALVA AVTLAQPAAA ITPDEIKARG KIIVGIQGDN PPWGFVTSGG 
KQDGLDADIA TLFAKELGVS VEFVPLEVNN RIPALTAGRV DVLFATMAML PDRAKAVQFS
KPYVANAIVL IGPKSAEIKT NADMAKFTVG VAKGAAQDTQ VTKNAPEGTT IRRYDGDAAS
VQALVSGQVD TLGGNIFYMD RVNKARPGEF ENKLEFQKLY NGACTRLGEK EINAALNTFI
DKIKTNGDLK AVYDKWMKVP VPEFPEKLEG IPFAAN
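
The gene and protein sequences above can be checked against each other, and against the GC content field, with a short script. This is a minimal sketch: the helper names (`clean`, `gc_content`, `translate`) are hypothetical, the codon table is built by hand from the standard amino-acid assignments (which translation table 11 shares with table 1), and the alternative start codons allowed by table 11 are not handled because this gene begins with ATG.

```python
# Amino-acid assignments for NCBI translation table 11, in TCAG codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [x + y + z for x in BASES for y in BASES for z in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean(seq):
    """Remove the spaces and line breaks used to display 10-base blocks."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Return the percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, as displayed in the record.
print(translate("ATGTTCAAAT CTATGCTGAC GCGCCGAAAC"))  # MFKSMLTRRN
```

Passing the full gene sequence to `translate` and `gc_content` would allow a comparison with the listed protein sequence and the reported 61% GC content.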