Gene Rleg_7091 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_7091 
Symbol 
ID8022377 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012858 
Strand
Start bp500823 
End bp501653 
Gene Length831 bp 
Protein Length276 aa 
Translation table11 
GC content60% 
IMG OID644833928 
Productextracellular solute-binding protein family 3 
Protein accessionYP_002985062 
Protein GI241666978 
COG category[E] Amino acid transport and metabolism
[T] Signal transduction mechanisms 
COG ID[COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.51636 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCAAGT CTATGCTAAC GCGTCGAAAC GCGATGCTCG GAGCCGCCGC CCTCGTGGCA 
GCCGTCACCC TGGCGCAGCC GGCCGCCGCC GTCACGCCCG ACGAAATCAA GGCTCGTGGC
AAGATCATCG TCGGAATTCA GGGCGACAAT CCGCCTTGGG GCTTTGTGAC CAGCGGCGGC
AAGCAGGACG GCCTCGACGC CGACATCGCA ACGCTGTTTG CCAAGGAACT CGGCGTTTCC
GTCGAGTTCG TGCCGCTTGA AGTCAACAAC CGCATTCCCG CACTGACGGC CGGCCGGGTC
GACGTTCTGT TCGCAACGAT GGCGATGCTG CCGGATCGCG CAAAAGCCGT GCAGTTCAGC
AAGCCCTATG TTGCCAATGC CATCGTCCTG ATCGGTCCGA AAAAGGCTGA GATCAAGACG
AATGCCGACA TGGCCAAGTT CACGGTCGGC GTCGCCAAGG GGGCTGCTCA GGACACGCAG
GTCACCAAGA ACGCGCCGCC CAGCACCACA ATCCGCCGAT ATGACGGAGA CGCCGCAAGC
GTCCAGGCGC TGGTGTCCGG CCAGGTCGAA ACGCTTGGTG GCAACATCTT CTACATGGAC
CGGCTGGAGA AGGCCCGTCC GGGCGAATTC GAAAACAAGC TTGAATTCCA GAAGCTCTAC
AACGGTGCTT GCACCCGTCT CGGCGAAAAG GAAATCAATG CGGCGCTGAA CACCTTCATC
GACAAGATCA AGGCCAACGG CGAACTCAAA ACCGTCTACG ACAAGTGGAT GAAGGTTCCG
GTACCGGAAT TCCCGGAAAC ACTGGAAGGC ATTCCGTTCG CGGCGAAGTG A
 
Protein sequence
MFKSMLTRRN AMLGAAALVA AVTLAQPAAA VTPDEIKARG KIIVGIQGDN PPWGFVTSGG 
KQDGLDADIA TLFAKELGVS VEFVPLEVNN RIPALTAGRV DVLFATMAML PDRAKAVQFS
KPYVANAIVL IGPKKAEIKT NADMAKFTVG VAKGAAQDTQ VTKNAPPSTT IRRYDGDAAS
VQALVSGQVE TLGGNIFYMD RLEKARPGEF ENKLEFQKLY NGACTRLGEK EINAALNTFI
DKIKANGELK TVYDKWMKVP VPEFPETLEG IPFAAK