Gene Rleg_1995 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_1995 
Symbol 
ID8013031 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp1992959 
End bp1994053 
Gene Length1095 bp 
Protein Length364 aa 
Translation table11 
GC content63% 
IMG OID644824582 
Productrare lipoprotein A 
Protein accessionYP_002975814 
Protein GI241204718 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0797] Lipoproteins 
TIGRFAM ID[TIGR00413] rare lipoprotein A 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.359065 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.0392025 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACTGG ATTACGGGGC GGCGTCTTTC GCAACGACAG CACGGTGGCT GGCGATCTCG 
GCGATGTGTG CGACGGTTGC GGCGTGCGGC ACGACACAGG CGGTGCCGAA GAAAAAATCC
CACGGCAAAG AATACTTCTC CGAATCCGAA TACGGCGTGA AGGCGAGCCC GCGGGTCGCT
ACCGGCAACA ATATCCCGAA GGGCGGCGGC CGCTACATCG TCGGCAACCC CTACGAGGTG
AAGGGCAAGT GGTATTATCC GAAGGAAGAT TTCGCCTATA ACAAGGTCGG CGTCGCCTCC
TGGTATGGTT CGGCCTTTCA TGGGCGCCTG ACGGCCAACG GCGAAGTCTA CGACCAAATG
CATCTTTCAG CTGCACATCC GACCTTCCCA CTGCCGAGCT ATGCGCGCGT CACCAATCTC
GAAAGTGGCT CCTCCGTCAT CGTGCGCGTC AACGATCGCG GCCCCTATCA TGCAGGCCGT
ATCATCGACC TTTCGAACAA GACGGCCGAC ATGCTGGATC TGCAGCACAG CGGCACCGGC
AAAGTGCGCG TGCAATATGT CGGCCGCGCC CGCATGGACG GCCACGACAT GCCCTATCTG
ATGGCCTCCT ACGCGCCGAA GGGCAGCCGC CTTCCCGGCG TCAATCCGGA AGGCCAGATC
GCAACCGGCG TCATGGTCGC CTCCAACAGC CGCAAGATCA CCCGCGACCA GCTGCAGAGT
TCGGAAGACT ACGAGACGCC GGCCAACGTG CCGGTGCCTA GGTCGGCGAC TTCTTATGCC
GGCTCGACGC CGAGCGCCCG CAACAATGCG GCAGCCGCCG CGGCACCGGC GGCCCATGTT
CTGGTGGCGC CATCAGCGCC GTCCTTCAAC AATGGCGCGC AGGCCATGGA CCAGATGGTC
GTGCTGCCGG AAATCGGTCC GATGCCCTAT GAGCGGCCCC AGAATTCGCT GGCCCTCGGC
TACCAGAACG AAGAGGTGAA GACGGTGACC GTCGATCTCG CCTTCGACGC GGTGATGGTG
CGCAATGACG GGCTGACGCA AGAGTCCATC CTTGCCTCGG CCAAGCGCCA GCACGCAAAG
TCCGCCGCCC GCTGA
 
Protein sequence
MKLDYGAASF ATTARWLAIS AMCATVAACG TTQAVPKKKS HGKEYFSESE YGVKASPRVA 
TGNNIPKGGG RYIVGNPYEV KGKWYYPKED FAYNKVGVAS WYGSAFHGRL TANGEVYDQM
HLSAAHPTFP LPSYARVTNL ESGSSVIVRV NDRGPYHAGR IIDLSNKTAD MLDLQHSGTG
KVRVQYVGRA RMDGHDMPYL MASYAPKGSR LPGVNPEGQI ATGVMVASNS RKITRDQLQS
SEDYETPANV PVPRSATSYA GSTPSARNNA AAAAAPAAHV LVAPSAPSFN NGAQAMDQMV
VLPEIGPMPY ERPQNSLALG YQNEEVKTVT VDLAFDAVMV RNDGLTQESI LASAKRQHAK
SAAR