Gene Rleg_6520 details


Gene Information

Locus tag: Rleg_6520
Symbol:
ID: 8017104
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012854
Strand:
Start bp: 234515
End bp: 235657
Gene Length: 1143 bp
Protein Length: 380 aa
Translation table: 11
GC content: 63%
IMG OID: 644828307
Product: peptidase M24
Protein accession: YP_002979507
Protein GI: 241554294
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0006] Xaa-Pro aminopeptidase
TIGRFAM ID:
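
As a quick consistency check on the coordinates and lengths listed above, the following Python sketch recomputes the gene and protein lengths. It assumes the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; neither assumption is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 234515
end_bp = 235657

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
stop_codon_nt = 3
protein_length = (gene_length - stop_codon_nt) // 3

print(gene_length)     # 1143
print(protein_length)  # 380
```

Under these assumptions the results agree with the listed Gene Length (1143 bp) and Protein Length (380 aa).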


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value: 0.180886
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGTGGC AGCACCCCGT ACCCCGCATC ACTGAGGACG AACGGCAGAA CCGCCTCGCC 
GGGCTTCGGA AACTGATCGA AGCCGAAGGA TTGGCTGCCG TGCTTCTTGG GCCGACCGAA
AGCCTCCACT ACTTCACCGG GCTCGTCTGG CATCCGAGCG AAAGGTTCCT CGGCGCGCTC
GTCATGCCCG CGACCATTTC CTACATCGTT CCGGGGTTCG AGCGCAGCCG TGTCGAAACG
CTGCCACATC TGCCGGGGGA AATCCTGGTC TGGGAAGAGG AGGAGAGCAG CGCCGCTCTC
ATCGCCCGCC TTGTTGCCCA GCGCGGCAGA CTTGCCCTCG ACGATGGCTT GCCGCTTTTC
TTCTATCACG CATTGGCAGC GGAGATGGGC GCGGCAAGGC TTGCCGATGG CGGGCGGCTG
ATCCGCGACC TGCGTTGCAT CAAATCGGCT GCAGAGCTTG CCCTCATTCA GTATGCGATG
GACCTGACGC TCGACGTCCA CAAGCAAGTG CATGGGCTTT TGAAGCCGGG CATCAAATCA
TCCGAGGTGG TCGAATTCAT CGACCGACAG CATCGCCAGG CCGGCGCCGA TGCCGGCTCG
ACGTTCTGCA TCGTCTCCTT CGGCGCGGCG ACCTCGCTTC CGCATGGCGC CGACGGCGAT
CAGGTCCTTG GTCGCGACGA CGTCGTTCTC GTCGATACCG GCTGCCGGAT CGACGGTTAT
CATTCCGATA TCACCAGGAC CTATATTCTG GAGGACGGCA ACAGCGCGTT CGAACGCGCC
TGGTGGATCG AGCGCGAGGC GCAACAGGCC GTCTTCGACG CAGCCCGGAT CGGCGCCGCC
TGCTCGAGCC TCGACGATGC GGCCCGCAAG GTGCTTGCCA AACACTCGCT AGGCCCCGAC
TATCGCCTGC CGGGTTTGCC GCATCGCGCC GGTCATGGCC TCGGGCTCGA GATCCACGAG
GAGCCATACA TCGTTCGCGG CAACGACGCG CCGCTTGCCG CCGGCATGTG TTTTTCCAAT
GAACCGATGA TCGTCTTCCC CGGGAAATTC GGGATCCGGT TGGAAGACCA TATCTACATG
ACCGCCGAGG GACCACGCTG GCTGACCAAT CCAGCGGCGG GACCGACAAA GCCATTCTCC
TGA
 
Protein sequence
MSWQHPVPRI TEDERQNRLA GLRKLIEAEG LAAVLLGPTE SLHYFTGLVW HPSERFLGAL 
VMPATISYIV PGFERSRVET LPHLPGEILV WEEEESSAAL IARLVAQRGR LALDDGLPLF
FYHALAAEMG AARLADGGRL IRDLRCIKSA AELALIQYAM DLTLDVHKQV HGLLKPGIKS
SEVVEFIDRQ HRQAGADAGS TFCIVSFGAA TSLPHGADGD QVLGRDDVVL VDTGCRIDGY
HSDITRTYIL EDGNSAFERA WWIEREAQQA VFDAARIGAA CSSLDDAARK VLAKHSLGPD
YRLPGLPHRA GHGLGLEIHE EPYIVRGNDA PLAAGMCFSN EPMIVFPGKF GIRLEDHIYM
TAEGPRWLTN PAAGPTKPFS
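
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal Python sketch that translates codons using the amino-acid assignments of translation table 11. The codon table is built by hand from the NCBI base ordering (T, C, A, G), alternative start codons are not handled, and the helper name `translate` is hypothetical rather than part of any tool used to produce this record.

```python
from itertools import product

# Codon-to-amino-acid assignments for translation table 11, listed in the
# conventional NCBI order (first base varies slowest, bases ordered T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used above) is ignored.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record above.
first_block = "ATGTCGTGGC AGCACCCCGT ACCCCGCATC"
print(translate(first_block))  # MSWQHPVPRI
```

The output matches the first ten residues of the protein sequence listed above. Passing the full gene sequence to the same function should, under the same assumptions, yield the complete protein.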