Gene Rleg_4551 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_4551 
Symbol 
ID8015943 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp4680148 
End bp4681401 
Gene Length1254 bp 
Protein Length417 aa 
Translation table11 
GC content62% 
IMG OID644827128 
Productpeptidase M29 aminopeptidase II 
Protein accessionYP_002978328 
Protein GI241207232 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2309] Leucyl aminopeptidase (aminopeptidase T) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.123351 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTTTCA TGCCCCAGAG CCAGAACACA ATTGATCCCG TCAAGCTGGA AAAACTTGCG 
GAAGTCGCCG TCAAGGTCGG GCTGCAGCTG CAAAAAGGTC AGGATCTGGT GATCACCGCG
CCTGTCGTGG CGCTGCCGCT GGTTCGCCTG ATCACCAAGC ATGCCTATCT GGCCGGCGCC
GGACTGGTCT CCGCCTTCTA TTCCGACGAG GAAACGACGC TGGCGCGCTA TCAATATGGC
AGCGACGAGA GCTTCGACCG CGCCTCCGGC TGGCTCTACG AGGGCATGGC CAAGGCCTAT
GCCAACGGGG CGGCCCGTCT TGCGGTCGCC GGCGACAATC CGATGCTGCT GTCCGAGCAG
GATGCCGGCA AGGTCGGCCG CGCCAATCGC GCCAACTCAA CGGCCTACAA GCCGGCGCTG
GAGAAGATCT CGAATTTCGA CATCAACTGG AACATCGTCT CCTACCCGAA CCCATCCTGG
GCCAAGGTGG TCTTCCCCGA CGATCCGGAA CCGATTGCGA TTGCCAAGCT CGCCAAGGCG
ATCTTTGCCG CCTCGCGCGT CGATGTCAGC GATCCCGTCG CCGCCTGGGC CGAGCACAAT
GCCAATCTTG GCAAGCGATC CGCCTGGCTG AACGGCGAGC GTTTCGCCTC GCTGCATTTC
CAGGGACCGG GTACCGACCT GACGATCGGC CTTGCCGACG GGCATGAATG GCATGGCGGC
GCTTCCACCG CCAAGAACGG CATTACCTGC AATCCGAACA TCCCGACCGA GGAAGTCTTC
ACCACGCCGC ATGCGCTGCG CGTCGAAGGC CATGTGTCGA GCACCAAGCC GCTCTCGCAC
CAGGGCACGT TGATCGACAA TATCCAGGTA CGTTTCGAGG GTGGGCGCAT CGTCGAGGCC
AAGGCCTCGC GCGGCGAAGA GGTCTTGAAC AAGGTGCTCG ATACCGACGA GGGCGCGCGC
CGGCTCGGCG AAGTGGCGCT GGTGCCGCAT TCCTCACCGA TCTCGGCCAG CGGCATCCTG
TTCTACAACA CGCTGTTCGA CGAAAACGCC TCGTGCCACA TCGCACTCGG CCAGTGCTAT
TCCAAGTGCT TCCTCGATGG CGCGACACTG AGCCAGGAGC AGATCAAGGC GCAGGGCGGC
AATTCCAGCC TGATCCATAT CGACTGGATG ATCGGCTCGG ACAAAGTCGA TATCGACGGC
ATCAAGCCGG ATGGTTCACG GGTTCCGGTG ATGCGGCAGG GCGAATGGGC CTGA
 
Protein sequence
MTFMPQSQNT IDPVKLEKLA EVAVKVGLQL QKGQDLVITA PVVALPLVRL ITKHAYLAGA 
GLVSAFYSDE ETTLARYQYG SDESFDRASG WLYEGMAKAY ANGAARLAVA GDNPMLLSEQ
DAGKVGRANR ANSTAYKPAL EKISNFDINW NIVSYPNPSW AKVVFPDDPE PIAIAKLAKA
IFAASRVDVS DPVAAWAEHN ANLGKRSAWL NGERFASLHF QGPGTDLTIG LADGHEWHGG
ASTAKNGITC NPNIPTEEVF TTPHALRVEG HVSSTKPLSH QGTLIDNIQV RFEGGRIVEA
KASRGEEVLN KVLDTDEGAR RLGEVALVPH SSPISASGIL FYNTLFDENA SCHIALGQCY
SKCFLDGATL SQEQIKAQGG NSSLIHIDWM IGSDKVDIDG IKPDGSRVPV MRQGEWA