Gene Rleg_1079 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_1079 
SymbolpepN 
ID8012205 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp1057038 
End bp1059686 
Gene Length2649 bp 
Protein Length882 aa 
Translation table11 
GC content63% 
IMG OID644823662 
Productaminopeptidase N 
Protein accessionYP_002974913 
Protein GI241203817 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0308] Aminopeptidase N 
TIGRFAM ID[TIGR02414] aminopeptidase N, Escherichia coli type 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.189522 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGAACAG ATACCGGCCA GGTCATTCAT CTGGCAGATT ACCGTCCCAC CGACTTCGTG 
CTGGAACGTG TGGACCTGAC CTTCGAACTC GACCCGACTG AGACAAAGGT CGAGGCGCGT
CTGATCTTTC ATCGTCGCCC GGGCGCCGAT CCGGCAGCGC CGATCGTTCT CGACGGCGAC
GAACTGACGC TGTCGGGGCT GCTGCTCGAC CAGGTGGAGC TGGACCCTTC GCGTTACGAC
GCAGCACCGG AAAGCCTGAC GGTGCGCGAC CTGCCGGAGA GCGCGCCCTT CGAACTGACG
ATCACCACCG TCATCAACCC TGAGGCCAAC ACCAAGCTGA TGGGCCTTTA CCGCACCGGC
GGCATCTACT GCACGCAGTG CGAGGCGGAG GGTTTCCGCC GCATCACCTA TTTCCCCGAC
CGGCCCGACG TGCTTGCGCC GTTCACGGTC AACATCATCG CCGACAAGGA CGCCAACCCG
CTGCTTTTGT CGAACGGCAA CTTCCTCGGC GGCGCCGGCT ACGGCCCCGG CAAACATTTC
GCCGCCTGGT TCGATCCGCA TCCGAAGCCG AGCTATCTCT TCGCGCTCGT CGCTGGCGAT
CTCGGCGTTG TCGAAGACAC GTTCACGACC ATGTCCGGTC GCGAGGTGGT GCTGAAGATC
TATGTCGAGC ACGGCAAGGA GCCGCGCGCA GCCTATGCCA TGGACGCGCT GAAACGCTCG
ATGAAGTGGG ACGAAGAGAG GTTCGGCCGC GAATACGATC TCGACATCTT CATGATCGTC
GCCGTCTCCG ATTTCAACAT GGGCGCGATG GAGAACAAGG GCCTCAACGT CTTCAACGAC
AAATACGTGC TTGCCGATCC CGAGATCGCC ACCGATGCCG ACTATGCCAA TATCGAGACG
ATCATCGCGC ATGAATATTT TCACAACTGG ACCGGCAACC GCATCACCTG CCGCGACTGG
TTCCAGCTGT GCCTCAAGGA AGGCCTGACG GTCTATCGAG ACCACGAATT CTCTTCCGAC
CAGCGCTCGC GCGCCGTCAA GCGCATCGCC GAAGTGCGCC ACCTGAAATC GGAGCAGTTC
CCGGAAGATA GCGGCCCGCT CGCCCATCCG GTGCGGCCGA CGACATATCG CGAGATCAAC
AATTTCTACA CGACGACCGT CTACGAAAAG GGCAGCGAAG TCACGCGCAT GATCGCGACG
TTGCTCGGCA AGGACGGCTT CAAGAAGGGC ATGGACCTCT ATTTCGACCG CCATGACGGT
GAGGCCGTGA CGATCGAGGA TTTCGTCAAA TGCTTCGAGG ATGCAAGCGG GCGCGACCTC
GCGCAATTTT CGCTCTGGTA CCATCAGGCC GGCACGCCGC TCGTCACCGC ATCGGGCAGC
TATGATGCGG CAGCCGGCAG CTTCACCCTG TCGCTCGAAC AGATGATCCC TGCAACGCCC
GGCCAGCCGA GCAAGGAGCC GATGCATATT CCGCTCAGCC TCGCGCTGTT TGGCGAAAAC
GGCGGCAAGA TCGAGCCGAC CTCGGTCGAC GGCGCGGAAT ATGCCGGCGA GGTGCTGCAT
CTCACCGGCC GCACGCAGAC GGCCGTGTTC CATGGCGTTG GCTCGCGGCC GGTCGTTTCG
ATCAACCGCA GCTTCTCGGC GCCGATCAAC CTGCATTTCG ATCAGAGCCC GGCCGATCTT
GCCCATCTCG CCCGCCATGA GACCGATCAT TTTGCCCGCT GGCAGGCATT GACCGATCTG
GCGCTGCCGA ACCTGCTGAA AGCGGCACGC GACGCCCGCG AGGGCAAGCC TGTGATCTGC
GAGACGACCT TCGTCGAGAC GCTGATTGCC GCCGCTGCCG ACGAGAGCCT CGAGCCCGCC
TTCCGCGCCC AGGCGCTGGC TCTGCCGAGC GAATCCGATA TCGCCCGTGA ACTCGGCGGC
AACAATGACC CCGATGCCAT CCATGCCGGC CGGCAGGCGG TCCTGAAACA GATCGCCGAT
GCCGGAAAGG ATGTCTTTGC CGGCCTCTAC GCAGCGATGA CGACATCAGG CGATTTCAAC
CCGGATGCGA AGAGCGCCGG CCTGCGGGCG CTGCGCAATA GCGCCCTCAC CTACCTCTCG
CATGCCGAAG AGACGCCGAC CCGCGCCAAG GCCGCCTTCG ATGCGGCCAA CAATATGACC
GATCTCAGCC ATGCGCTGAC CATCCTCGCC CATCGTTTTC CCGACAGCGC GGAGACGAGC
GAGGCGCTCG CCACCTTCCG TGACCGCTTC GCGGAAAATG CGCTCGTCAT CGACAAATGG
TTCGCGATCC AGGCCGGCAT TCCGGGCGCA AAAACCCTGG GACGGGTCCG CGCGCTGATG
GACGATCCGC TCTTCAAGCG CACCAATCCG AACCGGATGC GGTCGCTGGT CGGCACCTTC
GCCTTTGCCA ACCCCACCGG ATTCGGCCGC GCCGATGGCG AAGGCTATCA CTTCCTCTCG
GATCAGATTC TCGATATCGA CGGGCGCAAC CCGCAGCTTG CCGCCCGCAT TCTCACCTCG
ATGCGCTCCT GGCGCTCGCT CGAACCCGTG CGTGCCGATC ACGCCCGCTC GGCGCTGATC
GAGATCGAGC GAGCCACCGA TCTTTCGACC GACGTGCGCG ACATCGTCGA GCGCACGCTT
AAGGGGTAA
 
Protein sequence
MRTDTGQVIH LADYRPTDFV LERVDLTFEL DPTETKVEAR LIFHRRPGAD PAAPIVLDGD 
ELTLSGLLLD QVELDPSRYD AAPESLTVRD LPESAPFELT ITTVINPEAN TKLMGLYRTG
GIYCTQCEAE GFRRITYFPD RPDVLAPFTV NIIADKDANP LLLSNGNFLG GAGYGPGKHF
AAWFDPHPKP SYLFALVAGD LGVVEDTFTT MSGREVVLKI YVEHGKEPRA AYAMDALKRS
MKWDEERFGR EYDLDIFMIV AVSDFNMGAM ENKGLNVFND KYVLADPEIA TDADYANIET
IIAHEYFHNW TGNRITCRDW FQLCLKEGLT VYRDHEFSSD QRSRAVKRIA EVRHLKSEQF
PEDSGPLAHP VRPTTYREIN NFYTTTVYEK GSEVTRMIAT LLGKDGFKKG MDLYFDRHDG
EAVTIEDFVK CFEDASGRDL AQFSLWYHQA GTPLVTASGS YDAAAGSFTL SLEQMIPATP
GQPSKEPMHI PLSLALFGEN GGKIEPTSVD GAEYAGEVLH LTGRTQTAVF HGVGSRPVVS
INRSFSAPIN LHFDQSPADL AHLARHETDH FARWQALTDL ALPNLLKAAR DAREGKPVIC
ETTFVETLIA AAADESLEPA FRAQALALPS ESDIARELGG NNDPDAIHAG RQAVLKQIAD
AGKDVFAGLY AAMTTSGDFN PDAKSAGLRA LRNSALTYLS HAEETPTRAK AAFDAANNMT
DLSHALTILA HRFPDSAETS EALATFRDRF AENALVIDKW FAIQAGIPGA KTLGRVRALM
DDPLFKRTNP NRMRSLVGTF AFANPTGFGR ADGEGYHFLS DQILDIDGRN PQLAARILTS
MRSWRSLEPV RADHARSALI EIERATDLST DVRDIVERTL KG