Gene Rleg2_0930 details

Gene Information

Locus tag: Rleg2_0930
Symbol: pepN
ID: 6979648
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011369
Strand:
Start bp: 948564
End bp: 951212
Gene Length: 2649 bp
Protein Length: 882 aa
Translation table: 11
GC content: 63%
IMG OID: 643395641
Product: aminopeptidase N
Protein accession: YP_002280450
Protein GI: 209548533
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0308] Aminopeptidase N
TIGRFAM ID: [TIGR02414] aminopeptidase N, Escherichia coli type
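
The coordinates and lengths listed above agree with each other, which can be checked with a few lines of arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 948564
end_bp = 951212

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases.
codons, leftover = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and encodes no residue.
protein_length = codons - 1

print(gene_length, leftover, protein_length)  # 2649 0 882
```

Under these assumptions the result matches the listed Gene Length (2649 bp) and Protein Length (882 aa).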


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.182731
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGAACAG ATACCGGCCA GGTCATTCAT CTGGCAGATT ACCGTCCCAC CGACTTTGTG 
CTGGAACGCG TGGACCTGAC CTTCGAGCTT GACCCGACGG AGACAAAGGT CGAGGCACGC
CTGATCTTTC ATCGCCGCCC GGGCGCCGAT CCGGCAGCGC CGATCGTGCT CGACGGCGAC
GAGTTGACGC TGTCGGGGCT GCTCTTCGAC CAGGTGGAGC TCGACCCTTC GCGTTATGAC
GCAACAGCGG AAAGCCTGAC GGTGCGCGAC CTGCCGGAAA GCGCGCCCTT CGAGCTGACG
ATCACGACGG TGATCAATCC CGAGGCCAAC ACCCAGCTGA TGGGCCTTTA TCGCACCGGC
GGCATCTACT GCACGCAATG CGAAGCGGAA GGCTTCCGCC GCATCACCTA TTTCCCTGAC
CGGCCCGACG TGCTTGCACC GTTCACGGTC AATATCATCG CCGACAAGGA CGCCAGCCCG
CTGCTCCTCT CGAACGGCAA CTTCCTCGGC GGCGCCGGCT ACGGCCCCGG CAAGCATTTC
GCCGCCTGGT TCGACCCGCA TCCGAAGCCG AGCTATCTCT TTGCGCTCGT TGCCGGCGAT
CTCGGCGTCG TCGAAGACAC GTTCACCACG ATGACCGGCC GCGAGGTGGT GCTGAAGATC
TATGTCGAGC ACGGCAAGGA GCCGCGCGCA GCCTATGCCA TGGACGCGCT GAAGCGCTCG
ATGGCATGGG ACGAAGAGAG CTTCGGACGC GAATACGACC TCGACATCTT CATGATCGTC
GCTGTCTCCG ACTTCAACAT GGGCGCGATG GAGAACAAGG GGCTCAACGT CTTCAACGAC
AGATACGTCC TTGCCGATCC CGAGACCGCG ACCGATGCCG ACTACGCCAA TATCGAAGCG
GTCATCGCGC ATGAATATTT CCACAATTGG ACCGGCAACC GCATCACCTG CCGCGACTGG
TTCCAGCTCT GCCTCAAGGA AGGGCTGACG GTCTATCGCG ATCAGGAATT CTCCTCCGAC
CAGCGCTCGC GCCCGGTCAA GCGCATCGCC GATGTGCGCC ATCTGAAATC CGAGCAATTC
CCGGAAGATG GCGGCCCTCT CGCCCATCCG GTGCGGCCGA CGACCTATCG TGAGATCAAC
AATTTCTACA CGAGGACCGT CTACGAGAAA GGCAGCGAAG TGACGCGGAT GATCGCGACG
CTGCTCGGCA AGGACGACTT CAAGAAGGGC ATGGACCTCT ATTTCGACCG CCATGACGGC
CAGGCGGTGA CGATCGAAGA TTTTGTTAAA TGCTTCGAGG ATGCGAGCGG ACGCGACCTC
ACGCAATTCT CTCTCTGGTA CCATCAGGCC GGCACGCCGC TGGTCACCGC ATCGGGCAGC
TATGATGCGG CGGCGACCAC CTTCACCCTG TCGCTCGAAC AGATGACGCC TGCCACGCCC
GGCCAGTCGA GCAAGGAACC GATGCATATT CCGCTGAGCC TGGCGCTCTT CGGCGAAAAC
GGCGGCAAGC TCGAGCCGAG CTCGGTCGAC GGCGCCGAAT ATGCCGGCGA GGTGCTGCAT
CTCACCGGCC GCACGCAGAC CGTGGTGTTC CATGGCATCG GCTCCCGGCC GGTCGTGTCG
ATCAACCGCA GCTTCTCGGC GCCGATCAAC CTGCATTTCG ATCAGAGCCC GGCCGATCTC
GCCCATCTCG CCCGCCATGA GACCGATCAT TTCGCCCGCT GGCAGGCGTT GACCGATCTG
GCGCTGCCGA ACCTCTTGAA AGCCGCCCGC GACGCCCGCG AAGGCAAGCC TGTCGTCTGC
GAGGCGACCT TCGTCGAAAC GCTGATTGCC GCCGCCGCCG ACGACAGCCT CGAGCCCGCC
TTCCGCGCCC AGGCGCTGGC GCTGCCGAGC GAATCCGACA TCGCCCGCGA ACTCGGCGGC
AATAACGATC CCGATGCCAT TCATGCCGGC CGGCAGGCGA TCCTGAAACA GGTCGCCGAG
GCCGGAAAGG ATGTTTTTGC CGGCCTCTAT GCCGCGACGA CGACATCGGG CGATTTCAGC
CCGGACGCAA AGAGCGCCGG TCTCAGGGCA CTGCGCAATA CGGCGCTGAC CTATCTCTCA
TATGCCGAGC AGACGCCGGC CCGCGCCAGG GCCGCTTTCG ATGCGGCCAA CAACATGACC
GATCTCAGCC ATGCGCTGAC GATCCTCGCC CATCGTTTCC CCGACAGCGC CGAGACCGCC
GAGGCGCTCG CCGCCTTCCG TCAGCGCTTT GCGGAGAATG CCCTCGTCAT CGACAAATGG
TTTGCTATCC AGGCCGGCAT TCCCGGCGCA AAAGCCCTTG AGCGGGTCCG CACCCTGATG
GACGATCCGC TGTTCAAGCG GACCAATCCG AACCGGATGC GGTCACTGGT CGGCACCTTC
GCCTTTGCCA ATCCGACCGG CTTCGGCCGC GCCGACGGCG AAGGCTATCG TTTCCTCGCC
CGTCAGATTC TCGATATCGA CGAGCGCAAT CCGCAGCTTG CCGCCCGCAT TCTGACTTCG
ATGCGCTCCT GGCGCTCGCT CGAACCGGGG CGGGCCGATC ACGCCCGCGC GGCGCTGAAC
GAGATCGAGC AGGCTGCCGC TCTTTCCACA GACGTGCGCG ATATCGTCGA GCGCACGCTG
AAGGGGTAA
 
Protein sequence
MRTDTGQVIH LADYRPTDFV LERVDLTFEL DPTETKVEAR LIFHRRPGAD PAAPIVLDGD 
ELTLSGLLFD QVELDPSRYD ATAESLTVRD LPESAPFELT ITTVINPEAN TQLMGLYRTG
GIYCTQCEAE GFRRITYFPD RPDVLAPFTV NIIADKDASP LLLSNGNFLG GAGYGPGKHF
AAWFDPHPKP SYLFALVAGD LGVVEDTFTT MTGREVVLKI YVEHGKEPRA AYAMDALKRS
MAWDEESFGR EYDLDIFMIV AVSDFNMGAM ENKGLNVFND RYVLADPETA TDADYANIEA
VIAHEYFHNW TGNRITCRDW FQLCLKEGLT VYRDQEFSSD QRSRPVKRIA DVRHLKSEQF
PEDGGPLAHP VRPTTYREIN NFYTRTVYEK GSEVTRMIAT LLGKDDFKKG MDLYFDRHDG
QAVTIEDFVK CFEDASGRDL TQFSLWYHQA GTPLVTASGS YDAAATTFTL SLEQMTPATP
GQSSKEPMHI PLSLALFGEN GGKLEPSSVD GAEYAGEVLH LTGRTQTVVF HGIGSRPVVS
INRSFSAPIN LHFDQSPADL AHLARHETDH FARWQALTDL ALPNLLKAAR DAREGKPVVC
EATFVETLIA AAADDSLEPA FRAQALALPS ESDIARELGG NNDPDAIHAG RQAILKQVAE
AGKDVFAGLY AATTTSGDFS PDAKSAGLRA LRNTALTYLS YAEQTPARAR AAFDAANNMT
DLSHALTILA HRFPDSAETA EALAAFRQRF AENALVIDKW FAIQAGIPGA KALERVRTLM
DDPLFKRTNP NRMRSLVGTF AFANPTGFGR ADGEGYRFLA RQILDIDERN PQLAARILTS
MRSWRSLEPG RADHARAALN EIEQAAALST DVRDIVERTL KG
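
The gene and protein listings above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The sketch below is a minimal, dependency-free illustration of that cleanup followed by codon-by-codon translation. It assumes the amino acid assignments of NCBI translation table 11, which are the same as those of the standard code (the tables differ in which start codons they permit). The helper names `clean` and `translate` are hypothetical, and only the first and last rows of the gene sequence are used as input.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G at each position).
# Assumption: table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(blocked: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(blocked.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last rows of the gene sequence, copied from the listing.
first_row = "ATGCGAACAG ATACCGGCCA GGTCATTCAT CTGGCAGATT ACCGTCCCAC CGACTTTGTG"
last_row = "AAGGGGTAA"

print(translate(clean(first_row)))  # MRTDTGQVIHLADYRPTDFV
print(translate(clean(last_row)))   # KG
```

The two outputs correspond to the first twenty residues and the last two residues of the protein sequence shown above. Applying the same functions to the full gene listing would be expected to reproduce the complete protein.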