Gene Smed_0642 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_0642 
SymbolpepN 
ID5321478 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp692088 
End bp694742 
Gene Length2655 bp 
Protein Length884 aa 
Translation table11 
GC content62% 
IMG OID640789578 
Productaminopeptidase N 
Protein accessionYP_001326333 
Protein GI150395866 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0308] Aminopeptidase N 
TIGRFAM ID[TIGR02414] aminopeptidase N, Escherichia coli type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.411291 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.599846 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGACAA ACACTGGCCA GATCGTTCAT CTGGAAAACT ACCGGCCGAC CGATTTCGTT 
CTCGAAAGGG TGGACCTGAC GTTCGAACTC GACCCGCAGG AAACGAAGGT CGAGGCGCGC
ATGATCTTCC ACCGCCGCGA AGGCGTCAGT CCCTCGGCAC CGCTCGTCCT CGATGGCGAC
GAGCTCGTCA TGACCGGCCT GCTGCTCGAC CAGGAAGCGA TACCGGGCAC GCTCTATGAA
GCGACCGACG ACACGCTCAT CATCCGCGAA CTGCCCGAAA GGGCTCCTTT CGAGATCACC
GTCACGACGG TATTGTCGCC GGAGACCAAC ACCAAGCTGA TGGGCCTTTA CCGCACGAGC
AACGTCTATT GCACCCAGTG CGAAGCCGAA GGCTTCCGTC GCATTACCTA TTTCCCGGAT
CGTCCAGACG TGCTGGCGGT CTATACCGTC AACATCATCG CGGACAAGGC TGCGGCGCCG
CTTCTGCTTT CGAACGGCAA CTATCTCGGT GGCGCCGACA TGGGCGACGG ACGCCATTTC
GCCTCCTGGT TCGATCCGCA TCCGAAACCG AGCTATCTCT TTGCACTCGT TGCCGGCGAT
CTCGGCGTCG TAGAGGACAC CTTCACCACC GTTTCCGGCG GCAACGTCGC CCTCAAGATC
TATGTCGAGC ACGGCAAGGA GCCGCGGGCG ACCTATGCCA TGGACGCGCT GAAACGGTCG
ATGAAATGGG ACGAGGACGT CTTCGGCCGT GAATACGACC TCGACATATT CATGATCGTT
GCGGTCTCCG ATTTCAACAT GGGCGCCATG GAGAACAAGG GCCTCAACGT TTTCAACGAC
AAATATGTGC TCGCCGATCC CGAAACGGCG ACCGATGCGG ACTATGCCAA CATCGAGGCG
ATCATCGCGC ACGAATATTT CCACAACTGG ACCGGCAACC GCATTACCTG CCGCGACTGG
TTCCAGCTCT GCCTGAAGGA AGGCCTGACC GTCTACCGCG ACCACGAGTT TTCCGCCGAT
ATGCGCTCGC GCGCCGTGAA GCGCATCGCC GAAGTGCGGC ATTTGAAATC CGAGCAGTTC
CCGGAAGATG CCGGTCCCCT CGCCCACCCT GTACGGCCGA CGCAATACCG CGAGATCAAC
AATTTCTACA CGACGACGGT CTACGAAAAG GGATCCGAGG TGACGCGGAT GATCGCGACC
ATCCTCGGGC GCGATCTCTT CAAGAAGGGC ATGGACCTCT ATTTCGAGCG TCACGACGGC
CAGGCGGTGA CGATCGAGGA TTTCGTCGCC TGCTTCGAAG CTGCCAGCGG CCGCGACCTC
AAGCAATTCT CACTATGGTA CCATCAGGCG GGAACCCCGC TCGTTACCGC TTCGGGCGTC
TATAACACCG CCGGGCAGAC ATACACTCTC TCGCTCGAGC AGACCGTGCC GCCGACACCC
GGACAAAGCA GCAAGGCACC GATGCATATC CCTCTTCGCT TCGGGCTGCT GCTTCCGGAC
GGAAGCGAGG CGACGCCCAC GGCCGTCTCC GGCGCCGAGA TCAGCGGGGA CGTACTGCAC
CTGACCGAGC GAAAACAGAG CGTCACCTTC TCGGGTGTTC CGGCGCAGCC TGTCCCTTCC
TTCAACCGAG ACTTTTCGGC GCCGATCAAT TTGCACGTTG TGCAAAGCGC CGGGGACCGT
GCCTTGATCG CGCGTTTCGA AACCGACCTC TTCGCACGTT GGCAGGCGCT GAATACGATG
GCACTCGACA ATCTCGTCAA GGCGGCGGCG CAAACGCGTG CCGGACGGCC GGTCGCCTGC
GACGACGCGC TCGTCGACGC GCTTCTCGCC GCGGCCGCAG ACAACCGGCT GGAGCCCGCA
TTCCGCGCGC AGGTGTTGTC CTTGCCGAGC GAAACCGACA TCGGCCGCGA AATCGGCGGC
AGCAACGACC CGGATGCGAT CCATACCGGC CGTCAGGCTG TTCTCACCGC CATCGCCACG
GCGGGCAAGG ACAGTTTCGC GCGGCTGGTT CACGAGATGT CCCAGTCCGG CCCGTTCCGT
CCGGACGCCG AAAGCGCCGG GCGTCGAGCC CTGCGCTATT CCGGTCTCTT CTATCTCGTC
TATGCCGACG GGCAGCCGGG CAAAGCCGCT GATGCGTTTC GCTCCGCCAA CAATATGACC
GATCTCAGCC AGGCGCTGAC GCTCCTTGCG CATCGTTTTC CGGATGCCGA GGAGACGGCG
GAGGCCTTGG CCGCGTTCAA GGAACGCTTT GCCAATAATG CGCTGGTCAT CGACAAGTGG
TTCGCCATTC AGGCGACGAT CCCCGGAGCT GCGACGCTCG ACCGGGTTCG GGGGCTGATG
TCGGATCCGC TCTTCAATGC CAACAACCCC AACCGGGTAC GCTCGCTAGT CGGCACCTTC
GCATTCGCGA ACGCCACAGG CTTCAATCGC GTCGATGGCG AAGGCTATCG TTTCCTCGCC
CGACAGATCC TCGACATCGA CGCGCGCAAT CCTCAACTTG CCGCGCGCAT TCTGACGTCG
ATGCGCTCGT GGGGTTCGCT GGAAGAGGTT CGGGCCAGCC ACGCCCGAAG TGCGCTCGAG
GAGATTGCGC GTGCCTCCAG CCTTTCCGCG GATGTCAGCG ACATCGTCGA CCGCATGCTC
GAAGACAAGC ACTGA
 
Protein sequence
MRTNTGQIVH LENYRPTDFV LERVDLTFEL DPQETKVEAR MIFHRREGVS PSAPLVLDGD 
ELVMTGLLLD QEAIPGTLYE ATDDTLIIRE LPERAPFEIT VTTVLSPETN TKLMGLYRTS
NVYCTQCEAE GFRRITYFPD RPDVLAVYTV NIIADKAAAP LLLSNGNYLG GADMGDGRHF
ASWFDPHPKP SYLFALVAGD LGVVEDTFTT VSGGNVALKI YVEHGKEPRA TYAMDALKRS
MKWDEDVFGR EYDLDIFMIV AVSDFNMGAM ENKGLNVFND KYVLADPETA TDADYANIEA
IIAHEYFHNW TGNRITCRDW FQLCLKEGLT VYRDHEFSAD MRSRAVKRIA EVRHLKSEQF
PEDAGPLAHP VRPTQYREIN NFYTTTVYEK GSEVTRMIAT ILGRDLFKKG MDLYFERHDG
QAVTIEDFVA CFEAASGRDL KQFSLWYHQA GTPLVTASGV YNTAGQTYTL SLEQTVPPTP
GQSSKAPMHI PLRFGLLLPD GSEATPTAVS GAEISGDVLH LTERKQSVTF SGVPAQPVPS
FNRDFSAPIN LHVVQSAGDR ALIARFETDL FARWQALNTM ALDNLVKAAA QTRAGRPVAC
DDALVDALLA AAADNRLEPA FRAQVLSLPS ETDIGREIGG SNDPDAIHTG RQAVLTAIAT
AGKDSFARLV HEMSQSGPFR PDAESAGRRA LRYSGLFYLV YADGQPGKAA DAFRSANNMT
DLSQALTLLA HRFPDAEETA EALAAFKERF ANNALVIDKW FAIQATIPGA ATLDRVRGLM
SDPLFNANNP NRVRSLVGTF AFANATGFNR VDGEGYRFLA RQILDIDARN PQLAARILTS
MRSWGSLEEV RASHARSALE EIARASSLSA DVSDIVDRML EDKH