Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A2303 |
Symbol | |
ID | 3786708 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | - |
Start bp | 2619512 |
End bp | 2621359 |
Gene Length | 1848 bp |
Protein Length | 615 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 637812390 |
Product | peptidase S9, prolyl oligopeptidase active site region |
Protein accession | YP_412986 |
Protein GI | 82703420 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACGCT ACACCATTGA ACAATTCATG GCGACGACCT CCATAATGGG GGCGTCGTTC AGCGCGGATG AGGAGCGCAT CCTGTTCAGC AGTAATGAAT CGGGTATTTT CAATGCCTAC ACGCTGCCAG TGACAGGTGG TACGGCAGAG GCGTTGACCT GCTCCGCCAG CGATACCACG TTTTCGGTCA GCTTCTTTCC GCATGATGAT CGGGTACTGA TTACCCGGGA TTATCATGGC GATGAAAATT ATCATTTGTT CCTGCTCGCC CCCGACGGCG AGGAGGAAGA TCTGACACCG GGAGAAAAGC TCAAGGCGCA ATTCATGGGC TGGAGCCCCG ATGGACAGGC TTTTTACGTC GCTACCAATG AGCTTGATGC CAGATTCTTC GATGTATACC GTTATGACGC CGAGACTTTT GCGCGAACCT TGCTGTACCG GAATCACCAG GGGTTCGATC TTGGGCCCAT CAGTCGCGAT GAGCGCTGGA TTGCACTGAA CCGCTCGCAT ACCGCATCGG ACAGTGACAT TTACCTGTTC GATGTCGAAA AACGGGAAGT AAGGCACCTC ACGCCGCATG AGGGCTCCAT CAGTTTCCAT GCTGAGACCT TCGATTCCAC TTCGCGCAAG CTCTTCTTTC TTACCACGGA GGGAAGCGAA TTCAAGCATT TGTGCACCTA TGATCTCACT TCAGGGGTCG TCTGCGACCA CGAGAGTGCC GAGTGGGACG TGATGTACAC CTATTTCTCA TACGATGGCC GATTCCGCGT AACTGGCATC AACGCAGATG GAAGCATTGT CGTTCGTGTT GTCGAAATTG AAGATGGAGA GAGGGAGAAA CCTGTAAAAC TGCCAGCACT ACCCCAGGGA GAAATCCGGG GTGTGGTCTT TTCGCAAAAC AGCACGCGCA TGGCCTTCTA CGTAAATGGC GACCGCTCTC CCGATAATCT GTTCGTGCAC GATTTCAGCA CCGGGCAGTT TCGTCAACTC ACGCAAAGCC TGAACAAGGA GATCGATCCA TCAGATCTGG TGGAGGCGGA AGTGGTGCGG TTCCGATCCT TCGATGGAAT GATGATTCCC TCCATCTATT ACAAGCCGCA TGAAGCATCA GGCACCAACA AGGTGCCTGC CATCGTGTAC GTTCATGGAG GACCCGGCGG GCAGACCATG CGGGGTTATA ACGCCCAGAT TCAGTACCTC GTGAATCATG GATATGCCGT GCTGGGCATC AATAACCGGG GAAGCTCCGG CTATGGTAAA ACCTTCTTTA CCGCGGCCAA CCGCAAGCAC GGACGAGAGC CCTTGTGGGA TTGTGTGGAA GCGAAGACCT TTCTGGCCAG TCTCGGTTAC ATCGACCATG AGCGCATCGG CATCATGGGC GCGAGTTATG GCGGTTACAT GACACTTGCA GCCCTCGCGT TCCGGCCTGA AGCTTTCAAG GTAGGGGTGG ACATTTTCGG CGTCAGCAAC TGGCTGCGTA CCCTGGAGAG CATTCCTGTT TACTGGGAGT CCGTCCGCAA AGCCATTTAT GATGAAATTG GCGATCCCGT GGCGGACATC GATTTTCTTG TTGCGACTTC CCCCCTGTTC CATGCCAGGG AAATACGAAA GCCTTTATTG GTCATCCAGG GAGTCAATGA CCCTCGTGTG GTCAAGGCCG AGAGCGATGA AATGGTGGAG GCGGTCAGGA AGCATGGCAT TCCGGTGGAG TACATTGTTT TCCCTGATGA AGGCCATAGT TTTACCAAGA AAAAAAACCA GATCGAGGCG AATCGGCGAA TACTGGAGTT TCTGGACAAG TATCTGAAAG GTGATGTGAA CAAGACTGCA AGCCAAGTGG AAAAATAG
|
Protein sequence | MKRYTIEQFM ATTSIMGASF SADEERILFS SNESGIFNAY TLPVTGGTAE ALTCSASDTT FSVSFFPHDD RVLITRDYHG DENYHLFLLA PDGEEEDLTP GEKLKAQFMG WSPDGQAFYV ATNELDARFF DVYRYDAETF ARTLLYRNHQ GFDLGPISRD ERWIALNRSH TASDSDIYLF DVEKREVRHL TPHEGSISFH AETFDSTSRK LFFLTTEGSE FKHLCTYDLT SGVVCDHESA EWDVMYTYFS YDGRFRVTGI NADGSIVVRV VEIEDGEREK PVKLPALPQG EIRGVVFSQN STRMAFYVNG DRSPDNLFVH DFSTGQFRQL TQSLNKEIDP SDLVEAEVVR FRSFDGMMIP SIYYKPHEAS GTNKVPAIVY VHGGPGGQTM RGYNAQIQYL VNHGYAVLGI NNRGSSGYGK TFFTAANRKH GREPLWDCVE AKTFLASLGY IDHERIGIMG ASYGGYMTLA ALAFRPEAFK VGVDIFGVSN WLRTLESIPV YWESVRKAIY DEIGDPVADI DFLVATSPLF HAREIRKPLL VIQGVNDPRV VKAESDEMVE AVRKHGIPVE YIVFPDEGHS FTKKKNQIEA NRRILEFLDK YLKGDVNKTA SQVEK
|
| |