Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noc_1156 |
Symbol | |
ID | 3706808 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | + |
Start bp | 1263282 |
End bp | 1265213 |
Gene Length | 1932 bp |
Protein Length | 643 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 637737661 |
Product | peptidase S9, prolyl oligopeptidase active site region |
Protein accession | YP_343190 |
Protein GI | 77164665 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTTGCCC CTGTAGAAAA GCCCTATGGC GCTTGGCTAT CCCCCATTAC CGCTGATCTC ATTGTCTCGG AAACAATTGG CCTAGGCCAG ATAGCACTAT CCGGGGACGC CGTTTACTGG GTGGAAATGC GCCCCACGGA AGGGGGGCGG AATGTAATCC TAGGCCGCAC CAGGGACGGA GAAGTCAAAG AAATTAATCC CGCTCCTTAT AATGCTAGAA CCCGAGTCCA TGAGTATGGC GGCGGGGCTT ATCTAGTGGC TGGGGATTCG GTGTTTTTTT CCCATTTTGA AGATCAAAGG CTGTATCGGT GTCGGATAGA TGATTCAATT GAGCCGCTTA CTCTGGAAGG AGATTATCGT TACGCCGATG CGATTTTTGA TCGATTCCGC AATCGTCTTG TCTGCGTTCG TGAAGACCAT ACGGATAAGA CTCGGGAGCC CGTCAATACG CTGGTCAGTA TCCCCCTCGA CGGTAATAAG CGGATTTCCG TGTTGGCCTC GGGGGCGGAT TTTTATGCTT CACCCCGCCT CAGCCCCGAT GGAAGCCAGC TCGCCTGGCT GACTTGGAAC CATCCCAATA TGCCTTGGGA TGGTACGGAT CTCTGGCTGG CCCCGGTAGA GACAGCGGGC TCCTTGGGAG AGAGCAAACA CATTGCGGGC GCAGCGGATG AGTCTATTTT TCAGCCCGAG TGGTCCCCTA CAGGCACTTT GTATTTTATT TCTGACCGTA CTGGATGGTG GAATCTTTAT CGCTGGCGGG AACAGCAAAT AGAATCGGTT ACTCGGATGG AGGTTGAATT TGGCGTACCC CAATGGGTTT TTGGCCTATC CACTTATGCC TTTGAATCGG CGGAGCGTAT CGTCTGCGCC TATAGCAGGG ACGGCGTAAG CCATTTAGCA ATTATTGATG CCGCCAGTGG CGTTTTGGAG GAATTAGAGA CTCCTTATAC GGAAATAGGA TCTTTGCGGG CGCAGGCCGG ATATGCCGTA TTTATTGCCG CCTCGCCTAC GGAATTTCCT GCTGTTGTCC AGCTTGACTT AGCGACCAGA GAAGTGGAAG TGCTGCGTCG GGCCAGCGAG ATGAGCATTG ATTCCGGGTA TCTTTCCATT CCCGAAGCTA TTCAGTTCCC AACCACGGCG GGCGCTCTGT CCCATGCTTT CTTCTATCCT CCGAAAAATA AGGATTTTAC CGGCTTGCCA GGGGAGCGGC CTCCCTTATT GGTAATCAGC CATGGGGGTC CTACCGCGGC TACGAATAAC GTCCTGAGTT TAAAGATTCA ATACTGGACT AGCCGGGGCA TTGCCGTGCT TGATGTGAAT TATCGGGGCA GTAGCCACTA TGGACGGGAA TACCGCCAGC AATTGAAGGG GCAGTGGGGA TGCGCCGATG TGGAGGATTG CGTCAACGGG GCCTTGTATT TAGCGCAGCG GGGAGAGGTA GATCGGGAGC GTCTTGCCAT TCGGGGAAGC AGCGCGGGTG GTTTTACCAC CTTGGCGGCC TTGACTTTTC ATGAGGTATT TAAGGCCGGG GCGAGCTATT ATGGCGTCAG CGATCTGGCG GCGCTGGCAA AAGAAACCCA TAAGTTCGAG TCCCGTTACC TCGATCACTT GATTGGACCC TACCCGGAAC GGGCTGATCT GTACGCGGCC CGCTCGCCAA TTCACGCCGT TGATAAACTC TCCTGTCCCG TTATCTTTTT TCAGGGCCTG GAAGATAAGA TTGTGCCGCC TGAGCAAGCC GAGCAAATGG TGGAAGCACT ACGCGAGAAA GGGGTGCCCG TGGCCTATGT TCCCTTTGAA GGCGAGCAAC ATGGCTTCCG GCGAGCGGAA AATATCAAAC GGGCGCTGGG GGCCGAGCTT TATTTTTACG CCCAGATTTT TGGCTTTGAC TTGGCGGAAC GGATTGAACC GGTAGCGATT GAAAATCTTT AA
|
Protein sequence | MLAPVEKPYG AWLSPITADL IVSETIGLGQ IALSGDAVYW VEMRPTEGGR NVILGRTRDG EVKEINPAPY NARTRVHEYG GGAYLVAGDS VFFSHFEDQR LYRCRIDDSI EPLTLEGDYR YADAIFDRFR NRLVCVREDH TDKTREPVNT LVSIPLDGNK RISVLASGAD FYASPRLSPD GSQLAWLTWN HPNMPWDGTD LWLAPVETAG SLGESKHIAG AADESIFQPE WSPTGTLYFI SDRTGWWNLY RWREQQIESV TRMEVEFGVP QWVFGLSTYA FESAERIVCA YSRDGVSHLA IIDAASGVLE ELETPYTEIG SLRAQAGYAV FIAASPTEFP AVVQLDLATR EVEVLRRASE MSIDSGYLSI PEAIQFPTTA GALSHAFFYP PKNKDFTGLP GERPPLLVIS HGGPTAATNN VLSLKIQYWT SRGIAVLDVN YRGSSHYGRE YRQQLKGQWG CADVEDCVNG ALYLAQRGEV DRERLAIRGS SAGGFTTLAA LTFHEVFKAG ASYYGVSDLA ALAKETHKFE SRYLDHLIGP YPERADLYAA RSPIHAVDKL SCPVIFFQGL EDKIVPPEQA EQMVEALREK GVPVAYVPFE GEQHGFRRAE NIKRALGAEL YFYAQIFGFD LAERIEPVAI ENL
|
| |