Gene Noc_1156 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_1156 
Symbol 
ID3706808 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp1263282 
End bp1265213 
Gene Length1932 bp 
Protein Length643 aa 
Translation table11 
GC content54% 
IMG OID637737661 
Productpeptidase S9, prolyl oligopeptidase active site region 
Protein accessionYP_343190 
Protein GI77164665 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTTGCCC CTGTAGAAAA GCCCTATGGC GCTTGGCTAT CCCCCATTAC CGCTGATCTC 
ATTGTCTCGG AAACAATTGG CCTAGGCCAG ATAGCACTAT CCGGGGACGC CGTTTACTGG
GTGGAAATGC GCCCCACGGA AGGGGGGCGG AATGTAATCC TAGGCCGCAC CAGGGACGGA
GAAGTCAAAG AAATTAATCC CGCTCCTTAT AATGCTAGAA CCCGAGTCCA TGAGTATGGC
GGCGGGGCTT ATCTAGTGGC TGGGGATTCG GTGTTTTTTT CCCATTTTGA AGATCAAAGG
CTGTATCGGT GTCGGATAGA TGATTCAATT GAGCCGCTTA CTCTGGAAGG AGATTATCGT
TACGCCGATG CGATTTTTGA TCGATTCCGC AATCGTCTTG TCTGCGTTCG TGAAGACCAT
ACGGATAAGA CTCGGGAGCC CGTCAATACG CTGGTCAGTA TCCCCCTCGA CGGTAATAAG
CGGATTTCCG TGTTGGCCTC GGGGGCGGAT TTTTATGCTT CACCCCGCCT CAGCCCCGAT
GGAAGCCAGC TCGCCTGGCT GACTTGGAAC CATCCCAATA TGCCTTGGGA TGGTACGGAT
CTCTGGCTGG CCCCGGTAGA GACAGCGGGC TCCTTGGGAG AGAGCAAACA CATTGCGGGC
GCAGCGGATG AGTCTATTTT TCAGCCCGAG TGGTCCCCTA CAGGCACTTT GTATTTTATT
TCTGACCGTA CTGGATGGTG GAATCTTTAT CGCTGGCGGG AACAGCAAAT AGAATCGGTT
ACTCGGATGG AGGTTGAATT TGGCGTACCC CAATGGGTTT TTGGCCTATC CACTTATGCC
TTTGAATCGG CGGAGCGTAT CGTCTGCGCC TATAGCAGGG ACGGCGTAAG CCATTTAGCA
ATTATTGATG CCGCCAGTGG CGTTTTGGAG GAATTAGAGA CTCCTTATAC GGAAATAGGA
TCTTTGCGGG CGCAGGCCGG ATATGCCGTA TTTATTGCCG CCTCGCCTAC GGAATTTCCT
GCTGTTGTCC AGCTTGACTT AGCGACCAGA GAAGTGGAAG TGCTGCGTCG GGCCAGCGAG
ATGAGCATTG ATTCCGGGTA TCTTTCCATT CCCGAAGCTA TTCAGTTCCC AACCACGGCG
GGCGCTCTGT CCCATGCTTT CTTCTATCCT CCGAAAAATA AGGATTTTAC CGGCTTGCCA
GGGGAGCGGC CTCCCTTATT GGTAATCAGC CATGGGGGTC CTACCGCGGC TACGAATAAC
GTCCTGAGTT TAAAGATTCA ATACTGGACT AGCCGGGGCA TTGCCGTGCT TGATGTGAAT
TATCGGGGCA GTAGCCACTA TGGACGGGAA TACCGCCAGC AATTGAAGGG GCAGTGGGGA
TGCGCCGATG TGGAGGATTG CGTCAACGGG GCCTTGTATT TAGCGCAGCG GGGAGAGGTA
GATCGGGAGC GTCTTGCCAT TCGGGGAAGC AGCGCGGGTG GTTTTACCAC CTTGGCGGCC
TTGACTTTTC ATGAGGTATT TAAGGCCGGG GCGAGCTATT ATGGCGTCAG CGATCTGGCG
GCGCTGGCAA AAGAAACCCA TAAGTTCGAG TCCCGTTACC TCGATCACTT GATTGGACCC
TACCCGGAAC GGGCTGATCT GTACGCGGCC CGCTCGCCAA TTCACGCCGT TGATAAACTC
TCCTGTCCCG TTATCTTTTT TCAGGGCCTG GAAGATAAGA TTGTGCCGCC TGAGCAAGCC
GAGCAAATGG TGGAAGCACT ACGCGAGAAA GGGGTGCCCG TGGCCTATGT TCCCTTTGAA
GGCGAGCAAC ATGGCTTCCG GCGAGCGGAA AATATCAAAC GGGCGCTGGG GGCCGAGCTT
TATTTTTACG CCCAGATTTT TGGCTTTGAC TTGGCGGAAC GGATTGAACC GGTAGCGATT
GAAAATCTTT AA
 
Protein sequence
MLAPVEKPYG AWLSPITADL IVSETIGLGQ IALSGDAVYW VEMRPTEGGR NVILGRTRDG 
EVKEINPAPY NARTRVHEYG GGAYLVAGDS VFFSHFEDQR LYRCRIDDSI EPLTLEGDYR
YADAIFDRFR NRLVCVREDH TDKTREPVNT LVSIPLDGNK RISVLASGAD FYASPRLSPD
GSQLAWLTWN HPNMPWDGTD LWLAPVETAG SLGESKHIAG AADESIFQPE WSPTGTLYFI
SDRTGWWNLY RWREQQIESV TRMEVEFGVP QWVFGLSTYA FESAERIVCA YSRDGVSHLA
IIDAASGVLE ELETPYTEIG SLRAQAGYAV FIAASPTEFP AVVQLDLATR EVEVLRRASE
MSIDSGYLSI PEAIQFPTTA GALSHAFFYP PKNKDFTGLP GERPPLLVIS HGGPTAATNN
VLSLKIQYWT SRGIAVLDVN YRGSSHYGRE YRQQLKGQWG CADVEDCVNG ALYLAQRGEV
DRERLAIRGS SAGGFTTLAA LTFHEVFKAG ASYYGVSDLA ALAKETHKFE SRYLDHLIGP
YPERADLYAA RSPIHAVDKL SCPVIFFQGL EDKIVPPEQA EQMVEALREK GVPVAYVPFE
GEQHGFRRAE NIKRALGAEL YFYAQIFGFD LAERIEPVAI ENL