Gene Noc_0712 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_0712 
Symbol 
ID3706978 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp770459 
End bp771403 
Gene Length945 bp 
Protein Length314 aa 
Translation table11 
GC content56% 
IMG OID637737215 
Productpeptidase S33, proline iminopeptidase 1 
Protein accessionYP_342756 
Protein GI77164231 
COG category[R] General function prediction only 
COG ID[COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily)  
TIGRFAM ID[TIGR01249] proline iminopeptidase, Neisseria-type subfamily 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.0121198 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTGACTC TCTATCCCGA CATCAAGCCT TATGTACGCC ATACCCTAAC GGTGGACCCG 
CCCCATGAAC TCTATGTCGA GGAATGCGGC CACCCGGGAG GACTCCCCAT CCTCTTTCTC
CACGGAGGGC CAGGTAGTGG CTGCCAACCC CATCACCGCT GCTTTTTTGA TCCCGACATT
TATCGGGTAA TTTTATTCGA TCAGCGTGGT TGCGGCAGAT CCCAACCCCA TGGCGAGTTG
GAGAAAAATA CCACTACAGC GCTACTTGCG GATATGGAAT TTATCCGCAA CCACTTAGAG
ATTGAGCGCT GGCTTATTTT TGGTGGCTCC TGGGGCGCCG CCCTCGGGCT ACTCTACGGA
GAAACTCATC CAAGCCGGGT TTTAGGGCTC ATTTTACGCG GCATCTTCCT GGGCCGGGAG
CAAGACACCC GCTGGTTCCT GCAAGAGGGC GCGCCGCGAA TTTTTCCCGA TGCTTGGGCG
GCCTTGGTAG AGGATATTCC CGCCGAGGAG AGAAATAACC TCATAGAATT CTTCCACCAC
CGTCTTAAGG GTCCCGACGA GCTGGCCCAG ATGGCGGCGG CTAAGGCCCT ACATGCCTGG
GAGTCCAGTT GTATGCGCCT TGTCAACAGC GAGGCACCTT CCCAATCAGG CCGCACCACA
CTGCTAGCCC ACGCCCGTTT GCTTATTCAC TACGCCAGAC ATCATTACTT TATTCAACCC
AATCAGATAC TCGATCATGC CCATCAATTA AAAAATATTC CTGGAATCAT CGTCCATGGC
CGCTATGATG TCATTTGCCC TGCCGGCAAT GCCTGGGAGC TGCATCAAGC CTGGCCTTCA
TCCGAGCTGC AAATCGTGCC CCTAGCCGGC CATGGAGCAA CCGAGCCAGC CATCGCGGAC
GCGCTAATTC GGGCAACGAA CCTCATGGCA AGGCGGGTAG GGTAA
 
Protein sequence
MLTLYPDIKP YVRHTLTVDP PHELYVEECG HPGGLPILFL HGGPGSGCQP HHRCFFDPDI 
YRVILFDQRG CGRSQPHGEL EKNTTTALLA DMEFIRNHLE IERWLIFGGS WGAALGLLYG
ETHPSRVLGL ILRGIFLGRE QDTRWFLQEG APRIFPDAWA ALVEDIPAEE RNNLIEFFHH
RLKGPDELAQ MAAAKALHAW ESSCMRLVNS EAPSQSGRTT LLAHARLLIH YARHHYFIQP
NQILDHAHQL KNIPGIIVHG RYDVICPAGN AWELHQAWPS SELQIVPLAG HGATEPAIAD
ALIRATNLMA RRVG