Gene Information |
Locus tag | Noc_1396 |
Symbol | |
ID | 3706082 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | + |
Start bp | 1546799 |
End bp | 1547917 |
Gene Length | 1119 bp |
Protein Length | 372 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 637737890 |
Product | peptidase S1 and S6, chymotrypsin/Hap |
Protein accession | YP_343419 |
Protein GI | 77164894 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0139255 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCTAAAGC GAAAGAAAAT CTCTATTGGC TCCTTGGCGT TGTTGTTGGT ACTCGCATTT ACTGTAGGGA TAGGGTGGCA GGGATTTGCA GAGCGGTTTT TGGAAGCGCC TCCTCCGGCA GAGCCGCGCC CCGTTCTGGC GCGTGGAGAC TTGGCCGCAG TTGAGAAATC AACGATTGAG CTCTTTCGTA AAGTCTCCCC CGCCGTAGTT TTTATCACCA CCCTATCCCG GCACCGTGAT TGGTTCAGCT TAAATGTGCA AGAGATCCCC CGGGGGACTG GCTCGGGCTT TATTTGGGAT GATAGCGGCC ATATTGTGAC TAATTTACAC GTTGTGCAAG GATCGAATGC TGCTAAGGTG ACCCTATATG ACCACTCAAC TTGGGATGCC AAACTGATTG GGGCGGCCCC TGAAAAGGAT TTAGCGGTAC TGCGGATTAA GGCGCCGCGA AACAAGCTTA TGCCTATCGC CATCGGTAGT TCCGGTGATC TCCAAGTAGG TCAAAAAGCA TTTGCGATTG GTAACCCCTT TGGTCTGGAT CAAACGTTAA CTACGGGGGT GATCAGCGCC TTGGGTAGAG AAATGGAGTC GGCGGCAAGA ATTCCCATTC GCAATGTGAT TCAAACGGAC GCCGCCATTA ACCCAGGAAA CTCAGGAGGG CCTTTGCTGG ATAGTGCCGG GCGCCTGATG GGAGTGAATA CAGCGATTTA TAGCCCTTCC GGTACCTATG CCGGGATTGG TTTTGCTATT CCCGTGGATA CGGTCAATTG GGTAGTACCG GAACTGATTG CCAAGGGCCG AGTGGAGCGG CCAACGTTGG GGATTGAATT GTTGCCAGCG CGGGCTATGG CCAATATGAG AGTGGAGGGG GCGGTCATTC TGCGCGTGAT TCCGGGTAGT GGCGCGGAGC AAGCCGGTTT GCGGGGGGTC CAACGCGACT CCCTGGGACG GATCTATCTG GGGGATATTA TCGTTGCGGT CGAGGGTCAG CCCGTATTGG ATGCCGATGA TTTAGTCTTA GCCCTTGAGC GGCGTCAAGC TGGCGAAAAA ATTCAGGTGC AGGTGATTCG TGAAGAGCAA CGACTGGATA TAGAGGTGAC ATTAGGCTCA CCGGCATAG
Protein sequence | MLKRKKISIG SLALLLVLAF TVGIGWQGFA ERFLEAPPPA EPRPVLARGD LAAVEKSTIE LFRKVSPAVV FITTLSRHRD WFSLNVQEIP RGTGSGFIWD DSGHIVTNLH VVQGSNAAKV TLYDHSTWDA KLIGAAPEKD LAVLRIKAPR NKLMPIAIGS SGDLQVGQKA FAIGNPFGLD QTLTTGVISA LGREMESAAR IPIRNVIQTD AAINPGNSGG PLLDSAGRLM GVNTAIYSPS GTYAGIGFAI PVDTVNWVVP ELIAKGRVER PTLGIELLPA RAMANMRVEG AVILRVIPGS GAEQAGLRGV QRDSLGRIYL GDIIVAVEGQ PVLDADDLVL ALERRQAGEK IQVQVIREEQ RLDIEVTLGS PA
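The length, GC content and protein fields above can be cross-checked against the coordinates and sequences in this record. The following is a minimal sketch, not the database's own method. The function names (`gene_length`, `gc_fraction`, `translate`) are hypothetical, and the sketch assumes that translation table 11 shares the standard codon-to-amino-acid assignments and reads an alternative start codon such as GTG as Met at the first position.

```python
from itertools import product

# Standard codon assignments in NCBI order (first base varies slowest, TCAG).
# Assumption: table 11 uses these same assignments for elongation.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

# Assumption: start codons permitted by table 11; each is read as Met
# when it is the first codon of the coding sequence.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq):
    """Remove the spaces that split the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def gene_length(start_bp, end_bp):
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def gc_fraction(seq):
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        amino = CODON_TABLE[codon]
        if amino == "*":
            break
        if i == 0 and codon in START_CODONS:
            amino = "M"  # initiator codon is read as Met
        protein.append(amino)
    return "".join(protein)


# Coordinates from this record: Start bp 1546799, End bp 1547917.
length = gene_length(1546799, 1547917)
# One codon is the stop codon, so it does not add an amino acid.
expected_protein_length = length // 3 - 1
```

With the values in this record, `length` is 1119 and `expected_protein_length` is 372, which agree with the Gene Length and Protein Length fields. Passing the full Gene sequence value to `translate` and `gc_fraction` allows the Protein sequence and GC content fields to be checked the same way.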