Gene Noc_2462 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_2462 
Symbol 
ID3704864 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp2808437 
End bp2809852 
Gene Length1416 bp 
Protein Length471 aa 
Translation table11 
GC content49% 
IMG OID637738941 
Productpeptidase S1C, Do 
Protein accessionYP_344445 
Protein GI77165920 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGCGTT TTTTGAACAA ACAATGGGCC CTCTTGGCGA TATCGCTACT GATTGCCGTG 
ACTGGGCAGG CCGCAGGGTT GCCTGATTTT ACTGAATTGG TGAAAAATAA TAGTCCTGCG
GTGGTTAATA TCAGCACTAC CCAGAAAATC ACTCAAGGCG GCTCCCGTTT GCCACACGGG
TTACACCAAT TACCAGAGGG TAGTCCCTTT GAGGATTTTT TTCGCCGCTT TTTTGACGAG
GGAACGGGTG AACCCCGGAG CTTTGAAACC CATTCTTTAG GATCAGGTTT TGTTATTTCA
TCGGATGGGT ACATTATCAC GAATAATCAC GTCATCCAGG ATGCAGATGA GATTATCGTG
CGCTTTAGCG ACCGAAGAGA GCTGGAAGCC GAAGTTGTAG GGAGTGACGA GAGTAGCGAT
CTTGCCCTGC TAAAAGTAGA AGCAAAAGAT CTACCTACTC TGAGGCAAAG CAATGCCAGT
CAGCTTAAGG TTGGCGAATG GGTGCTTGCC ATAGGTTCGC CCTTTGGTTT CGAGCATTCG
GCCACAGCGG GCATTGTTAG TGCTTTAGGA AGGAGTTTGC CGGAGGAGAG CTATGTTCCG
TTTATTCAAA CTGATGTGGC TATTAACCCA GGAAATTCAG GCGGCCCTCT CTTTAACTTA
ATGGGGGAAG TAGTGGGTAT CAACTCTCAG ATCTACAGCC GTACAGGCGG TTTCATGGGG
CTTTCTTTTG CTATACCTAT TGATGTAGCC ATGGAGGTCG TGGATCAACT CAAGGAAAAA
GGACGGGTGA CTCGAGGATG GCTGGGGGTT GTGATTCAGG ATGTTACCCG TGAGTTGGCA
GAATCATTTG GGCTTGGGAA ACCTCAAGGG GCTCTGGTAG CCAGGGTCTT AGCGGATAGC
CCAGCCGCTA AGGGAGGAGT CCAAGTGGGC GATATTATCC TGGATTTTAA TGGAAAATCA
GTGCCTCGGT CAGCGGCCTT GCCGCCACTG GTGGGGCGAG CCGAGATTGG TGAGACGGTT
GACATCCAAA TTTTACGTGC TGGTGATGAA AAAACACTCA AAATTAAAAT TGGAGAGCTG
CCAGAAAAAA AGGAATTGGA GAAGGCAACA GGGAAACCGG ATCATATATT TGAGGAGCGA
CTTGCGCTTG AGATAGCCGA CTTGAAAAAA GAGGAGCGTA CACAGCTTGA TGTGGTCTCT
GGTGTGCGAG TGACGAAAGT AGAAGAAGGG CCCGCCTTCG AGGCGGGAAT TCGGCATGGG
GATATTATTC TTAGTATTAA CAATACGGAG ATAAAGGATA CTGCCCAGTT CAAAGAGCTT
GCTAAAAAAT TGCCTGCGGG TAAGGCAGTG CCTCTTTTAT TACAACGAGA GCAGACCACC
TCTTTTATCG CCATCACCAT TCCAGAGGGG AGCTGA
 
Protein sequence
MKRFLNKQWA LLAISLLIAV TGQAAGLPDF TELVKNNSPA VVNISTTQKI TQGGSRLPHG 
LHQLPEGSPF EDFFRRFFDE GTGEPRSFET HSLGSGFVIS SDGYIITNNH VIQDADEIIV
RFSDRRELEA EVVGSDESSD LALLKVEAKD LPTLRQSNAS QLKVGEWVLA IGSPFGFEHS
ATAGIVSALG RSLPEESYVP FIQTDVAINP GNSGGPLFNL MGEVVGINSQ IYSRTGGFMG
LSFAIPIDVA MEVVDQLKEK GRVTRGWLGV VIQDVTRELA ESFGLGKPQG ALVARVLADS
PAAKGGVQVG DIILDFNGKS VPRSAALPPL VGRAEIGETV DIQILRAGDE KTLKIKIGEL
PEKKELEKAT GKPDHIFEER LALEIADLKK EERTQLDVVS GVRVTKVEEG PAFEAGIRHG
DIILSINNTE IKDTAQFKEL AKKLPAGKAV PLLLQREQTT SFIAITIPEG S