Gene Noc_1387 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_1387 
Symbol 
ID3706073 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp1536473 
End bp1538791 
Gene Length2319 bp 
Protein Length772 aa 
Translation table11 
GC content52% 
IMG OID637737881 
Productpeptidase S16, ATP-dependent protease La 
Protein accessionYP_343410 
Protein GI77164885 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0466] ATP-dependent Lon protease, bacterial type 
TIGRFAM ID[TIGR00763] ATP-dependent protease La 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAAACA ACGCTTATCC AACATTGCCG CTTAAAAATA CTGTCCTCTT TCCCCACCTG 
GTCCTTCCCC TGTCGGTGGG GCGAGCAGGA TCTATAGCCG CAGTCGAGGC GGCGCTGAGC
AGCGAAGATA AACTAATTGC TGTCTTTCCT CAAAAGGATC CCCGGACTGA CGAACCCGCT
GCGGATGATT TATTCCGCTT TGGCACGGTA GGAATCATCA AAAAGATGGT CAGGAGCGAG
GATACGGTTC AGATTCTAGT TCAGGGAATA GAGCGGGTTG AGCAGCTAGA AATGGTCCAG
AAGCAACCTT ATCTTTCCCT CAAAATTGCC ACTCTCTCTG AACCCTCGGA TACGGGCACT
GAAATCGAAG CCTTGCACCG AACTGTTATC GAACTCGCTG GCAAAATGAT TGAACTGGTG
CAACCCCAGA TCCAGGTCGG CATCCACCAC ATTATTTCCG ACGTGGAAAA GCCCCTCCAC
CAGATCTATC TTCTCACCTC TATCCTCTCG CTGGATTTTG ACAAGGAGAA AGAACTGCTG
GCTGCCGCTA CCCAGGTAGA AGCCTTGCAG TTAATGCACC GTTATCTTAA CCACGAAGTG
CAGGTTCTGG AGGTGCGGCA AAAAATCACC AGCACCGCCC AAACAGAGAT AGATAAGAAA
CAGCGTGAAT ATGTCCTACG CCAGCAATTA GAGGCCATCC AAGAAGAACT GGGGGAGACT
AACCCTGAAC AGGCTGAGAT CAAGGAGTTA CGCCAGCGAA TGGAAGAAAC GGAACTCCCG
GAGCTGGTCC GCAAAGAAGT GGAGAAAGAA ATTACCCGAT TGGAACGGAT GCCTTCGGCT
GCTCCTGATT ATCAGCTGAC CCGCGGTTAC GTGGAGCTAG CCCTAGAATT ACCCTGGAAT
AAAACCACGG AAGATCGTTT AGATCTCAAA AGGGCGCGCG AGATCCTCGA TGAAGATCAC
TTCGACTTGG AAGACGTCAA AGAACGGATC ATCGAACATC TGGCGGTGAT GAAACTCAAC
CCGGAAGCTA AATCGCCCAT TCTTTGCTTC GTTGGCCCCC CTGGAGTTGG TAAAACCTCG
GTGGGACAAT CCATGGCCCG CGCCTTGGGA CGAAAATTCG AGCGCATGAG TCTTGGTGGC
CTGCATGATG AATCGGAGTT GCGCGGCCAT CGCCGCACCT ACATTGGTGC TATGCCTGGC
CGAATCATTC GCGCCATTCG CCGTACTGGT TACCAGAATC CGCTTCTAAT GTTGGATGAA
ATCGACAAAC TGGGCCGGGA TTTTCGCGGC GATCCGGCGG CGGCATTATT AGAGATTCTT
GATCCCGCCC AGAATGCCGA ATTTCATGAT AACTACTTGG ATCTGCCTTT CGATCTTTCT
AAAATCTTCT TCGTCACCAC CGCTAATACG TTAGATACCA TCCCCCGCCC TCTGCTTGAT
CGGATGGAGA TTCTGCGGCT ACCGGGGTAC AGTGACGAAG AAAAACAACA TATCGCCCGT
CGTTATCTAA TTGGACGGCA AATTAGAGAA GCCGGCCTTT CCGAGATCCA ACTCTCCATA
CCGGATGAGA CATTAAGTTA CCTTATTCGG CGTTATACTC GGGAAGCCGG AGTGCGTGAA
CTAGAGCGGA TGCTGGGGCG AATTGCCCGC AAAGTGGCTA CCCAAGTCGC CACTGGTCAA
ACTCAGCCGG TAACCGTCAC GCCGCAAGAC CTTGTCGAAT TACTAGGACC AGAGCGATTT
TTCGCTGAAG AAATGCGCCA GCAGCTCGCC CCCGGGGTCG CGGCAGGCTT AGCTTGGACC
GAAGCGGGCG GCGATGTCTT GTACGTGGAA GCGGCTCTGC TACCAGAAGG GAAAGGGATG
ACTCTGACGG GACAGCTGGG CAGTATCATG CAAGAATCAG CAAAAGCTGC CCAAAGCTAC
CTCTGGTCCC GCGCCGAAGA ACTTAACATC GATCAAAAAA CCATCCGGGA ATCGGGGGTC
CACATTCATG TTCCAGCGGG CGCTATCCCT AAAGATGGCC CCTCGGCCGG AGTCACCATG
GCTTCAGCAC TCACTTCCGC TTACGCCCAT CAACCTGTTC GCAGCGATAC GGCAATGACA
GGGGAAATAA CACTGAGTGG TTTAGTCCTT CCCGTGGGAG GGATTAAAGA GAAAGTGCTT
GCCGCCCACC GGTCCGGCAT CCAGCGGATC ATTCTTCCCA AAGAAAATGA GAAAGACTTG
CGGGAAATTC CCGAGCATGT CCGGCAAAGC ATTCAATTTA TTCTAGCCAG ACGGATTGAA
GAGGTGCTAG CTGAAGCTAT CCCAGATTTA AATAGGTGA
 
Protein sequence
MENNAYPTLP LKNTVLFPHL VLPLSVGRAG SIAAVEAALS SEDKLIAVFP QKDPRTDEPA 
ADDLFRFGTV GIIKKMVRSE DTVQILVQGI ERVEQLEMVQ KQPYLSLKIA TLSEPSDTGT
EIEALHRTVI ELAGKMIELV QPQIQVGIHH IISDVEKPLH QIYLLTSILS LDFDKEKELL
AAATQVEALQ LMHRYLNHEV QVLEVRQKIT STAQTEIDKK QREYVLRQQL EAIQEELGET
NPEQAEIKEL RQRMEETELP ELVRKEVEKE ITRLERMPSA APDYQLTRGY VELALELPWN
KTTEDRLDLK RAREILDEDH FDLEDVKERI IEHLAVMKLN PEAKSPILCF VGPPGVGKTS
VGQSMARALG RKFERMSLGG LHDESELRGH RRTYIGAMPG RIIRAIRRTG YQNPLLMLDE
IDKLGRDFRG DPAAALLEIL DPAQNAEFHD NYLDLPFDLS KIFFVTTANT LDTIPRPLLD
RMEILRLPGY SDEEKQHIAR RYLIGRQIRE AGLSEIQLSI PDETLSYLIR RYTREAGVRE
LERMLGRIAR KVATQVATGQ TQPVTVTPQD LVELLGPERF FAEEMRQQLA PGVAAGLAWT
EAGGDVLYVE AALLPEGKGM TLTGQLGSIM QESAKAAQSY LWSRAEELNI DQKTIRESGV
HIHVPAGAIP KDGPSAGVTM ASALTSAYAH QPVRSDTAMT GEITLSGLVL PVGGIKEKVL
AAHRSGIQRI ILPKENEKDL REIPEHVRQS IQFILARRIE EVLAEAIPDL NR