Gene Noc_2169 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_2169 
Symbol 
ID3704843 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp2502671 
End bp2505127 
Gene Length2457 bp 
Protein Length818 aa 
Translation table11 
GC content52% 
IMG OID637738645 
Productpeptidase S8/S53 subtilisin kexin sedolisin 
Protein accessionYP_344159 
Protein GI77165634 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGATCCATT GGCATTTAAT GAGAGTCTGT GTAGTTTTTA TGGGCTTAAC ACTCATGCCG 
GCATACGGAG ATGGCGGTAA TGGTCGCCGA TTAGAAAGCA ACGAAGTTTT TTCTCCCACT
CCAGACGCGC CGTCCTCCTC CCACCCCCAT ATTGAGTATC GTTTGCAAAA GATGCTAGAA
GGGGCGGCCA GCCCTAAGGG AACGGCCCAG ATCCAGCCTG TGCAAATTCA TATTCTTTTG
GCTGAGCCGT TGACCGAGGA GACCCGCTTA GCCATCGAGG CGTTGGGGGG TAAGGTGGAG
ATCGAGCGCG GTGAGCATTT GCAAGTATCG TTGCCCCTGA TGCGGGTTTC CGAGTTAGCG
GCCTTATCCC AGGTCCAGTA TGTGCGGTTA CCCCGCCGGG CCGAGCCTCG GGAGGAAAAT
CGCCTTGTGC CGCAAAGTAT CCGTAGTGAA GGTGTCCAGG TCACTTTTGC CGATCAGCTC
CACTTGCTAG GGGTGACGGG AAAGGGAGTG CGGGTTGCTG TTTTGGATAG GGGATTCCAG
GGATATCAGG ATCTCTTGGG GATTGAATTA CCCCTTGAGG TCACGACAAA AAATTTTAAT
CTAGGGGAAG GGTTCGAGGG GACTCGTCAT GGTACCGCGG TAACTGAAAT TCTATACGAT
ATGGCCCCCG AGGCTGAGTT TACGCTGGTC GCGGTGAGTA CCGAATTAGA ATACATGGCG
GCGCTGGATT GGTTGCGTGC GCAAGGGGTC TCCATTGTTT CATTTTCCCT GGGCTTCGAT
AATCTCGGTC CCTTGGATGG CAGGAGTCCG ATTTCCGCGG CTGCAAGCCG GTTGTTTGAT
GAAGCAGGTA TCTTGTTTGT CGCTGCTGCC GGGAATGAGC AGCAAAATTA CTGGAGTGGT
CTTTTTAACG ACCTGGATGG CAACGGCGCT CACGACTTTA GCAATGAAGA CGAGGCCCTA
AGTGTCCAGT TGCGTGAGGG AGATGAAGTT CGCATCATTC TTAACTGGGA CGATTGGGGC
GAGGATCCCG CCCATCCCCG CGCCGAGCAA GATTATGATC TCTATATCTT TTGCCCAGGC
ACAGTCCAGT TCTCCTCCGA TAATGCCTGT GTTTCATCCG TGGGCTTGCA AACGGGACTA
TTAGGCCAGG AACCGATAGA ACAGGTTTTT TTCGCTGCGC CTGCAACGGG CAGGTACGAT
ATATTTATCG TGCGCGGGAG TCCTGGGGCC GGTGCGCGGT TGCTGCGACT ATTCGTCGGC
GGTAGCCAGG GAGAAATCTT TCCCATGGAG TATCAAAATA CTGCCAGTAC CTTGGTTTCT
CCTAGTGATG GTCGGAGTGT ATTTGCGGTC GGCGCCGTGG ATATAGATAG CCAGCAATTA
ACTTTTACTT CCTCCCTAGG TCCTACCTGG GATGGTCGTG TCAAGCCGGA TATCGTTGCC
CCTGATGGCG TGACTACCGC CGCTCTGGGC GCTTTTTTCG GGACTTCGGC GGCAACTCCT
TATGTGGCGG GAGCAGGGGC GTTGCTCAAG TCTCAAGAGC CCAACCGCAG CGCCCAGGAT
CTCAAGCTAT TGCTACAACA GGCCAGCACG GATCGTGCCG CTGGGGGAAA AGATAATGAA
TATGGTTCCG GAGCCTTGCT GCTTGAGAAT TTTGTAGCCG ATGAGCGACT CTCGCCCCTG
TCGGGTGTAT GGTGGAACCC TTCGCAAGAT GGGCATGGCT TCTTTTTCGG TGTTCGCAAT
GATAGGTTAG TAGCGACTTG GTATACCTAT GACGGAGCGG GTAACCCGTT TTGGTTACTA
TCCGCTGGCT CCATGTCTGC GACGAGGCGC TATTCTGGTA CGCTATATGC CTTTCACGGT
CCGCCATTAA AATCGCCCTT GAATACGCTG TTTGATAGTA GCGGTAGCAC CGTGGCTACA
AATGAAGTAG GCAGCCTCGA TATTGATTTT ATCCATCCTG GAGAAGCTTC CATCAATATT
CAACTGAGTG GCGATTCTCT AATTTTTCCA AATACATTTA GCCTTAAGGC GCAGCCTTTT
TTGGCCTATC CGGCAGCTGC CCAAAATCCC CAGGTTCCTT ATACTTCAAA GTACAATGGC
CTATGGTGGA ATGCGGAGCA GAGCGGACAC GGTTTTTTTA TTAATATTCA AGAGAGCGTA
CTAACCGCAG CGTGGTATAC CTACGATGAT AGCCAGGGAG AGCCGGTCTG GATTTTGACG
GCGGGTTCTA TGGACTCCGC AGCTGCCTAC TCAGGAACGG CCTATCGTTT TTCGGGTCCA
GCTCTTATGC CTGGCGCAAA TCTTGCTGAT TACTTTGATG AAACGGGATC TACGGTGAAT
GGAGTGGCCA GTGGTACTTT TTCCATTACT TTTACCTCGA ATACCACAGC GGTCGCAACT
ATCGGTAATG TGCTTTCGGT GAATGAAACC CTACAGTTAG AGCGCTTTAA TTTTTAA
 
Protein sequence
MIHWHLMRVC VVFMGLTLMP AYGDGGNGRR LESNEVFSPT PDAPSSSHPH IEYRLQKMLE 
GAASPKGTAQ IQPVQIHILL AEPLTEETRL AIEALGGKVE IERGEHLQVS LPLMRVSELA
ALSQVQYVRL PRRAEPREEN RLVPQSIRSE GVQVTFADQL HLLGVTGKGV RVAVLDRGFQ
GYQDLLGIEL PLEVTTKNFN LGEGFEGTRH GTAVTEILYD MAPEAEFTLV AVSTELEYMA
ALDWLRAQGV SIVSFSLGFD NLGPLDGRSP ISAAASRLFD EAGILFVAAA GNEQQNYWSG
LFNDLDGNGA HDFSNEDEAL SVQLREGDEV RIILNWDDWG EDPAHPRAEQ DYDLYIFCPG
TVQFSSDNAC VSSVGLQTGL LGQEPIEQVF FAAPATGRYD IFIVRGSPGA GARLLRLFVG
GSQGEIFPME YQNTASTLVS PSDGRSVFAV GAVDIDSQQL TFTSSLGPTW DGRVKPDIVA
PDGVTTAALG AFFGTSAATP YVAGAGALLK SQEPNRSAQD LKLLLQQAST DRAAGGKDNE
YGSGALLLEN FVADERLSPL SGVWWNPSQD GHGFFFGVRN DRLVATWYTY DGAGNPFWLL
SAGSMSATRR YSGTLYAFHG PPLKSPLNTL FDSSGSTVAT NEVGSLDIDF IHPGEASINI
QLSGDSLIFP NTFSLKAQPF LAYPAAAQNP QVPYTSKYNG LWWNAEQSGH GFFINIQESV
LTAAWYTYDD SQGEPVWILT AGSMDSAAAY SGTAYRFSGP ALMPGANLAD YFDETGSTVN
GVASGTFSIT FTSNTTAVAT IGNVLSVNET LQLERFNF