Gene Noc_0611 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_0611 
Symbol 
ID3706843 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp656756 
End bp658042 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content51% 
IMG OID637737119 
ProductTPR repeat-containing protein 
Protein accessionYP_342660 
Protein GI77164135 
COG category[R] General function prediction only 
COG ID[COG4783] Putative Zn-dependent protease, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000267471 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGAAGGTT TTTTTCAGTG CTTAAGAGGG CACGGCCGTA CGCTTGTGGG AATATGGGCT 
AGGCGGTTAC CGCTTTTTAT GCTAGCAAAG GTATTGCTCT TAGGGGGAGG ATTATTGGTG
AATGCCCAAG CGGCTGAGCA GTACCTTCTG ACCCCTTCTA CCTATGAATC CCTGAGCGCT
GTCCATAAGC TCATGGACAA GCAGCAGTAT ACATCTGCCC TTAAACAACT CACCGCACTA
CAAGACGAGG TGAATGGTAA GGCTTACGAG CAGGCGGTTG TGCTCCAAAC CCTCGGCTAT
GTGTATTCTT CTTTAGAAAA ATATCCCAAG GCGATCCAGG CATTTAAAGC TAGCCTAGCC
CTAGATGCGC TGCCTGCCCG GGTCACTCAT GATTTGCGTT ATGGTCTGGC GCAGCTTTAC
ATGGCTACGG AGCAGTATGG AAAAGCTCTC CAGTTGCTAG AGGCATGGTT TAAGGCTGCG
GAATCTCCCC CGGCGGAAGC CCATGTATTG GCTGCTAGTG CCTACTACCA TCTGAAGCGG
TATGCCGAAG TCATTCCTCA TATTGAGGTA GCCATTGAGC TTGCCCAAGC GCCGCAAGAG
GAGTGGTATC AACTGCACCT TGCCGCCCGT TTAGAGTTGA AGCAATATTC CCAGGCGGCC
CAAATATTAG AAACCTTGAT AGGCCACTTC CCTAACAAGG AGCAGTATTG GAAGCAGCTG
GGAGCGGTGT ACATGGAGAT GAATAAAGAG CATCGGGCCT TAGCCGTGGA AGCGCTAGTA
GCACATATGG AGCCTCTTGA TAGTAAAAGC CTCATTCACC TTGCTAATCT TTATCGTTAC
CTCCATATTC CCTATAAAGC CGCGCAAGTT TTGCAGCAGG GTTTAAGGGA TGAAACTATT
CAAACAAGCA GCAAGCATTG GGAATTTCTT GCCGATGCCT GGCTCGCCGC CCGGGAATGG
GAACGCGCCG CTGCTGCTTT TAAGGAGGCA GGGCGGTTGA GGCAGGATGG CAAAATGGCC
CTTCGCCGCG GTCAGGTTCT CATCGAGCTG CAAGACTGGA AGCAAGCAGA GAAAGCCTTT
GCGCAAAGTT TGCGCAAAGG GGGACTGGAT GATCCTGGGC AAGCCCGTTT TTTCTTGAGT
CAGGCGAGAT ATGAGCAGGG GCACTTTGCA GAAGCTATTC AGGCGTTAAA GTTAATTCAG
GCTTCCTCAG CTTATAGCAA ACAGGCCGCC CAATGGTTAA AGCATTTACA GGTAGTCCGG
AAGCAGGGAG CTGACGGCAA AGGTTAA
 
Protein sequence
MEGFFQCLRG HGRTLVGIWA RRLPLFMLAK VLLLGGGLLV NAQAAEQYLL TPSTYESLSA 
VHKLMDKQQY TSALKQLTAL QDEVNGKAYE QAVVLQTLGY VYSSLEKYPK AIQAFKASLA
LDALPARVTH DLRYGLAQLY MATEQYGKAL QLLEAWFKAA ESPPAEAHVL AASAYYHLKR
YAEVIPHIEV AIELAQAPQE EWYQLHLAAR LELKQYSQAA QILETLIGHF PNKEQYWKQL
GAVYMEMNKE HRALAVEALV AHMEPLDSKS LIHLANLYRY LHIPYKAAQV LQQGLRDETI
QTSSKHWEFL ADAWLAAREW ERAAAAFKEA GRLRQDGKMA LRRGQVLIEL QDWKQAEKAF
AQSLRKGGLD DPGQARFFLS QARYEQGHFA EAIQALKLIQ ASSAYSKQAA QWLKHLQVVR
KQGADGKG