Gene Noc_1611 details

Gene Information

Locus tag: Noc_1611
Symbol:
ID: 3705734
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 1801093
End bp: 1802232
Gene Length: 1140 bp
Protein Length: 379 aa
Translation table: 11
GC content: 56%
IMG OID: 637738087
Product: hypothetical protein
Protein accession: YP_343616
Protein GI: 77165091
COG category: [R] General function prediction only
COG ID: [COG5621] Predicted secreted hydrolase
TIGRFAM ID:
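The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that the start and end coordinates are 1-based and inclusive and that the gene length includes the stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(1801093, 1802232)
print(length_bp)                  # 1140
print(protein_length(length_bp))  # 379
```

Under those assumptions both values agree with the Gene Length and Protein Length fields of this record.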


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTTCCAA GGAAAACTCA CCTCTTTTGG CTACTGGCGC TCATCGCGAT AGGAATCGGT 
AGCCTTCAGT GGTTGCAAAG CAACGAACCG GCATTGGAGC CCCAGGGAGG GATTCCCTTA
GCGGAGCTGC TAGGCGAGCA GGAAGGATTT GCCCGAGTAG AGGCCCCCTG GTCATTTTCC
TTCCCCCAGG ATCACGGTGC CCACTCCCGC TACCGGACGG AGTCCTGGCA TTTCACGGGT
CATCTGGCCT CGGAACAGGA AGCGCATTTT GGCTTCCAGC TCAGTTTTTT CCGGGTGGGC
CTGAAGCCTC CGGAAGCGCC CCCTCGTCCT TCAGCTTGGG GGGCCAAGGA GATTTATCGT
GGCCATTTTG CCCTGACCGA CGTCAATCAA GGGCGTTTTC GCGCCTTTGA GCGTTTCAGC
CGGGCTGCCC TAGGGCTGAG CGGAGCTGAC TCCTCGCCTA CCCAGGTATG GGTAGAAAAT
TGGCGCATCC AGGCTCTGGG AGAAGAAAAC GCCAATTTCC GCCTGCGGGC GACCGCCGAT
GGCGCGAGTA TCGACTTGAC TTTACGCAAC CTTAAACCGC CTCTCCTTCC TAACAACGAT
TCCTCCGGAC AGGCCGGTGT CTTTTACAGC TATCAATTCA CGCGTCTAGG GGCCCAGGGC
ACAATCCAGC GAGGTAATCA GATCTACCCC GTAAAAGGTT TAGCCTGGCT AGATCGGGCT
TGGGGAGCGG TACCCGTACC TGCCGGGCCG GTAGTTTGGG ATCGTTTTCT GCTACAGCTA
GATGATGGCC GGGAACTGCT GATTTTTCGA CTCCGTCGGC GGGATGGCAG TGGTACACCT
ATTAATAGCG GATTTTTGGT GGATCGAGCA GGTAAAATCC AATCCTTCGA CTCCGAAGCG
CTCACCATTG AAATATTAGA CTACTGGGAA AGTCCCAAGG ATGGGACCCC ATACCCCGCC
CGCTGGCGGT TTCACCTCCC TGCCCAAGGT ATTGACCTTC GCCTGACCCC TGCTGTGGCG
AATCAGGAAC TCAATCTTTT ACTCCGCTAC TGGGGCGGGT TAGTGCAGGT CAGAGGTCAA
GAGAAAGGGA AAAAGATAAA AGGGCAAGGT TATGTGGAAT TGATCGGCTA CGGAGCATAA
 
Protein sequence
MFPRKTHLFW LLALIAIGIG SLQWLQSNEP ALEPQGGIPL AELLGEQEGF ARVEAPWSFS 
FPQDHGAHSR YRTESWHFTG HLASEQEAHF GFQLSFFRVG LKPPEAPPRP SAWGAKEIYR
GHFALTDVNQ GRFRAFERFS RAALGLSGAD SSPTQVWVEN WRIQALGEEN ANFRLRATAD
GASIDLTLRN LKPPLLPNND SSGQAGVFYS YQFTRLGAQG TIQRGNQIYP VKGLAWLDRA
WGAVPVPAGP VVWDRFLLQL DDGRELLIFR LRRRDGSGTP INSGFLVDRA GKIQSFDSEA
LTIEILDYWE SPKDGTPYPA RWRFHLPAQG IDLRLTPAVA NQELNLLLRY WGGLVQVRGQ
EKGKKIKGQG YVELIGYGA
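
The sequences above are shown in space-separated blocks of ten characters. A minimal sketch of how the gene sequence could be cleaned up, checked against the GC content field, and translated is given below. The helper names (clean, gc_percent, translate) are hypothetical and not part of any database tool. The codon table is the standard amino-acid assignment, which translation table 11 shares; table 11 differs only in the set of permitted start codons, which does not matter here because the gene starts with ATG.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Translation table 11 maps codons to residues the same way; only the
# set of allowed start codons differs, which this sketch ignores.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence listed above.
print(translate("ATGTTTCCAA GGAAAACTCA CCTCTTTTGG"))  # MFPRKTHLFW
```

Passing the full gene sequence to translate and gc_percent allows a comparison with the protein sequence and the GC content field of this record.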