Gene Noc_1078 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_1078 
Symbol 
ID3707203 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp1183345 
End bp1184376 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content54% 
IMG OID637737580 
Productputative dehydrogenase 
Protein accessionYP_343113 
Protein GI77164588 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTACCC CGCCACCACG CTTAGTCCCA TCTATTGCCC GGGCTTATTG GGTGGAAGCG 
TCCGGGAAAG GCGCCATTCG CCAGGAAACG CTCTCTGTGC CGGTACCAGT GGGTTATAGC
CTCCTTGAAA CCTGGCTAAC TGGAATCAGC CCTGGGACAG AGCGATTGGT CGGATTGGGG
AAGGTACCCG CGGAATGCCA GCAAGCCATG GCTTGCCCTG CTATGGGGGG ATCATTCAAG
CTGCCAGTCA AATACGGATA TTGCCTCCTA GGCCAGGCTA TTAATGGCCC CTACGCTGAC
CAGCTCGTCT TTACCATGCA TCCCCACCAA GATTACGCAA TTGTCCCCAA TAAGCAACTA
TTACCCCTCC CTCAGGATAT ACCCCCTCTA CGGGCTACGC TGCTTCCTAA CCTGGAAACC
GCCCTGAACG CCATTTGGGA CAGCGAATAC CAGGCGCCAG CGCCGGTAGC CATCGTCGGT
GGCGGCATTG TGGGCTTGCT TATCGCCTTT TTGCTCAAAA CCGCCTGGGA TGCCTTCCCT
ATTATCATTG AGCGCGATCC GCAGCGTCGG CAACTCATTG AAAAACTAGG ATGGGGACTT
ACTGTCCTTG AAGTCCAGGA GGCCCCCCAG GGGGTATTTT CCCTCTGTTT TCATGCCTCG
GGACAAGGAG CAGGACTGCA AACAGCCTTG GATAGCGTGG GGTTTGAAGG AAAAGTCATT
GAGGTGAGCT GGTTGGCTCA TCAGCCAGTC ACCCTTAACC TGGGCGGATC TTTTCACTTC
CAAAGGAAAC AGATTCTCTC TTCCCAAGTC AGCACGATTG CCAAACCCAA GCGGGAACAT
ACGAGCCACC AGCAGCGTTT AGAGCAGACC CTGAATTATT TGCAAAGCCC CTTACTTGAT
GCTCTTATTG CCCCAGCGAT CACCTTTGAG AGCCTGCCTC TTTTTATGCA GGAACTCTAC
CATAAAAATC CGGTCGACTT TTCCTTTGCC GTGACCTATC CACCCTTTCA TCCTCGACTC
CACAAAGCCT AA
 
Protein sequence
MSTPPPRLVP SIARAYWVEA SGKGAIRQET LSVPVPVGYS LLETWLTGIS PGTERLVGLG 
KVPAECQQAM ACPAMGGSFK LPVKYGYCLL GQAINGPYAD QLVFTMHPHQ DYAIVPNKQL
LPLPQDIPPL RATLLPNLET ALNAIWDSEY QAPAPVAIVG GGIVGLLIAF LLKTAWDAFP
IIIERDPQRR QLIEKLGWGL TVLEVQEAPQ GVFSLCFHAS GQGAGLQTAL DSVGFEGKVI
EVSWLAHQPV TLNLGGSFHF QRKQILSSQV STIAKPKREH TSHQQRLEQT LNYLQSPLLD
ALIAPAITFE SLPLFMQELY HKNPVDFSFA VTYPPFHPRL HKA