Gene Noc_0435 details


Gene Information

Locus tag: Noc_0435
Symbol:
ID: 3706606
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 473378
End bp: 474433
Gene Length: 1056 bp
Protein Length: 351 aa
Translation table: 11
GC content: 58%
IMG OID: 637736945
Product: zinc-containing alcohol dehydrogenase superfamily protein
Protein accession: YP_342489
Protein GI: 77163964
COG category: [E] Amino acid transport and metabolism; [R] General function prediction only
COG ID: [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases
TIGRFAM ID:
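
The coordinate and length fields above are mutually consistent, which can be checked with a minimal sketch. It assumes that Start bp and End bp are 1-based, inclusive positions and that the CDS is unspliced and ends in a single stop codon; the function names are illustrative and not part of the source record.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 473378
END_BP = 474433
GENE_LENGTH_BP = 1056
PROTEIN_LENGTH_AA = 351


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one for the stop codon.

    Assumes an intact, unspliced CDS whose last codon is a stop.
    """
    codons, remainder = divmod(gene_length_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1


print(span_length(START_BP, END_BP))            # 1056
print(expected_protein_length(GENE_LENGTH_BP))  # 351
```

Both results agree with the Gene Length and Protein Length fields listed in the record.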


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.692768
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGGCAT TCGTAATGTT AAACATCGGG CAAGTGGGGG TCGTTGAAAA GGATCGTCCG 
ACCTGTGGGC CTCTGGACGC GATCCTGCGT CCGACGAAGG GGCTGATCTG CACATCTGAT
GTGCATACCG TCCATGGTGC GGTTGGAGAA CGGGAGAACC TCACCCTGGG ACATGAAGCT
GTCGGCGTGG TCGAGGAAGT TGGAGCGCTA GTCGCCAACT TCAAACAGGG AGACCGGGTA
GCGGTGGGTG CTATCACCCC GGACTGGGGT TCAGACGCTG CCCAGGGAGG CCATTCATCG
CAGTCTGGTG GAGCACTGGG AGGCTGGAAG TTTGCCAATA TCAAAGACGG CACCTTTGCC
GAGTACGTAC ATGTCAACGA AGCAGACGCC AACCTCGCGC TGATCCCCAA GGGTGTGCCA
GACGAGTCGG CTGTGTATGT GTGTGACATG ATGAGCACTG GATTCATGGC CGCAGAGAAC
GCCAAAATCC CCATCGGTGG CAATGTCGTA GTCTTCGCCC AGGGACCCGT GGGCCTCATG
TGCACTGTGG GAGCACGACT GCAGGGCGCC GGTTTCGTGA TCGCGGTTGA AAGCGTGCCC
AAGCGCCAGG AGCTGGCCCG GCACTTCGGG GCCGACGAGG TGGTGGACTT TACCAAGGTG
GACGTGGTAG AGCGGATTCT TGAGCTCACG AACGGCGAAG GCGTGGATGC GGCCATCGAT
GCGCTAGGCA CATCCCAGGT GCTCCAACAG TGCGTCAAGG TGACCAAGCC CGGCGGTATG
ATCTCCAACG CTGGTTACCA TGGTGATGGC GAATTTGTCG AAATCCCCCG CGTGGAGTGG
GGCGTCGGAA TGGCGGAGAA GGACATCGCG ACGGGTCTCT GCCCAGGCGG ACACCTACGG
CTCTCCCGTT TACTGAGGTT GCTGGAAACC GGGCGGATCG ATCCCACTCC GATGACTACC
CATACCTTTG GATTCGATGA AATCGAGAAG GCATTCCGCA TGATGGAGAA AAAAGAGGAC
GGTATGATCA AACCGATGAT TGATTTCGAA GCCTGA
 
Protein sequence
MKAFVMLNIG QVGVVEKDRP TCGPLDAILR PTKGLICTSD VHTVHGAVGE RENLTLGHEA 
VGVVEEVGAL VANFKQGDRV AVGAITPDWG SDAAQGGHSS QSGGALGGWK FANIKDGTFA
EYVHVNEADA NLALIPKGVP DESAVYVCDM MSTGFMAAEN AKIPIGGNVV VFAQGPVGLM
CTVGARLQGA GFVIAVESVP KRQELARHFG ADEVVDFTKV DVVERILELT NGEGVDAAID
ALGTSQVLQQ CVKVTKPGGM ISNAGYHGDG EFVEIPRVEW GVGMAEKDIA TGLCPGGHLR
LSRLLRLLET GRIDPTPMTT HTFGFDEIEK AFRMMEKKED GMIKPMIDFE A
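
The relationship between the two sequences above can be sketched in a few lines of Python. This is an illustrative sketch, not the database's own method. It assumes that the amino-acid assignments of translation table 11 match the standard genetic code (to my knowledge the two tables differ only in the permitted start codons), and the helper names `clean`, `gc_fraction`, and `translate` are hypothetical. Whitespace is stripped first because the sequences are printed in blocks of ten.

```python
# Codon table built in TCAG order; assumed to match the amino-acid
# assignments of translation table 11 (same as the standard code).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to print 10-letter blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        amino_acid = CODON_TABLE[s[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks and the final line of the gene sequence in this record.
first_blocks = "ATGAAGGCAT TCGTAATGTT AAACATCGGG"
last_line = "GGTATGATCA AACCGATGAT TGATTTCGAA GCCTGA"

print(translate(first_blocks))  # MKAFVMLNIG
print(translate(last_line))     # GMIKPMIDFEA
```

The two printed fragments match the start and end of the protein sequence above. Passing the full gene sequence to `gc_fraction` would give the value that the GC content field reports as a rounded percentage.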