Gene Noc_1722 details


Gene Information

Locus tag: Noc_1722
Symbol:
ID: 3705037
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 1926844
End bp: 1927839
Gene Length: 996 bp
Protein Length: 331 aa
Translation table: 11
GC content: 57%
IMG OID: 637738203
Product: 4-hydroxythreonine-4-phosphate dehydrogenase
Protein accession: YP_343724
Protein GI: 77165199
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG1995] Pyridoxal phosphate biosynthesis protein
TIGRFAM ID: [TIGR00557] 4-hydroxythreonine-4-phosphate dehydrogenase
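
The coordinate and length fields in the table above are consistent with one another, which a minimal Python sketch can confirm. It rests on two assumptions that the record does not state: that the start and end positions are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length.

```python
# Values copied from the Gene Information table.
start_bp = 1926844
end_bp = 1927839

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and adds no residue to the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)  # 996 331
```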


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTTGCC TGCGATTGGC CCTCACTCCT GGCGAACCTG CTGGCATTGG CCCGGATATC 
TCGATCCAAT TGGCCTGCCA GCGGCGGGAA TATGATTTGG TCGTGGTGGC CGATCCAGAA
ATTCTACGCC AACGGGCACG GCAGCGGGGG TTAGCCCTGA TAGTTGAGTC CTATGATCCT
GCTCAGCCGC CGGAGCCAGG GGTATCCGGA ACATTGAAGG TTCTACCATT AAGGGCTCCC
GTCCCTGTTA CCACGGGGTG TTTGTCTCCG GGGAATGCGG CGTATGTTTT AGCATGTTTG
CGGCGCAGTG TAGCAGGCTG CTTACAAGGA GAGTTTTCCG CTTTGGTCAC CGGTCCCGTT
CACAAAGGGA TTATCAATCA GGCAGGCATC TCCTTTAGTG GTCATACTGA ATTTTTAGCC
CAGCTGTGCA ACCGCTCTCA AGTAGTGATG ATGTTGACCG CGCCTGGACT GCGGGTAGCG
TTAGCCACCA CTCATTTACC CTTGCGTGAG GTGAGTGCCG CGATTACCCG CCGAGGACTG
GAGGGGACGC TGCGGATACT GCACCGGGAT TTGAGGCAGC GCTTTGGAAT TTCCAGGCCG
CGTATTTTAG TCTGCGGCTT GAATCCCCAT GCGGGAGAGG GGGGGCATTT GGGGCGTGAG
GAAATCGAGA TCATCGAGCC GGTCATAGCA ACCCTCCGGG CTCAGGGAAT GCAATTATTC
GGACCCTTGC CGGCGGATAC CTTGTTTGTG CCCCGTTACC TGAAAGAAGC GGATGCAGTG
CTCGCCATGT ACCATGATCA GGGGCTGCCC GTGCTCAAGC ATGTAGGTTT TGGGCGAGCG
GTGAATATTA CCTTGGGGCT GCCTATTATT CGCACCTCAG TGGACCATGG CACCGCTTTG
GATCTTGCCG GCAAGGGCCC GGTAGGAAGC GGCAGCTTGG AAGCGGCTGT GGAGGCGGCA
CTAGCGATGG CGGAGAGAGA GCAACTTTCA AAGTGA
 
Protein sequence
MACLRLALTP GEPAGIGPDI SIQLACQRRE YDLVVVADPE ILRQRARQRG LALIVESYDP 
AQPPEPGVSG TLKVLPLRAP VPVTTGCLSP GNAAYVLACL RRSVAGCLQG EFSALVTGPV
HKGIINQAGI SFSGHTEFLA QLCNRSQVVM MLTAPGLRVA LATTHLPLRE VSAAITRRGL
EGTLRILHRD LRQRFGISRP RILVCGLNPH AGEGGHLGRE EIEIIEPVIA TLRAQGMQLF
GPLPADTLFV PRYLKEADAV LAMYHDQGLP VLKHVGFGRA VNITLGLPII RTSVDHGTAL
DLAGKGPVGS GSLEAAVEAA LAMAEREQLS K
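
The relationship between the gene sequence and the protein sequence above can be illustrated with a short Python sketch. It is not the tool that produced this record. The `translate` helper and the codon table construction are hypothetical names introduced here, and the sketch assumes the sequence is read in frame from its first base. Translation table 11 assigns the same amino acids to sense codons as the standard code (it differs in the start codons it permits), so the standard assignments are used.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence listed above.
print(translate("ATGGCTTGCC TGCGATTGGC CCTCACTCCT"))  # MACLRLALTP
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `translate` in the same way would allow the whole protein to be compared against the listing.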