Gene Noc_0472 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_0472 
Symbol 
ID3706643 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp507877 
End bp508863 
Gene Length987 bp 
Protein Length328 aa 
Translation table11 
GC content58% 
IMG OID637736981 
Productshort chain dehydrogenase 
Protein accessionYP_342525 
Protein GI77164000 
COG category[R] General function prediction only 
COG ID[COG4221] Short-chain alcohol dehydrogenase of unknown specificity 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTTTTT CACCCAAAGT CGTTGTGATC ACGGGCGCCT CCGCGGGGGT AGGTCGGGCA 
GTCGCCCAGG CGTTTGCTAG AGGCGGAGTC TCGATCGGAC TGCTGGCGCG GGGGCGTGAA
GGATTAGAAG GGGCGTGCCG GGAAGTGGAA TCCCAGGGAG GAAAAGCTTT GATTCTCCCA
ACGGATGTGG CTGATGCGGA TCAAGTGGAG GCGGCCGCGG CTGCCGTTGA AAAAGCCTTT
GGCCCCATTG AGGTCTGGAT TAACGACGCC ATGACCAGCG TGTTCTCTCC GGTCAAGGAA
ATGACCCCCG AGGAATTTCG CCGCGTCACC GAGGTGACCT ACCTCGGTTG CGTCAATGGG
ACCCTGGCCG CCCTTAAGCG TATGCTACCC CGCAACCGAG GGGTGATTAT TCAGGTGGGT
TCGGCTTTGG CCTATCGGGC CATCCCTTTG CAAGCCGCCT ATTGCGCGGC CAAACATGCC
ATTCGGGGCT TTACCGATTC GCTGCGATGC GAACTGCTCC ATGAAAAATC CCAAGTCCGC
GTGACCATGG TGCAAATGCC CGCGCTCAAT ACGCCTCAGT TTGACTGGAT TAAATCACGG
CTGCCTCGCA AAGCCCAGCC CGTACCGCCA GTCTATCAGC CGGAAGTAGC GGCCCGGGCT
ATTCTTTGGA CCGTCAGGCA TCCCTGCCGC GAGCTGAAGG TGGGACTGCC CACGATTTTA
ATCGTAGCTA TCAATAAGTT CATGCCGGGC CTGCTAGATC ATTATTTAGC CCGCACTGGC
TATCAGTCCC AGCAGCGGGA CGAACCGGAA GATCCTAACC GCCCCCACAA TCTTTGGAAT
CCGGTCGCTG GGGATTTCGG CACCCACGGC AGCTTTGATG AGATAGCCCA TCGCGCCAGT
ATCTCTCTTT GGGTAACCAC CCACCCCCGT TGGTTTGCCC TCGCGGCAGG GTTGATTTTA
GCGCTGTTTA TTCTAGCTTT TTTATAA
 
Protein sequence
MPFSPKVVVI TGASAGVGRA VAQAFARGGV SIGLLARGRE GLEGACREVE SQGGKALILP 
TDVADADQVE AAAAAVEKAF GPIEVWINDA MTSVFSPVKE MTPEEFRRVT EVTYLGCVNG
TLAALKRMLP RNRGVIIQVG SALAYRAIPL QAAYCAAKHA IRGFTDSLRC ELLHEKSQVR
VTMVQMPALN TPQFDWIKSR LPRKAQPVPP VYQPEVAARA ILWTVRHPCR ELKVGLPTIL
IVAINKFMPG LLDHYLARTG YQSQQRDEPE DPNRPHNLWN PVAGDFGTHG SFDEIAHRAS
ISLWVTTHPR WFALAAGLIL ALFILAFL