Gene Noc_2454 details


Gene Information

Locus tag: Noc_2454
Symbol:
ID: 3704856
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 2800000
End bp: 2801310
Gene Length: 1311 bp
Protein Length: 436 aa
Translation table: 11
GC content: 52%
IMG OID: 637738933
Product: homoserine dehydrogenase
Protein accession: YP_344437
Protein GI: 77165912
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0460] Homoserine dehydrogenase
TIGRFAM ID:
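
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and assumes the start and end positions are inclusive (an assumption that is consistent with the reported gene length); the variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2800000
end_bp = 2801310

# Assumption: both coordinates are inclusive, so the span covers end - start + 1 bases.
gene_length_bp = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is the stop codon and is not
# counted in the protein length.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)  # prints: 1311 436
```

Both results agree with the Gene Length and Protein Length fields in the record.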


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.38424
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGGAGCCGG TTAAGGTAGG TTTGCTTGGT TTAGGTACGG TGGGTGGCGG TACAGTCAAT 
GTTCTCTCGG GTAATGCCGA AGAAATTACC CGCCGCGCTG GGCGTGGTAT CCAGGTCTAC
TGTGCTGCGA CCCGCGACTC CCGTAAGGCC CGCAGCTGTG ATACAGAAGG TATCTGGTTA
ACGACTAACC CCCATGAAGT GGTGGCCGAT CCCCAGATCG AAATTATCGT TGAACTCATA
GGGGGGACTG ATTTAGCCCG TACCCTAGTG CTGAAGGCGA TTGCCGAGGG TAAGCATGTG
GTTACAGCTA ACAAAGCGCT GATTGCACTT TATGGCAACG AGATCTTTGA GGCGGCTCAA
AAAGCGGGCG TAATGGTTGC TTTTGAAGCG GCGGTAGGGG GAGGTATTCC CATTATCAAG
GCTCTGCGTG AAGGTCTTGC GGGTAACCAT ATCGAATGGT TAGCCGGAAT TATCAATGGT
ACGAGTAATT TTATCCTCAC CGAGATGCGG GATAAAGGGT GTGATTTTGC TGAAGCGCTA
GTGGATGCTC AACACCGAGG CTATGCAGAG GCAAATCCTA CTTTTGACAT AGAGGGCATT
GATGCTGCCC ATAAGTTGAC GATTTTAGCC TCTATTGCTT TCGGTATTCC GTTACAGTTT
GAGCAAGTCT ATACAGAGGG AATTGGTGCC ATTACCCGTG AGGATATCGA TTACGCTGGA
CAATTGGGTT ACCGGATCAA ACATCTGGGC ATTGCGCGCC GCTTGGCAGA GGGGGTTGAA
CTGAGAGTAC ACCCTACCCT TATTCCCTAT CGCCGCTTGA TCGCCAATGT AGAAGGAGTC
ATGAATGCGA TCCTGGTAAA GGGAGATGCG GTGGGATCGA CTCTTTACTA TGGCCCTGGT
GCAGGCGCTG ATCCCACGGC CTCCGCCGTG GTTGCTGATC TGGTGGATGT GGTTCGGGCA
TTAACTTCTG ACCCAGAAAA CCGAGTACCG CATCTTGCTT TTCAACCCGA TGCCTTGGTG
GATCTCCCCA TTCTTTCGAT GGAAGAGGTA AAAACCGCCT ATTATCTGCG GATGCGGGCG
ATGGATGAGC CGGGCGTACT GGCCGAGGTC ACCCGAGTAT TTGGGGATCA AAGTATCAGT
ATTGAGGCGA TCCTTCAGAA GGAACCTGCG GCAGGGGAGA ACCACGTGCC TATTATCATG
CTGACCCAAC CTGTGCTGGA ACGGAATATG AACGAAGCTA TTCGCCGTAT TGAAGCCTTA
GAGTCCATCG CAGGGCCGGT GACCCGGATC CGCTTAGAGA CTCTGTGTTA G
 
Protein sequence
MEPVKVGLLG LGTVGGGTVN VLSGNAEEIT RRAGRGIQVY CAATRDSRKA RSCDTEGIWL 
TTNPHEVVAD PQIEIIVELI GGTDLARTLV LKAIAEGKHV VTANKALIAL YGNEIFEAAQ
KAGVMVAFEA AVGGGIPIIK ALREGLAGNH IEWLAGIING TSNFILTEMR DKGCDFAEAL
VDAQHRGYAE ANPTFDIEGI DAAHKLTILA SIAFGIPLQF EQVYTEGIGA ITREDIDYAG
QLGYRIKHLG IARRLAEGVE LRVHPTLIPY RRLIANVEGV MNAILVKGDA VGSTLYYGPG
AGADPTASAV VADLVDVVRA LTSDPENRVP HLAFQPDALV DLPILSMEEV KTAYYLRMRA
MDEPGVLAEV TRVFGDQSIS IEAILQKEPA AGENHVPIIM LTQPVLERNM NEAIRRIEAL
ESIAGPVTRI RLETLC
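
The gene sequence begins with TTG while the protein sequence begins with M. This is expected under translation table 11 (the bacterial, archaeal and plant plastid code), where TTG is one of the permitted alternative start codons and is read as methionine when it initiates translation. The following is a minimal sketch of that rule, not the tool used to produce this record: the function name `translate_cds` and the constants are hypothetical, the amino-acid assignments are the standard ones that table 11 shares with the standard code, and the start-codon set is the one listed for table 11 in the NCBI genetic code tables.

```python
# Codon order follows the usual NCBI layout: bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Assumption: start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, reading a valid first codon as M."""
    seq = "".join(dna.split()).upper()  # drop the spaces used for 10-base grouping
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for offset in range(0, len(seq), 3):
        codon = seq[offset:offset + 3]
        if offset == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is always methionine
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence above (60 bases, 20 codons).
first_line = "TTGGAGCCGG TTAAGGTAGG TTTGCTTGGT TTAGGTACGG TGGGTGGCGG TACAGTCAAT"
print(translate_cds(first_line))  # prints: MEPVKVGLLGLGTVGGGTVN
```

The printed residues match the first 20 residues of the protein sequence in the record. Passing the full 1311-base gene sequence to the same function should yield the complete protein, stopping at the terminal TAG.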