Gene Nmul_A1551 details


Gene Information

Locus tag: Nmul_A1551
Symbol:
ID: 3785273
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 1777319
End bp: 1778638
Gene Length: 1320 bp
Protein Length: 439 aa
Translation table: 11
GC content: 56%
IMG OID: 637811639
Product: homoserine dehydrogenase
Protein accession: YP_412246
Protein GI: 229137830
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0460] Homoserine dehydrogenase
TIGRFAM ID:
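
The coordinate and length fields above are consistent with each other, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single untranslated stop codon. The variable names are made up for this example and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1777319
end_bp = 1778638

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated,
# so the protein is one residue shorter than the codon count.
codons, remainder = divmod(gene_length_bp, 3)
protein_length_aa = codons - 1

print(gene_length_bp)     # 1320
print(remainder)          # 0
print(protein_length_aa)  # 439
```

Under these assumptions the results agree with the listed Gene Length (1320 bp) and Protein Length (439 aa).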


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAACCTA TCCATGTGGG CCTGTTGGGG GCAGGCACGG TCGGCAGCGG CACGTTCGCC 
GTGCTCAAAC GCAATCAGGA AGAGATCACC CGCCGCGCCG GCCGCAGTAT CGTCGTTCGC
ATGATTGCTG ATCGAGAGGA AGAGAAGGCC CGTCAGATCG CAGGCGACGA TGAGGTCATC
GTCACACGCG ATGCGAATGA GGTGGTGATG AATCCCGATA TCGATATCGT GGTCGAACTG
ATCGGCGGCT ACACCGCGGC CAGGGACCTG ATACTGAAGG CTATCGAGAA TGGCAAGCAT
GTCATTACCG CCAACAAGGC ATTGCTTGCT TCACATGGGA CCGAAATCTT CGCCGCGGCG
CAGAAGAAAG GCGTAATGGT GGCGTTCGAA GCGGCGGTGG CGGGAGGCAT TCCTATAATC
AAGGCTCTGC GTGAGGGATT GACTGCCAAC CGCATCGAGT GGATAGCCGG CATCATCAAT
GGCACGAGCA ACTTCATTCT CTCGGAAATG CGGGATAAGG GGTTGACGTT CGAAACCGTA
TTGAAGCAGG CGCAAAAACT GGGTTATGCC GAAGCCGATC CCACTTTTGA CATCGAGGGC
ATCGATGCGG CGCATAAACT CACGATCATG GCCTCGATCG CCTTTGGCAT TCCAATGCAG
TTTGACAAGG TATATACCGA GGGTATAACC AAATTGACCC GCGAGGATAT TCGTTATGCG
GAGGAACTGG GTTATCGCAT CAAGCTGTTG GGCATTACGA AACGTACGTC CGGAGGAATC
GAGTTGCGTG TGCATCCGAC ACTCATCCCT GCTCGAAGAC TGATCGCCAA TGTCGAGGGG
GTGATGAATG CCATCGTGGT GAGAGGCGAT GCGGTAGGCT CTACCCTCTA TTATGGTCCG
GGAGCGGGTG CCGAACCTAC AGGGAGTTCA GTCGTGGCAG ACCTGGTGGA TGTAACTCGC
ATGCACACAG CCGATCCCAA GCACCGCGTT CCTCATCTCG CCTTCCAGCC AGGCCGCCTG
TCGGATACGC CGATCCTCAC GATGGACGAG GTGGAAACGT CTTATTACCT GCGGCTGCGG
GTCATGGACA AACCTGGGGC CCTGGCCGAT ATCACGCGGG TGCTTGCGGA CCTCGGCATT
TCCATCGAAG CCATGATGCA GAAAGAGCCA AGCGAAGGCG AAGACCAGGT GGATATCATT
ATGCTCACGC ATTTGGCGGT GGAAAGAAAC GTTAACGATG CGATCGCCCG AATAAAGCGA
TTGCCCATAA CGACCGGCAA GGTGACCCGC ATCCGGCTGG AGCATCTGGG CAGCAAATAA
 
Protein sequence
MKPIHVGLLG AGTVGSGTFA VLKRNQEEIT RRAGRSIVVR MIADREEEKA RQIAGDDEVI 
VTRDANEVVM NPDIDIVVEL IGGYTAARDL ILKAIENGKH VITANKALLA SHGTEIFAAA
QKKGVMVAFE AAVAGGIPII KALREGLTAN RIEWIAGIIN GTSNFILSEM RDKGLTFETV
LKQAQKLGYA EADPTFDIEG IDAAHKLTIM ASIAFGIPMQ FDKVYTEGIT KLTREDIRYA
EELGYRIKLL GITKRTSGGI ELRVHPTLIP ARRLIANVEG VMNAIVVRGD AVGSTLYYGP
GAGAEPTGSS VVADLVDVTR MHTADPKHRV PHLAFQPGRL SDTPILTMDE VETSYYLRLR
VMDKPGALAD ITRVLADLGI SIEAMMQKEP SEGEDQVDII MLTHLAVERN VNDAIARIKR
LPITTGKVTR IRLEHLGSK
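
The gene and protein sequences above are related by translation table 11. As a minimal sketch, the Python below strips the block spacing from a sequence row, translates it codon by codon, and computes a GC fraction. It assumes the sequence contains only A, C, G and T, and it uses the standard codon assignments, which table 11 shares (table 11 differs in which codons may act as start codons, which does not matter here because the gene begins with ATG). The function names `clean`, `translate` and `gc_fraction` are hypothetical helpers written for this example.

```python
# Standard codon assignments in TCAG order (shared by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases into blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]  # KeyError on non-ACGT input
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First row of the gene sequence listed above.
first_row = "ATGAAACCTA TCCATGTGGG CCTGTTGGGG GCAGGCACGG TCGGCAGCGG CACGTTCGCC"
print(translate(first_row))  # MKPIHVGLLGAGTVGSGTFA
```

The printed residues match the first twenty residues of the protein sequence above. Passing the full gene sequence to `gc_fraction` would give a value to compare against the listed GC content.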