Gene Nmar_0696 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_0696 
Symbol 
ID5773236 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp635915 
End bp636925 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content37% 
IMG OID641316332 
Producthomoserine dehydrogenase 
Protein accessionYP_001582030 
Protein GI161528204 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0460] Homoserine dehydrogenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAGAATAA TATTATGTGG ATTTGGCGTT GTTGGCCAAA GTTTAGTAAA ATTATTTGAA 
TCCAGAGCAG AAGACCTGTA TGCAAAGTAT GGACTCAAAC CTAGAGTAGT GGGAGTGTTT
GACAGTAAAG GGAGTGCAAT GGATTCTTCA GGATTAGAGT TAAACAAACT CATCGAAGTT
AAGAAAAAAT TTGGAACTGT AAAAAATTAC GCTGATACAA AAAATACAAT GTCAGGTATT
GACATGCTCA AAAATGTAGA GGCAGATGTG CTCATTGAAA CTACTGCAAG CAACTACAAA
GATGCTGAAC CTGGAATGAC ACACATTATC ACTGCAATGA AAAAAGGAAT GCATGTAATA
TCAGTAAACA AAGGACCTCT GGCACTAGCA TTTCCATCAT TAATGGAGCT TGCAACATAC
AATCGAGTCA TGTTCAAATT TAGTGGAACC GTAGGTGGGG GAACACCAAT TCTAGATTAT
GCAAAAAACA GTCTTAGTGG CGAAAGAATC ACATCATTTG CAGGCATTCT AAATGGAACT
ACAAACTATA TCCTAACAAA CATGGCAACA GGAGTTTCAT ATGAAGATGC ACTAAAAGAT
GCCCAAGACA AGGGCTACGT AGAGGCAGAT GAAGCATTAG ATTTAGACGG ACTTGACGCT
GCAGCCAAAT TAGTGATTCT TGCAAATTGG ATTATGGGAA TGAAAGTTAC AATGCCAGAC
ATTAATTGTA CTGGAATTCG CAAAGTAACC ACAGAAGATA TCAAAAAAGC AGAGAAAAAC
AATTGTGCAG TAAAACTGAT TGCATCATGC AATAAAGAAT TGATAGTTGC TCCAAAAGAA
ATTCCAAATG ATGATCCATT ATGTGTAAAT GGTACACTTA ATGCAATTGC ATTTACATCA
GAGCATTCAG GCACACAGAC AATTATTGGA CGTGGTGCAG GAGGCATGGA GACTGCAAGT
TCCATACTAA GAGATTTGCT AGACATTAGA CAAGAGATTG CCAGAACTTG A
 
Protein sequence
MRIILCGFGV VGQSLVKLFE SRAEDLYAKY GLKPRVVGVF DSKGSAMDSS GLELNKLIEV 
KKKFGTVKNY ADTKNTMSGI DMLKNVEADV LIETTASNYK DAEPGMTHII TAMKKGMHVI
SVNKGPLALA FPSLMELATY NRVMFKFSGT VGGGTPILDY AKNSLSGERI TSFAGILNGT
TNYILTNMAT GVSYEDALKD AQDKGYVEAD EALDLDGLDA AAKLVILANW IMGMKVTMPD
INCTGIRKVT TEDIKKAEKN NCAVKLIASC NKELIVAPKE IPNDDPLCVN GTLNAIAFTS
EHSGTQTIIG RGAGGMETAS SILRDLLDIR QEIART