Gene Nmar_1763 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_1763 
Symbol 
ID5773569 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp1609931 
End bp1611076 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content31% 
IMG OID641317418 
Productclass V aminotransferase 
Protein accessionYP_001583097 
Protein GI161529271 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0520] Selenocysteine lyase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones59 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATTTAG TATCAAAGGA TATTTCAGAT GACTTTCCAA ATTCAGATAA AATCTATCTA 
AACAATGCAT CAGTATCCCT AATGCCTATT CAAAGTATTG AGGCAATGAA AGATTTTCTA
ATTTCTTACA ACTCTCTTGG ACCTGATTCA AAAGAATCAG AGTCATTCGT AACTGAAAAA
CTAAGAGATG TAAGAAAAAC TATAGCCAAA ATTATCTCAT GTCAACCTGA TGAAGTAGTT
CTAACTCAAA GTACTACTGA TGGAATCAAT ATTGTAGCAA ATGGACTTTC ATTTGATGAA
AAATCAAATG TAATTATTCG TGGAATGACC CATGAACATC ATTCAAATTT TTATCCCTGG
TTAAAACTAA AAGAAAAAAT CTCTCTAAAG AATCTCTCAA TTGATAAAGA TGGATTTTTC
AAATCTGAAG ATTTAGAATC ATTACTTGAT GATAATACAA AATTAGTTGC TCTTAGTCAT
GCTTTGTACA ATACTGGTTC TATTTTGCCT TTAGAAGAAA TCACAAAACT ACTCAGTGAT
GTGCCTCTAT TTGTTGATAG TGCACAAACT GTAGGATGTA TTGACGTTGA TGTTTCAAAA
ATAAATTGTA ATTTTATGTC TTTTAATGGA TCAAAATGGC TTTGTGGTCC AATGGGAACT
GGATTGTTTT ATTGTAATAG AAAATCAAGT GAATTGTTAG AACCAAAAAC TATTGGGGGC
GAATCTGCAA TTATCTATGA TGATACCAGT TTAGCATTCA AAGAACTTCC TGATAAATTT
CAAACTGGTT TTAGAAATTA CGTTGGAATT GTTGGATTGG AATCTTCTGC AAACTATTTG
CTTAATTTTG GTCTCAAAAA TATACGTGAA AAAAATCAAT ACTTGTCAAA TCTTCTAAGA
GAAGAACTAT CAAAAATTCC AAAAATTATT TTGTATGGTC CTGAAGATCC TAATTCTAGA
ACAAGTATTG TGTCTTTTAA CATAGATGGA ATGGATTCAC AAGAGGTTGT TGATAGACTT
GAAAAGCAAA ACATCGTCTT AGCTCTAAGA GAAATTATGG AAACAAAGAT TGTGCGAGCT
TCACCTCATT TCTTTAACTC AGAATCTGAA ATTATGTCTG TAGTTGATGC AATAAAGAGA
CTATAG
 
Protein sequence
MNLVSKDISD DFPNSDKIYL NNASVSLMPI QSIEAMKDFL ISYNSLGPDS KESESFVTEK 
LRDVRKTIAK IISCQPDEVV LTQSTTDGIN IVANGLSFDE KSNVIIRGMT HEHHSNFYPW
LKLKEKISLK NLSIDKDGFF KSEDLESLLD DNTKLVALSH ALYNTGSILP LEEITKLLSD
VPLFVDSAQT VGCIDVDVSK INCNFMSFNG SKWLCGPMGT GLFYCNRKSS ELLEPKTIGG
ESAIIYDDTS LAFKELPDKF QTGFRNYVGI VGLESSANYL LNFGLKNIRE KNQYLSNLLR
EELSKIPKII LYGPEDPNSR TSIVSFNIDG MDSQEVVDRL EKQNIVLALR EIMETKIVRA
SPHFFNSESE IMSVVDAIKR L