Gene Nmar_1684 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_1684 
Symbol 
ID5774319 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp1543991 
End bp1545364 
Gene Length1374 bp 
Protein Length457 aa 
Translation table11 
GC content34% 
IMG OID641317338 
Producthypothetical protein 
Protein accessionYP_001583018 
Protein GI161529192 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value0.996577 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGTCAA TTACAAAACT TTCAATTTTG GCTCTAGCCG CATGTTTGCT GTTTCCAGCT 
AGTAGTGTTT ATGGTCATGG ATTGGGAATT GATACTATAT CTTCAGTAGA TGTTGCTGGA
AAAGAAATCT CAATTTCAGT TGAGATGCCA ATGTACTTTG AAAGTGAGCA AGAACAAATC
ACAATTACTG CAACTGACAA AGAGACAGAC GAGCCTGCAA AAAATGTGAC ATTTCTCATT
GGATTGTTCC ATAGCAATGA GATGATTTTC AGAAACTACT TTTTCACAGA AAATGGTGTT
TTACCAATAA CTGTACTATC ACAACAAGGA TATGATAATT TTGTAATTAA TGGAGAGCAA
GATTCACTTT TGGGAGCATA TCATGCAACT GAATCATCTC CAATAGAGAT TGCAGGCCCC
GTCTTTGATT CAGGTGGATT GTTTACTTTT GAGATTGAAG TTAGAACCAT TGATGAGCCA
ACTAACATCA TAGAAGATTC AGGTGTATAT CGTGCAGATT TGACACTTGT TGAAACCACT
TCTCATCCTC AAGAAGATAC TGAAGGAAAT GATGTAGAAT TTAGAATGAA ATCTTATTTT
GATAAAATCC AAAATTTCCA ATATGATCCT GCAACAAAAC AAGTAACTTT TGAGATGCCT
TTTGATTGGA GTGAGAACAG CATGTCTCAC GTTACAGTTG TGCACGAAGA AGTACATTTT
CCAAAACATT TCATTGAATT TTTGAGTCCT AGTTATTCAG GATACGCAAA TGGAATTGAG
TTGTTCAAAG CTTCAGTATC AATTGATGAT TACACAGAAG AAGATGAGAG AATAGTTCAC
TTTGTTTTAT TGCAAGACCA TCTAAGATTC ATAAAAAATG AGATGAAAAA ATCTGATGAG
CCACTACCAG ACAATATTGT TTTCACTTTA ACTACAAATG AAAAAATATC ATTTCCATTA
GAGGCATTTA CAAAGAGTGA AGACTTCAAA GTAAACTTGT CATGGGATCC TATAGATCTT
GAACCAGGAG TTGAAACAAA CTTTGTCTTT ACTATTAGAG ACGGATGGAC AAATGAACCT
TTAAGAAATT CTGATTATTC TTTTGTAATC ATTCAAAATG GAGCGGAGTT ATACCGGGTA
TCTGGAACTG CAACAGTTGG TGGTGAATTT GAAAAATTCA CATTTGCTGA AGACCAAACA
GGTCCTACAA CAATTAAATT TGAAAACATA CGAAATACTG GACAAGAAAC TGAGTTTGGA
ATAATGGTTG CACCTGAATT TGGTACTATT GCAATTTTGA TACTTGTTGT TTCTATAATT
GGAATAATTG TAATTGCCAG AAAATATGAG ACTTTTTCTC TCATTAGGAT GTAA
 
Protein sequence
MMSITKLSIL ALAACLLFPA SSVYGHGLGI DTISSVDVAG KEISISVEMP MYFESEQEQI 
TITATDKETD EPAKNVTFLI GLFHSNEMIF RNYFFTENGV LPITVLSQQG YDNFVINGEQ
DSLLGAYHAT ESSPIEIAGP VFDSGGLFTF EIEVRTIDEP TNIIEDSGVY RADLTLVETT
SHPQEDTEGN DVEFRMKSYF DKIQNFQYDP ATKQVTFEMP FDWSENSMSH VTVVHEEVHF
PKHFIEFLSP SYSGYANGIE LFKASVSIDD YTEEDERIVH FVLLQDHLRF IKNEMKKSDE
PLPDNIVFTL TTNEKISFPL EAFTKSEDFK VNLSWDPIDL EPGVETNFVF TIRDGWTNEP
LRNSDYSFVI IQNGAELYRV SGTATVGGEF EKFTFAEDQT GPTTIKFENI RNTGQETEFG
IMVAPEFGTI AILILVVSII GIIVIARKYE TFSLIRM