Gene Nmar_0471 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_0471 
Symbol 
ID5773644 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp423259 
End bp425484 
Gene Length2226 bp 
Protein Length741 aa 
Translation table11 
GC content34% 
IMG OID641316103 
Productcopper amine oxidase-like protein 
Protein accessionYP_001581805 
Protein GI161527979 
COG category[R] General function prediction only 
COG ID[COG4880] Secreted protein containing C-terminal beta-propeller domain distantly related to WD-40 repeats 
TIGRFAM ID[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.0141512 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATAAAAT TGAGTTCAAA AATAATAATC CCAATTGCAG TAGCCATTTC GGTAATAGTA 
ACAGCAGGAA TAATGTATGC AATTGGTTTT GAACAAGAAC CACAGATTGT AGAAGTTCCA
ACTCCTGAAA TTGTCTATGT AGACAAATCA GCTTCAGAGT TTTTTGAAGG AACAAACGAC
ATCAAACAAA TATCATCACA AGAAGAACTA GTATCAATTC TTGAGGCATC ATCATTGTTT
GGTGGGGGAT TTTATGATAC AAGATCCATC AGAACTATGG TAGTAGCAGA ATCTGCAATG
TTTGATAGTG TAGAATCTGT TGCAGTATCA GCACCACAAG CAGAATTCAA AAGTGATGAC
GGAGGATCAG ATTATTCAAC AACTAATGTT CAAGTAGAAA ATGTAGATGA GCCAGACTAT
CTGAAAAATG ATTCAAAATA TGTATACATT GTTTCAAGAA ACACATTATC AATAATTGAT
GCATATCCAG CTGAAGATGC AAGATTAATT CTAAAAATTG CATTAGACAT TGAATCACAG
TACATCCAAA ACATGTTCCT CAATAAAGAC AGACTAGTGA TATTTTACAA TGGACAAAGT
GATGAGGAGA TAATCCCACA GTTTGACTTT ATTCCAAGAC CATCATACAA TCCAGTCACT
CACGCATTAA TTGTAGATGT GTCTGACAAG GAAAATCCAA CCATTCTCAA AGATTACTCT
ATTGATGGCC ATTTTAGAGA TGCCAGAATG ATTGGAGATT ATGCATACTT TGTTACAAAT
AATCACATCA ACCACCAGAA TCCCAGACTT CCAATAATAA TGGAGGATTC TGTTAGAATT
ATGACTCCAG ACGCATTTTA TTTTGACAAT GTTGAAGAGT TTTCAAACTT TAACACACTA
ACTGCAATTG ACATATTTGG AGATACAATA AACTCTGAAA CCTTCTTGAT GGGTTATTCA
GGAACATTTT ACGTATCTGA GAATAATTTC TACTTGACTT ATCAGCAAAA CATGCCATTT
GGATATTATG AGAATTCATC ACGCGATAGA TTCTACGATG TAATAGTTCC ATTACTTCCA
AATGACATCC AAGATGAAAT AAAGAGCATC CAAAATGATT CTTCATTGAA TTCATCTGAA
CAATGGGTAA AGATTTCAGA ATTGATGCAA AATTCCTACA ATGAAATGAG CAAAACAGAC
AAGGAAGAAT TGTTTGAAAA GATTAGAGAA GCACTAAATG AGTATGATGC AAAGATTCAA
GAAGAAAATA GAAAGACAAT CATTCACAAA ATTTCAATTG ATGAAGATAA AATAGAATAT
GTTGCAAAGG GAACAGTACC AGGTAGATTA CTAAACCAAT TCTCAATGGA TGAATCAGGA
GATAGATTCA GAGTTGCGAC AACAATAGAG TATTACATTC AACATGAAGG AACAATCCGC
TCAAATGCAG TATATGTCCT AGATGAACAA CTCAACATAG TAGGAGAACT AGAAGACATT
GCACCTGATG AGAGTATTTT CTCATCAAGA TTCATGGGAG ACAGGCTCTA TTTGGTAACA
TTTGAGCAAA TCGACCCATT CTTTGTAATT GATCTATCAA AGGATACACC AAAGATTTTG
GGAGAATTAA AGATACCAGG ATTCTCAAAT TATTTGCATC CATTTGATGA GGATCATGTA
ATTGGAATTG GACGAGATAC CAAAGTAGAT GAAAATGACA GAGTTCAACA ATTGGGAGTA
AAGGTTGCAT TATTCAATGT GGCAGATGTG AGCAACCCCA AAGTGTTAGA TGACTTTGTA
ATTGGAGACA GATCAAGCCA TTCTGAAGCA CAATACAATC ACAAGGCATT CTTCTTTGAC
AAATCAAGAA ATGTATTATC AATTCCAATA AGTGGAGATT CAGATAGATT AGAGCACATT
ACATCAAAGA TGTTTGCACC AGAATACAAC CGTTGGAGTG GATTCTATGT ATTTGATGTT
GATAGTACAA ATGGATTCTC CATTAAAGGA ACAATCACAC ATTCAGATAG TGACTCCAGA
TACTATGGAA TGGGAGATGC AAGAACATTC TACATTGACG ATGTATTGTA TACAGCATCT
CAAGGATATC TAAAGATGAA TTCATTTGAG AATCTAGAAG AAATCAACAC AATCAAACTT
GAAAACACTG GCAAGTTTAT TGATTATTTA GAAGAACCAA TGATGGAAGT AGAACCTGTT
AGATAA
 
Protein sequence
MIKLSSKIII PIAVAISVIV TAGIMYAIGF EQEPQIVEVP TPEIVYVDKS ASEFFEGTND 
IKQISSQEEL VSILEASSLF GGGFYDTRSI RTMVVAESAM FDSVESVAVS APQAEFKSDD
GGSDYSTTNV QVENVDEPDY LKNDSKYVYI VSRNTLSIID AYPAEDARLI LKIALDIESQ
YIQNMFLNKD RLVIFYNGQS DEEIIPQFDF IPRPSYNPVT HALIVDVSDK ENPTILKDYS
IDGHFRDARM IGDYAYFVTN NHINHQNPRL PIIMEDSVRI MTPDAFYFDN VEEFSNFNTL
TAIDIFGDTI NSETFLMGYS GTFYVSENNF YLTYQQNMPF GYYENSSRDR FYDVIVPLLP
NDIQDEIKSI QNDSSLNSSE QWVKISELMQ NSYNEMSKTD KEELFEKIRE ALNEYDAKIQ
EENRKTIIHK ISIDEDKIEY VAKGTVPGRL LNQFSMDESG DRFRVATTIE YYIQHEGTIR
SNAVYVLDEQ LNIVGELEDI APDESIFSSR FMGDRLYLVT FEQIDPFFVI DLSKDTPKIL
GELKIPGFSN YLHPFDEDHV IGIGRDTKVD ENDRVQQLGV KVALFNVADV SNPKVLDDFV
IGDRSSHSEA QYNHKAFFFD KSRNVLSIPI SGDSDRLEHI TSKMFAPEYN RWSGFYVFDV
DSTNGFSIKG TITHSDSDSR YYGMGDARTF YIDDVLYTAS QGYLKMNSFE NLEEINTIKL
ENTGKFIDYL EEPMMEVEPV R