Gene Nmar_1444 details


Gene Information

Locus tag: Nmar_1444
Symbol:
ID: 5773514
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosopumilus maritimus SCM1
Kingdom: Archaea
Replicon accession: NC_010085
Strand:
Start bp: 1314106
End bp: 1315857
Gene length: 1752 bp
Protein length: 583 aa
Translation table: 11
GC content: 36%
IMG OID: 641317091
Product: hypothetical protein
Protein accession: YP_001582778
Protein GI: 161528952
COG category:
COG ID:
TIGRFAM ID:
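
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Coordinates as listed in the record (assumed 1-based, inclusive).
start_bp = 1314106
end_bp = 1315857

# Inclusive coordinates: length = end - start + 1.
gene_length = end_bp - start_bp + 1

# Each codon is 3 bp; the final (stop) codon is assumed not to encode a residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # gene length in base pairs
print(protein_length, "aa")  # protein length in amino acids
```

Under these assumptions the computed values agree with the listed gene length of 1752 bp and protein length of 583 aa.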


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 44
Fosmid unclonability p-value:
Fosmid hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGTCTCAAA ACGGAAACGG TTTACTAGAG TACATACCTG GCTCTCACAC ACTGCTAGTT 
CAAAAAAATT CCAGCCCACC TCTTGAAGGA TTTGCTGAAA ATATCAGAGG AAGCATACAC
GAGTATGCTG AGAATTCAAA GAGCGACGTT GAGAAGGGAA ACAACTTTCT TCATTGGATA
CTGACAAGAG TATTTGAGGC AACAGAAGAT GATGCTGCTG ATGCAATTGT AGATGGTGCA
AACGATCTAG GAATTGATGC ATACTTGCCA GTAGACTTTT CAGATAACAC GATTAGATTA
TTTCAATCAA AATACGGGAC ATCTCATTCA CTTGAAGCAA TTGCAAAGTT CAAAGAAGAT
GCAAAGAGAC TGCTTGCAAA AGACGTTACA AAGATGAGAC CAGAATTGGC TCAGCTTGTT
ACAAAAATCA AAGAAAAGAA TCTCAAAGTA AAGTGCTGTT ATGTTACTGA CCAAAAAGTA
GACTATCATG ATGAATTTGT TGAAGTAATT GATGAGGAGA AGATCATTCA AAAACTCTGG
GACAGAATAA AGAAACCAGC TGCAGGCAAA AAGTCATCAA TACGACTAGA AAAGATGCTC
AGACACGAGA ATACTATTCT AGGAATTTTG AAATTACGTG AATTAACAGA GTTTGTTAGC
AAGAACAGAG ACTATGTCTT TGAATCAAAC ATCAGACAAT GGATGCAGTT CAAGACTACG
GTAAACAAGG GATTACGAGA CACATTGCAG AGTAACCCTA ACAAGTTCTT CTTTTACAAT
AACGGAATTA CAATTGTAGT AAGCGATTTC TCAGAATTGG GCGAGAACAT GATAGAGCTT
CATGCACCGC AAATTGTCAA TGGTGCACAG ACATCAAACT CTATACTAGA TCATTCAAAG
AGAACAAAGA ACATGGATGG CTCCATGACA GTTACAATAA TCAAAGCTGA TGATGAACAA
GAACAAAACA ACATTACAAA GTATAGAAAC TCGCAAAACT CTGTCAGAGG AAAAGACTTG
GTTTCTTTGA TGGACTTTCA CAAGTCAATT AAATCACAAT TAAAAAGCTG TGGATACTTT
TATGAAATTC AAGCAGGCTC TTTTGATACA AAATCAAAAT CAAAACAGTG TGACTATGCA
GGAGACTCTA CATACAACAA CTATCTTCCA GACAATCACA AAAAAGTAAT CGTTGCAAAA
GATGCAATTC AATGTCTTGT TGCAGGAATT GAACAAAGAC CAACTGAAGC TTATAGTTCA
CCAGCTCAGT TTCTTCCAAG AGGAAGCAAG TATGATGATA TCTTTAATGA CAATCTAAAG
GATGATTACA GAATTTTGTT GTATCCATAC TTGGTAAAAG AATATGCAAA AAAATCACTA
AAGTATGGAA AGAAAGGAGG TCACAAGACA AAGAGATATG CAACTCTGTT CTATGTTGCA
GTATACTTTA GAATTCTACA CAAAAAAATT CTCGAATCAA AGGGCGATTT TAAGGGAGAT
ATCAGGAAGA TAGAGCCAGT TTTTCGCAGT TTCAAACTAA ATTCTCGAAT TTTAAAGCTA
GCTGACGTCA TTGTTACCAA ATTCCTTGAG GATACAGTAG TTGATGATGA AATCGAAATG
GCAAACACAA AGCACAACTT TTTCTCTCAA CACGTATGGA ATGACACGAT GCTTCGGGTA
ATTGACAAGA AGATAAGACA AGAAGAAGAT GAAATCTTGG CACTAAAGAA ACTCACAAAC
AGTTTGTTGT GA
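
The GC content field can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted in as text with the same space-separated 10-base groups used above. The function name gc_percent is hypothetical, and the short input in the usage line is the first group of this gene, not the full sequence.

```python
def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string.

    Whitespace (spaces, newlines) between base groups is ignored.
    """
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Usage on the first 10-base group of the gene sequence.
print(gc_percent("TTGTCTCAAA"))
```

To compare with the value reported in the record, pass the full gene sequence to the function and round the result.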
 
Protein sequence
MSQNGNGLLE YIPGSHTLLV QKNSSPPLEG FAENIRGSIH EYAENSKSDV EKGNNFLHWI 
LTRVFEATED DAADAIVDGA NDLGIDAYLP VDFSDNTIRL FQSKYGTSHS LEAIAKFKED
AKRLLAKDVT KMRPELAQLV TKIKEKNLKV KCCYVTDQKV DYHDEFVEVI DEEKIIQKLW
DRIKKPAAGK KSSIRLEKML RHENTILGIL KLRELTEFVS KNRDYVFESN IRQWMQFKTT
VNKGLRDTLQ SNPNKFFFYN NGITIVVSDF SELGENMIEL HAPQIVNGAQ TSNSILDHSK
RTKNMDGSMT VTIIKADDEQ EQNNITKYRN SQNSVRGKDL VSLMDFHKSI KSQLKSCGYF
YEIQAGSFDT KSKSKQCDYA GDSTYNNYLP DNHKKVIVAK DAIQCLVAGI EQRPTEAYSS
PAQFLPRGSK YDDIFNDNLK DDYRILLYPY LVKEYAKKSL KYGKKGGHKT KRYATLFYVA
VYFRILHKKI LESKGDFKGD IRKIEPVFRS FKLNSRILKL ADVIVTKFLE DTVVDDEIEM
ANTKHNFFSQ HVWNDTMLRV IDKKIRQEED EILALKKLTN SLL
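
The relationship between the gene sequence and the protein sequence can be sketched as a translation with NCBI translation table 11, as named in the record. In that table the codon-to-amino-acid mapping matches the standard code, but several alternative start codons, including TTG, are read as methionine when they initiate the CDS. The code below is an illustrative sketch written without external libraries. The function name translate_cds is hypothetical, and the inputs are only the first and last few codons of the gene sequence above.

```python
from itertools import product

# NCBI codon ordering: bases T, C, A, G at each of the three positions.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    A table 11 start codon in the first position is translated as M.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is read as methionine
        if aa == "*":
            break  # stop codon ends translation
        protein.append(aa)
    return "".join(protein)


# First seven codons of the gene sequence (starts with TTG).
print(translate_cds("TTGTCTCAAA ACGGAAACGG T"))
# Last four codons of the gene sequence (ends with the TGA stop codon).
print(translate_cds("AGTTTGTTGT GA"))
```

The first call yields MSQNGNG and the second yields SLL, which match the beginning and end of the protein sequence listed above.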