Gene Nmar_1324 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_1324 
Symbol 
ID5774146 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp1213048 
End bp1214319 
Gene Length1272 bp 
Protein Length423 aa 
Translation table11 
GC content36% 
IMG OID641316969 
Productintegrase family protein 
Protein accessionYP_001582658 
Protein GI161528832 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000000102488 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAACAGG TCAATAATCA GATCGAGCAA GAATTGTCAT TTGAGAACAT TTCACTAATC 
AAAAAATATG ACAGGGAGAT GGTATCACAA TCAATTGCCA TTGCAACACG TCAAAAGCAT
CTAAGAACAT TGCTAACACT ATCAAAATTG CTAAAGAAAA ACTGGAAGGA TGTGACCAGG
GATGATATTG ATGACTTGGT ATTTCTGATA ATGGACCAGT TTGCAGATGA AAGTGGTCAG
GAAACACATT ACTCTTATGA TCACAAAAAG ATTCTCAAGA TTTTCTTTAG ATGGTACAAG
CTTGGCTCTA GAGAATTTGT TCAAGTTGGA GATCCACCTG AGACAAAAAA TGTCAAGATG
AAAAAAGTCA AAGACAAGAT TGCACGTGAA GACCTCCTAA ATGAAGAAGA CAGAATAAAG
ATACTGTATG CATGTGGCGA GAATGCAAGA GACAGAGCTC TAATTGATTG TCATATGGAA
GCTGGAACCA GACCAGGTGA GATTCTAAAT TTGAAGTTAA AACATGTAAA GTTTGACAAG
CATGGTTGTG TACTTCAAGT GGACGGAAAG ACAGGAGCTA GAACAATTAG AATCGTAAGG
GCTACTCCAA ACTTGGCTGC ATGGATTGCA GTACATCCAT ACAAAGATGA ACCTGAAATG
CCATTATGGC CAAATATTAG CCATCATAAG AAAGGCAGTC CAATTACATA TGCTGCAGCA
AGACAGATCT TACATAGAAG ATGCAAGATT GCAAATATCT CAAAACGTGT TTATCTGAAT
TTATTTAGAC ATAGTGAGGC CACAACTACA GCAAACTTCA TGACTGAAGC TCAGATGAGA
AAAAGACATG GATGGTCGTC TGACTCTAAA ATGCCTGCAA GATATGTCCA CTTGGTAAAT
TCTGATGTGG AAGATGCAAT CTTCAAGCAC TATGGAATCA AAAAAGAAGA TGAAAAGATG
CCAGAAATGC CTGTAAAGTG TCATTTTTGT GAAATGTACA ATCCATCAGA CAGCGTAACA
TGTACAAAAT GTGGAAAACC ATTGAATCTT GAGAGTGCAA TAAAAAGAGA AGAGCAAGAA
AATGCTGAAA AGAAAAAACT TGAAGAAAAG ATCAAGATGC TAGAGCAAAG ACAGATTGAA
TCAGAAAAGA ATCAGAAAGG ATATTCAGAT TTAAAATCAA TTGTAGATGA ATATTTGAAA
GAATACTTTG AGGACGTATT TGACAAGATA GAGTTTGTAA AGAATCAAAA ACAAAATAGT
ATTACAAACT GA
 
Protein sequence
MKQVNNQIEQ ELSFENISLI KKYDREMVSQ SIAIATRQKH LRTLLTLSKL LKKNWKDVTR 
DDIDDLVFLI MDQFADESGQ ETHYSYDHKK ILKIFFRWYK LGSREFVQVG DPPETKNVKM
KKVKDKIARE DLLNEEDRIK ILYACGENAR DRALIDCHME AGTRPGEILN LKLKHVKFDK
HGCVLQVDGK TGARTIRIVR ATPNLAAWIA VHPYKDEPEM PLWPNISHHK KGSPITYAAA
RQILHRRCKI ANISKRVYLN LFRHSEATTT ANFMTEAQMR KRHGWSSDSK MPARYVHLVN
SDVEDAIFKH YGIKKEDEKM PEMPVKCHFC EMYNPSDSVT CTKCGKPLNL ESAIKREEQE
NAEKKKLEEK IKMLEQRQIE SEKNQKGYSD LKSIVDEYLK EYFEDVFDKI EFVKNQKQNS
ITN