Gene Nmar_1103 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_1103 
SymboluvrC 
ID5773952 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp1005416 
End bp1007002 
Gene Length1587 bp 
Protein Length528 aa 
Translation table11 
GC content32% 
IMG OID641316745 
Productexcinuclease ABC subunit C 
Protein accessionYP_001582437 
Protein GI161528611 
COG category[L] Replication, recombination and repair 
COG ID[COG0322] Nuclease subunit of the excinuclease complex 
TIGRFAM ID[TIGR00194] excinuclease ABC, C subunit 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTTTTG ATATCTCTAA AATCACAATT CCCACAGATC CTGGAATATA TTTGATGAAA 
GATTCTGATG GAACAATAAT TTACATTGGT AAGGCAAAGA ATCTCAAAAA ACGTGTAAAG
TCATATTTCT TAAAAAATCA AAATTACAAA ACACAAAAAC TGGTTCAAAA CATTTCTGAT
ATTGAATTTG TTTTAACTGA TAATGAAAGT GAAGCATTTC TTTTAGAATC AAATATGATC
AAAAAATACC GTCCTAGATT TAACATCGAG TTAAAGGATC AACAACGATA CACTTACCTT
AGAATATCTG ATGAAAAATA TCCACGATTG TTAGTTGCAA GACGAACAAG AGATGGAAAA
TTTCTTGGTA AAGGAAAAAC TTTTGGTCCT TTTACTCAAG GTAGCTCAAA ACTGCTTACA
ATTGGTGCAT TACGTAAAGC ATTTCAAATC CGAATTTGTA AAACATTGCC AAAAAAAGTC
TGCCTTGAAT ATCATTTGGG TAATTGTGAA GGTCCTTGTG AGTTTAATGA TGCTCAGGAT
AGATATCCAA AACATGTTGC TGCATTAGAA GAAGTTCTAA AAGGAAAAAA CCAGACAAAA
ATCTTTACTA AAAAATTAGA AGAAGAGATG CATCAGGCAG CTGAATTGCA ACAATTTGAG
CGTGCAAAAG ATATTCGTGA CACTCTGATT AGACTTGGCA GTCTTCAGAC AAAACAAAAA
ATGGAATATG TAGAAAATTC TGATGAAGAA TATTTCGGTA TTGGAATTCA AGAACAATCT
GCAACTGTGA TGAATTTTAG AATGATTAAT GGTGTTATTC GAGATAGTGA CAAATTCTTT
TTTGATCTAG TTGGTGACAA CTCTTTTTCA AATTTTCTTT ATCAATACTA CTCAACACAC
AAAATTCCAA AATACATTAT TGTAAGTGAA CTTCCTGAAA ATCAAAAACT TTTGGAATCT
TTACTTTCTG AACAAGCTGG TTTTTCTGTA AAAATTTCTA CTCCGACAAA GGGCAAGAAA
AAAGACATTA TGAATTTGAT TTTGAAAAAT ATCAAACTAA TTCATTCTAA AGGTGGAGAA
CCTGGATTAG TTGAATTAAA AGATATTCTG CATCTACCTG TAATTCCAAA TGTTATCGAA
TGTTTTGATA TATCTAATCA TGGTGAAGAT TTTGCAGTTG GATCCATGGC CCAATTTGTT
AATGGTAAAC CAAACAAATC TGGATACAGA AAATTCAAAA TCAAAACTGT GTCTGGTAGA
GATGATTTTG CAATGATTGG TGAAGTTATC AAGCGAAGAT ACTATAGATT GTTAGAAGAA
AATTCTGAAT TACCTGATTT GATAGTAATT GATGGTGGTA AAGGACAACT TAATGCTGCT
ACAAAGTCTT TACAGTCTCT TGGATTGAAA CTACCTTGTA TCTCTTTGGC AAAAGAAAAT
GAAGAAATCT ACTTGCCTAA AACCAAAAAT CCTGTTACTA TTGCAAAGAA CAAACCCTCT
CTTCAAATTT TACAGCATGC TAGAGATGAG ACTCACAGAT TTGGCGTGGC ATACAATAGG
ACTATTAGAA AAAATCAGAT AAAATAA
 
Protein sequence
MTFDISKITI PTDPGIYLMK DSDGTIIYIG KAKNLKKRVK SYFLKNQNYK TQKLVQNISD 
IEFVLTDNES EAFLLESNMI KKYRPRFNIE LKDQQRYTYL RISDEKYPRL LVARRTRDGK
FLGKGKTFGP FTQGSSKLLT IGALRKAFQI RICKTLPKKV CLEYHLGNCE GPCEFNDAQD
RYPKHVAALE EVLKGKNQTK IFTKKLEEEM HQAAELQQFE RAKDIRDTLI RLGSLQTKQK
MEYVENSDEE YFGIGIQEQS ATVMNFRMIN GVIRDSDKFF FDLVGDNSFS NFLYQYYSTH
KIPKYIIVSE LPENQKLLES LLSEQAGFSV KISTPTKGKK KDIMNLILKN IKLIHSKGGE
PGLVELKDIL HLPVIPNVIE CFDISNHGED FAVGSMAQFV NGKPNKSGYR KFKIKTVSGR
DDFAMIGEVI KRRYYRLLEE NSELPDLIVI DGGKGQLNAA TKSLQSLGLK LPCISLAKEN
EEIYLPKTKN PVTIAKNKPS LQILQHARDE THRFGVAYNR TIRKNQIK