Gene Nmul_A0586 details


Gene Information

Locus tag: Nmul_A0586
Symbol:
ID: 3783984
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 670292
End bp: 672013
Gene Length: 1722 bp
Protein Length: 573 aa
Translation table: 11
GC content: 55%
IMG OID: 637810668
Product: hypothetical protein
Protein accession: YP_411286
Protein GI: 82701720
COG category: [R] General function prediction only
COG ID: [COG4783] Putative Zn-dependent protease, contains TPR repeats
TIGRFAM ID:
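
The coordinate and length fields above are mutually consistent, which can be checked with a small sketch. It assumes 1-based inclusive coordinates and a terminal stop codon that is not counted in the protein length; the function name `cds_lengths` is hypothetical and not part of any database API.

```python
from typing import Tuple


def cds_lengths(start_bp: int, end_bp: int) -> Tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumptions (not stated in the record itself): coordinates are 1-based
    and inclusive, and the last codon is a stop codon excluded from the
    protein length.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per amino acid, minus the terminal stop codon.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


gene_len, protein_len = cds_lengths(670292, 672013)
print(gene_len, protein_len)  # 1722 573
```

Under these assumptions the result matches the listed Gene Length (1722 bp) and Protein Length (573 aa).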


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTCTTA AAGTTCCCGG CGCATTACCG CTGCTCTTTC TTGCCGCCTG TTCACATCTT 
CCTGCCAAGA CTCCTCTCAC GGCGGAAAAA CCGGCAGAGA AGAAGGAGTC AGAGCAGTCA
AGAGTACCCA GTCAGGATTT GACCCCCTCC ATGCTGTTCG ATTTTTTGCT GGCGGAGACA
GCGCTGCAGC GGGGCGACAC GGAAATCGGT CTTCGTACCT ATCTCAAGCT TGCAAAAAAC
ACTCAGGATC CGCGCGTGGC CCAGCGCGCC ACGGAAGCTG CGCTTCAGGC GCGCCAGCCG
GCATTCGCCC TGGAAGCGGC AAAAATCTGG ACGGAACTCG ATCCGGAATC GATTCCAGCC
CGTCAGATGA TGGCTGCATT GCTGGTGCAT TTTGACAGAT TAGATGAGGC CCGTCCCCAT
CTGGAAAAAT TGCTGGCGGT TGCGGGAGAC AAGATCGATG ATGCTTTCAT GCAACTGAAC
AGCCTGCTGG TGCGCAGTCC CAACAAGAAT GCAATCTTCG AATTGGTAAA GCAGCTCGCG
CAACCTTATC CCGATCTGCC GGAAGCGCAT TTTGCCGAGT CCCAGGCTGC ATGGTTTGCC
GAGCGTTTCG ACATCGCTCT GGAAGAGATG AAAAAAGCGC TGGCGCTGAG GCCTGAGTGG
GAGATGGCGG CAATTTACGA AGGACGTATT CTGTCGCGCG AATCCAACGC CCGCGCAATC
GAATTTTTCG ATGATTATCT CAAGCGCTAT CCCAAAGCCA ACGATACACG CATCACCTAT
GCACGCTTGC TGCTAGCTGA AAGAGATTAT AGCAAAGCCC GCGAGCAGTT CCAGAAGTTG
CTGACAGAAA ATCCGGATAA CCCGGATGTT GCCATTGCCG TCGGCCTGTT ATCCCTGGAG
CTGCAGAACT ACGATGTAGC CGAATCGAAT TTCAAAAGAG CGCTGGAGCT GGGTTACCGG
GACCCGGGAA TGGTGCGCTT TTATCTCGGC GGCATCAGTG AAAAAAAACA GCAGATCCCG
CAGGCTTTGA ATTGGTACCG CTCGGTTACG GAGGGAACCC AGTTTATCCC GGCACAGATC
AAATATGCCA TTCTGTTGAG CAGAACCGGC AAAACCAAGG AGGGTCTCCA TCATTTGCAG
CAACTGCCAG TGGCCAATGA CCAGCAGCGT GCGCAGGTGA TCATCGCCGA GGCGCAATTG
TTGCGCGAGT CCGGCGCTTA CAAGAAAGCC TTTCAGCTCC TGAGCAGCAG TCTCGAAAAA
CTTCCCGATT CCCCCGAACT GCTTTATGAC CGCGCCCTGG CTGCGGAGAA AATAGGCAAG
GCGGATATCA TGGAGCAGGA TCTGCGCAAA CTGATCGAAC TGAGGCCCGA CCATGCTCAC
GCCTACAACG CCCTGGGTTA CGGTATTGCC GAGCACTCCA GCAAGCGCCT GCCGGAGGCG
CTGGAATTGA TCGAGAAGGC GATCAAGCTT TCGCCTATCG ATCCGTACAT CATCGACAGT
CTGGGATGGG TGCATTACCG CATGGGGGAC ATCAACCAGG GATTGAGCTA CCTGCGACAG
GCGTTTGCAA TGAATCCCGA TCCGGAAATT GCTGCTCACC TGGGGGAAGT GTTGTGGGTG
CAGGGAATGA AGGACGAAGC GAAAGAGATC TGGCAGACGG CCCTCAAGAA TCATCCCGGT
AACGAAGCGT TGATTGGCGT GATGAAGAAG TTCATGAAGT AG
 
Protein sequence
MSLKVPGALP LLFLAACSHL PAKTPLTAEK PAEKKESEQS RVPSQDLTPS MLFDFLLAET 
ALQRGDTEIG LRTYLKLAKN TQDPRVAQRA TEAALQARQP AFALEAAKIW TELDPESIPA
RQMMAALLVH FDRLDEARPH LEKLLAVAGD KIDDAFMQLN SLLVRSPNKN AIFELVKQLA
QPYPDLPEAH FAESQAAWFA ERFDIALEEM KKALALRPEW EMAAIYEGRI LSRESNARAI
EFFDDYLKRY PKANDTRITY ARLLLAERDY SKAREQFQKL LTENPDNPDV AIAVGLLSLE
LQNYDVAESN FKRALELGYR DPGMVRFYLG GISEKKQQIP QALNWYRSVT EGTQFIPAQI
KYAILLSRTG KTKEGLHHLQ QLPVANDQQR AQVIIAEAQL LRESGAYKKA FQLLSSSLEK
LPDSPELLYD RALAAEKIGK ADIMEQDLRK LIELRPDHAH AYNALGYGIA EHSSKRLPEA
LELIEKAIKL SPIDPYIIDS LGWVHYRMGD INQGLSYLRQ AFAMNPDPEI AAHLGEVLWV
QGMKDEAKEI WQTALKNHPG NEALIGVMKK FMK
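
The sequences above are printed in blocks of 10 residues, 60 per line, so the whitespace has to be removed before they can be used in other tools. The following is a minimal sketch of that cleanup plus a simple GC-percentage calculation; the helper names `clean_sequence` and `gc_percent` are hypothetical, and the example input is only the first line of the gene sequence.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First line of the gene sequence, copied as displayed (10-base blocks).
first_line = (
    "ATGAGTCTTA AAGTTCCCGG CGCATTACCG CTGCTCTTTC TTGCCGCCTG TTCACATCTT "
)
dna = clean_sequence(first_line)
print(len(dna))         # number of bases on that line
print(gc_percent(dna))  # GC percentage of that line only
```

Running the same two functions over the full gene sequence gives a way to compare against the Gene Length and GC content fields of the record.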