Gene Nmul_A1520 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A1520 
Symbol 
ID3786106 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp1739314 
End bp1740297 
Gene Length984 bp 
Protein Length327 aa 
Translation table11 
GC content55% 
IMG OID637811608 
Productheat shock protein DnaJ-like 
Protein accessionYP_412215 
Protein GI82702649 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0484] DnaJ-class molecular chaperone with C-terminal Zn finger domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAATTCA AGGATTACTA CAAAATCATG GGGGTTCCCC GCGACGCCTC ACAGGATGAC 
ATCAAGCGCG CCTACCGGAA ACTGGCGCGC AAATATCATC CCGACGTCAG CAAGGATCCG
CAAGCGGAAG CCCGCTTCAA GGAGTTGGGT GAGGCCTATG AAGTCCTCAA AGATCCGGAG
AAGCGCGTAG CATATGACCG CCTTGGCACA AACTGGAAAG CCGATCAGGA ATTTCGTCCT
CCCCCGGATT GGAATGCCGG CTTCGAGTTT TCACAACAGG GATTTACAGG AGCAGATGCC
GCCCAGTTCA GCGAATTTTT CGAATCCCTG TTTGGGCGCA GTTTTCGTGC CGAGCAAGCG
AGACGCGGAG GAGAGACTCA TGGGGGTCCC GGTGGAGCTT TCTTTCATGC ACCTGGCGAG
GATCGGTATG CCAAAATAAT GATCGATCTG GAAGATTCAT ATCACGGCGC TACCCGCACC
ATCTCGCTGC AAGTACCAGA GGTCGATGCA GAAGGACATG TATCGACGCG CGAACATAAG
TTGAACGTGG TTATCCCCCG TGGTATCCGG CCCAGGCAAT ACATTCGTCT TGCTGGCAAG
GGTGCGCCGG GCCATGGTCA GGGAAAGGCG GGCGATCTGT ATCTGGAAAT CGAGTTTCGC
TCTCATCCCA TCTATCGAGT AGACGAGCAC GACGTCTATC TTGACCTCCC GGTAGCCCCC
TGGGAGGCGG CGTTAGGCGC AACGATAACT GTTCCCACTC CGGAAGGAAT GGTTGACCTG
AAAATACCTG CTGATTCCAC TACCGGACGG AAGCTGCGAC TCAAAGGACG CGGTATTCCC
GGCAAAATAC CGGGTGACTT CTATGTTGTA TTGCGCATTG TGCCACCGCC TGCCACCGAT
GAAAGTGATA AGGCCTTTTA TCGCAGCATG GCGGAGCAAT TCAAATCGTT CAACCCGCGG
GCCAAACTGG GAGTGCAGGC ATGA
 
Protein sequence
MEFKDYYKIM GVPRDASQDD IKRAYRKLAR KYHPDVSKDP QAEARFKELG EAYEVLKDPE 
KRVAYDRLGT NWKADQEFRP PPDWNAGFEF SQQGFTGADA AQFSEFFESL FGRSFRAEQA
RRGGETHGGP GGAFFHAPGE DRYAKIMIDL EDSYHGATRT ISLQVPEVDA EGHVSTREHK
LNVVIPRGIR PRQYIRLAGK GAPGHGQGKA GDLYLEIEFR SHPIYRVDEH DVYLDLPVAP
WEAALGATIT VPTPEGMVDL KIPADSTTGR KLRLKGRGIP GKIPGDFYVV LRIVPPPATD
ESDKAFYRSM AEQFKSFNPR AKLGVQA