Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A1034 |
Symbol | |
ID | 3785161 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | + |
Start bp | 1195316 |
End bp | 1196197 |
Gene Length | 882 bp |
Protein Length | 293 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 637811118 |
Product | heat shock protein HtpX |
Protein accession | YP_411729 |
Protein GI | 82702163 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0501] Zn-dependent protease with chaperone function |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.3427 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGCGAA TTTTTCTATT TGTAGTTACC AATCTGGCCA TCTTGCTGGT GCTGAGCATC ACATTGCGCT TGTTGGGTGT TGATCGCATG CTTGACGAGC AGGGTGTGGG TATCAATTAC AATTCACTGC TTGTCATGGC GGCAGTAATC GGTTTTGGCG GTTCGCTGAT TTCGCTGGCC ATGTCCAAGT GGAGTGCGAA ACGCATGACG GGTGCGGTCG TGATCGAGCA ACCTTCCGAT CCGACCGAGC GCTGGCTGGT GGAAACTGTC CGGCGGCAGG CTGAGCGCGT GGGTATCGGT ATGCCGGAAG TGGCGATCTA CGAGGCACCG GACATGAATG CCTTTGCAAC CGGCATGAAC CGCAACGAGG CATTGGTGGC GGTAAGTACC GGCTTGCTTC AGGCTATGAC AAAAGATGAA ATCGAGGGAG TGCTGGCGCA TGAAATAAGC CATGTTGCCA ACGGCGATAT GGTGACACTT GCGCTGATTC AGGGTGTCGT CAATACCTTC GTGATTTTTT TATCGAGAGT CATCGGCCAT TTCGTGGACA GGGTCATATT CAAAACCGAG CGGGAGTACG GCCCCGCCTT CATGGTTACC ACACTGATCG CCCAAATGGT ACTTGGGATA TTGGCGAGCA TCATTGTAAT GTGGTTCAGC CGCCAGCGTG AGTTTCGTGC GGATGCGGGT GGAGCGCAGC TGGCTGGACG CAACAAAATG ATCGCAGCGC TTGAACGCCT GCAACGCCGG CACGAGCCCT CCCAGTTGCC GGAGCGGTTG GAAGCATTCG GTATTTCCGG AGGGGAAGGG GGTCTCAGAA CCCTGTTCAT GTCTCATCCG CCTCTTGAAG TGCGTATTGC AGCGCTTCGC GCGATGACCT GA
|
Protein sequence | MKRIFLFVVT NLAILLVLSI TLRLLGVDRM LDEQGVGINY NSLLVMAAVI GFGGSLISLA MSKWSAKRMT GAVVIEQPSD PTERWLVETV RRQAERVGIG MPEVAIYEAP DMNAFATGMN RNEALVAVST GLLQAMTKDE IEGVLAHEIS HVANGDMVTL ALIQGVVNTF VIFLSRVIGH FVDRVIFKTE REYGPAFMVT TLIAQMVLGI LASIIVMWFS RQREFRADAG GAQLAGRNKM IAALERLQRR HEPSQLPERL EAFGISGGEG GLRTLFMSHP PLEVRIAALR AMT
|
| |