Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A0521 |
Symbol | |
ID | 3784510 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | - |
Start bp | 591768 |
End bp | 594557 |
Gene Length | 2790 bp |
Protein Length | 929 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637810603 |
Product | organic solvent tolerance protein |
Protein accession | YP_411221 |
Protein GI | 82701655 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1452] Organic solvent tolerance protein OstA |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAATTGC GTTTTATCCG TTCCGCTGGC TGGTTGTTTT TGTTGTTTTG CCTTGCCTGC AATGCGCGTG CCGATCTGCC GCCGCTTTCT TCAAAGCCGG AGCAGGGACG CGCAACCCCT TCTGGTGAGG GTGACGATAA ACCGGTGGTT ATCGATACGG AACGCATCCG GGGCCATCAT GAATACGAGT CAGGCACCAG AAGCGAGAGT GAATTGCGCA GTCGCTCGAC CATTTCAACC GACCAGATAA AAAAACCCAA CCAAAAAGCG GACCCCGCCG CAAAAGACAC GCCATCCGCG CCTCAGCAGA ACTATACACT ATCCCCCGCG ATAAAAACCG ACTCCCGGAC AGGCACCTCA GCCCAGGAAT CGGAGAAGGC GGAAAGTATG GTTTTACCGG GTGGCGTGGA ACGCCTTCCC GGGCCTGCCG CAGAAGAAGG AGAGCCCAGG CTGCGCACCA GGACCCAGTC CGCACCGCGC ACTTTATCCG CGCAAAAGCG CGGGGAGAAG CCCGCGAAGA CACCTGCACC GGCCGAAGCA GACCAAGATA GACCGGGGTT TGCAGAAGGC GAGCGCATTG GAGGTCACAG GGAAGAAGCA GGCGACGAAA AGCTGCGTCT TGCCGGCGAG ACTGAGCCCG AGGCAATCGA GCAAAAACTG GCAGAAGCTG AAGCGGAAAC GGACAAACAG TCCCCCGTGT TCGTGGTTGC GGATCGCTTG CAAGGCCACG TGGAGGAGGA AATCGAAGCG ATAGGCAAGG CGGAACTGTC TGCCGGCCCT CAGTTTATTT CCGCCGAGCG GATGAAATAC AACCAGGGCA CCAACGATGC CGAAGCCCAG GGCAACGTCC GTGTGGAAAA GGAAGGTGAC ATCCTGGAAG GATCCGATCT CAAATTCAAT CTGCTGAGCA AAACGGGCCA GTTAAGCGAA CCCAGCTATC GTCTGAAGGA TGCGAGCAGT CGCGGTTATG CGGGCATGCT CCTGTTCGAA GGCGAGAACC AGTACCGCCT GCAGAAGGCC AGTTATACCA CGTGCCCCGT GGGAGACGAC AGCTGGGTTC TCCAGGTGGC CGACCTGAAG CTCGACAATG ACAAGAAAGT GGGCACCGCC AAAAATGTGA AGCTCACCTT CAAGGATGTG CCGATACTGT ATACCCCCTG GATGAATTTC TCATACAGCG GCGAGCGCAA ATCAGGATTG CTGGCGCCGA CCTACGGTAC CGGCAGCAGG ACCGGCCTTG AACTGGCTGT ACCCTTCTAC TGGAACATCG CCCCCAACTA TGACGCCACG TTTTCCGCAC GCCTGATGTC AAAGCGCGGC CTGGCGATCA ACAACGAATT TCGCTTTCTG GGCCAAAACT CGAGCAGCAA TCTGCTCGCC GACATCGTGC CTCGTGACCT GGATACGCAA ACGACGCGGT GGCGCACGTC GTTCTGGCAC AATCATTATC TGGGCGCTGG TTTTTCCGCT CGCCTGGATT ACAACAGGGT GTCGGATGCA ACCTATTTTC GCGACTTTGG CAACAACCTG AATCTCACAT CCCGCACCAA CCTGCTGCAG CAGGGATTGC TGTCTTACAA TCGCGGGCTG GGGGATGACG GCACATTTAA CGTAACCTCG CTTGTCCAGA GCTTCCAGAC GATTCAGGAT CCCCTGGCCG CAATTGTCGT GCCTTACAAA CGCCTGCCCC AGGTGGGATT GAACGCGAAT AAGCCGGACG TCTTCGGAAC GGGGGTCGAT GTCAATCTTT CCGGGAGCTG GACCAACTTC TCCCACCCCA CCCTCGTCAA CGGCAGCAGG ACCGTGCTCT TCCCAAGCAT GAGCTACCCT CTTCGCAATT CGTTCGGTTT CATCACGCCC AAGGTGGGGA TGCACTACAC CCGTTACAGC CTCGGGGAGG GTGCCGGCGT GTCCGAGGAA AACCCCACCC GCACCTTGCC GATATTCAGC CTCGACAGCG GGCTTGCCTT CGATCGCAAA ATGTCGCTGG GCGGAGAAAG CTTTACGCAG ACGCTCGAAC CGCGGGTGTT CTATGTTTAC GTCCCATTCC GCGCGCAAGA TCAGTTGCCG AATTTCGATT CCGCCAAGAC TGATTTCAGC TTTGCCCAGA TGCTGGCGGA AAACCGTTTC AGCGGGAGCG ACCGTATCAA TGATGCCAAC CAGGTGACTT TTGCCCTGAC GACCCGCCTG CTGGAATCCA GTACCGGGAG GGAGCGTTTG CGTTTGGCGG TCGGGCATCA ATTAAGCTTT ATCGATCGCC GGATCACACT GGAGACCCCG CAAACCATCG ATCGCCGACC TGATTTTATT GCCGCAGTGT CGGGTTTTCT TACACCGACC ATCAGTACTG ACACCAGCTT CCAGTTTGAC CAGACGCGCC TGCTAGCGGA TGTGGTCCGC TCGGGTGTGA GCTATCGTCC GGAGCCGGGT CGCGTGTTGA ATTTCGGTTA CCGTTTTACC CGGGATGTGC TGCATCAGGT GGATGCTTCC AGCCAATGGC GATGGTCGGA AAGATGGCAG ACGGTGGCCC GCCTGAATTA CTCGTTACAG GATAAGAGAA TTCTGGAAGG GCTGGCAGGA GTTGAGTATA ATGCCTGCTG CTGGTCGTTG CGGTTTGTGC TCCAGCATTT GACCCTTGCT ACGCAGAAAT CGACCACAGC GGCTTTTTTG CAACTTGAGT TGAACGGCCT GATGCAAATC GGATCGAACC CGTTGACCGT ATTGCAACGC AGCATTCCCG GGTATATCAG GACGGGTAGC CAGGGAAGCG GCTTGATAGA AGGGCCATAG
|
Protein sequence | MKLRFIRSAG WLFLLFCLAC NARADLPPLS SKPEQGRATP SGEGDDKPVV IDTERIRGHH EYESGTRSES ELRSRSTIST DQIKKPNQKA DPAAKDTPSA PQQNYTLSPA IKTDSRTGTS AQESEKAESM VLPGGVERLP GPAAEEGEPR LRTRTQSAPR TLSAQKRGEK PAKTPAPAEA DQDRPGFAEG ERIGGHREEA GDEKLRLAGE TEPEAIEQKL AEAEAETDKQ SPVFVVADRL QGHVEEEIEA IGKAELSAGP QFISAERMKY NQGTNDAEAQ GNVRVEKEGD ILEGSDLKFN LLSKTGQLSE PSYRLKDASS RGYAGMLLFE GENQYRLQKA SYTTCPVGDD SWVLQVADLK LDNDKKVGTA KNVKLTFKDV PILYTPWMNF SYSGERKSGL LAPTYGTGSR TGLELAVPFY WNIAPNYDAT FSARLMSKRG LAINNEFRFL GQNSSSNLLA DIVPRDLDTQ TTRWRTSFWH NHYLGAGFSA RLDYNRVSDA TYFRDFGNNL NLTSRTNLLQ QGLLSYNRGL GDDGTFNVTS LVQSFQTIQD PLAAIVVPYK RLPQVGLNAN KPDVFGTGVD VNLSGSWTNF SHPTLVNGSR TVLFPSMSYP LRNSFGFITP KVGMHYTRYS LGEGAGVSEE NPTRTLPIFS LDSGLAFDRK MSLGGESFTQ TLEPRVFYVY VPFRAQDQLP NFDSAKTDFS FAQMLAENRF SGSDRINDAN QVTFALTTRL LESSTGRERL RLAVGHQLSF IDRRITLETP QTIDRRPDFI AAVSGFLTPT ISTDTSFQFD QTRLLADVVR SGVSYRPEPG RVLNFGYRFT RDVLHQVDAS SQWRWSERWQ TVARLNYSLQ DKRILEGLAG VEYNACCWSL RFVLQHLTLA TQKSTTAAFL QLELNGLMQI GSNPLTVLQR SIPGYIRTGS QGSGLIEGP
|
| |