Gene Nmul_A0521 details


Gene Information

Locus tag: Nmul_A0521
Symbol:
ID: 3784510
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 591768
End bp: 594557
Gene Length: 2790 bp
Protein Length: 929 aa
Translation table: 11
GC content: 57%
IMG OID: 637810603
Product: organic solvent tolerance protein
Protein accession: YP_411221
Protein GI: 82701655
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1452] Organic solvent tolerance protein OstA
TIGRFAM ID:
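
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is only an illustration: it assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information record.
START_BP = 591768
END_BP = 594557
GENE_LENGTH_BP = 2790
PROTEIN_LENGTH_AA = 929


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))   # 2790
    print(protein_length(GENE_LENGTH_BP))  # 929
```

Under those assumptions, 594557 - 591768 + 1 gives the listed 2790 bp, and 2790 / 3 - 1 gives the listed 929 aa.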


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAATTGC GTTTTATCCG TTCCGCTGGC TGGTTGTTTT TGTTGTTTTG CCTTGCCTGC 
AATGCGCGTG CCGATCTGCC GCCGCTTTCT TCAAAGCCGG AGCAGGGACG CGCAACCCCT
TCTGGTGAGG GTGACGATAA ACCGGTGGTT ATCGATACGG AACGCATCCG GGGCCATCAT
GAATACGAGT CAGGCACCAG AAGCGAGAGT GAATTGCGCA GTCGCTCGAC CATTTCAACC
GACCAGATAA AAAAACCCAA CCAAAAAGCG GACCCCGCCG CAAAAGACAC GCCATCCGCG
CCTCAGCAGA ACTATACACT ATCCCCCGCG ATAAAAACCG ACTCCCGGAC AGGCACCTCA
GCCCAGGAAT CGGAGAAGGC GGAAAGTATG GTTTTACCGG GTGGCGTGGA ACGCCTTCCC
GGGCCTGCCG CAGAAGAAGG AGAGCCCAGG CTGCGCACCA GGACCCAGTC CGCACCGCGC
ACTTTATCCG CGCAAAAGCG CGGGGAGAAG CCCGCGAAGA CACCTGCACC GGCCGAAGCA
GACCAAGATA GACCGGGGTT TGCAGAAGGC GAGCGCATTG GAGGTCACAG GGAAGAAGCA
GGCGACGAAA AGCTGCGTCT TGCCGGCGAG ACTGAGCCCG AGGCAATCGA GCAAAAACTG
GCAGAAGCTG AAGCGGAAAC GGACAAACAG TCCCCCGTGT TCGTGGTTGC GGATCGCTTG
CAAGGCCACG TGGAGGAGGA AATCGAAGCG ATAGGCAAGG CGGAACTGTC TGCCGGCCCT
CAGTTTATTT CCGCCGAGCG GATGAAATAC AACCAGGGCA CCAACGATGC CGAAGCCCAG
GGCAACGTCC GTGTGGAAAA GGAAGGTGAC ATCCTGGAAG GATCCGATCT CAAATTCAAT
CTGCTGAGCA AAACGGGCCA GTTAAGCGAA CCCAGCTATC GTCTGAAGGA TGCGAGCAGT
CGCGGTTATG CGGGCATGCT CCTGTTCGAA GGCGAGAACC AGTACCGCCT GCAGAAGGCC
AGTTATACCA CGTGCCCCGT GGGAGACGAC AGCTGGGTTC TCCAGGTGGC CGACCTGAAG
CTCGACAATG ACAAGAAAGT GGGCACCGCC AAAAATGTGA AGCTCACCTT CAAGGATGTG
CCGATACTGT ATACCCCCTG GATGAATTTC TCATACAGCG GCGAGCGCAA ATCAGGATTG
CTGGCGCCGA CCTACGGTAC CGGCAGCAGG ACCGGCCTTG AACTGGCTGT ACCCTTCTAC
TGGAACATCG CCCCCAACTA TGACGCCACG TTTTCCGCAC GCCTGATGTC AAAGCGCGGC
CTGGCGATCA ACAACGAATT TCGCTTTCTG GGCCAAAACT CGAGCAGCAA TCTGCTCGCC
GACATCGTGC CTCGTGACCT GGATACGCAA ACGACGCGGT GGCGCACGTC GTTCTGGCAC
AATCATTATC TGGGCGCTGG TTTTTCCGCT CGCCTGGATT ACAACAGGGT GTCGGATGCA
ACCTATTTTC GCGACTTTGG CAACAACCTG AATCTCACAT CCCGCACCAA CCTGCTGCAG
CAGGGATTGC TGTCTTACAA TCGCGGGCTG GGGGATGACG GCACATTTAA CGTAACCTCG
CTTGTCCAGA GCTTCCAGAC GATTCAGGAT CCCCTGGCCG CAATTGTCGT GCCTTACAAA
CGCCTGCCCC AGGTGGGATT GAACGCGAAT AAGCCGGACG TCTTCGGAAC GGGGGTCGAT
GTCAATCTTT CCGGGAGCTG GACCAACTTC TCCCACCCCA CCCTCGTCAA CGGCAGCAGG
ACCGTGCTCT TCCCAAGCAT GAGCTACCCT CTTCGCAATT CGTTCGGTTT CATCACGCCC
AAGGTGGGGA TGCACTACAC CCGTTACAGC CTCGGGGAGG GTGCCGGCGT GTCCGAGGAA
AACCCCACCC GCACCTTGCC GATATTCAGC CTCGACAGCG GGCTTGCCTT CGATCGCAAA
ATGTCGCTGG GCGGAGAAAG CTTTACGCAG ACGCTCGAAC CGCGGGTGTT CTATGTTTAC
GTCCCATTCC GCGCGCAAGA TCAGTTGCCG AATTTCGATT CCGCCAAGAC TGATTTCAGC
TTTGCCCAGA TGCTGGCGGA AAACCGTTTC AGCGGGAGCG ACCGTATCAA TGATGCCAAC
CAGGTGACTT TTGCCCTGAC GACCCGCCTG CTGGAATCCA GTACCGGGAG GGAGCGTTTG
CGTTTGGCGG TCGGGCATCA ATTAAGCTTT ATCGATCGCC GGATCACACT GGAGACCCCG
CAAACCATCG ATCGCCGACC TGATTTTATT GCCGCAGTGT CGGGTTTTCT TACACCGACC
ATCAGTACTG ACACCAGCTT CCAGTTTGAC CAGACGCGCC TGCTAGCGGA TGTGGTCCGC
TCGGGTGTGA GCTATCGTCC GGAGCCGGGT CGCGTGTTGA ATTTCGGTTA CCGTTTTACC
CGGGATGTGC TGCATCAGGT GGATGCTTCC AGCCAATGGC GATGGTCGGA AAGATGGCAG
ACGGTGGCCC GCCTGAATTA CTCGTTACAG GATAAGAGAA TTCTGGAAGG GCTGGCAGGA
GTTGAGTATA ATGCCTGCTG CTGGTCGTTG CGGTTTGTGC TCCAGCATTT GACCCTTGCT
ACGCAGAAAT CGACCACAGC GGCTTTTTTG CAACTTGAGT TGAACGGCCT GATGCAAATC
GGATCGAACC CGTTGACCGT ATTGCAACGC AGCATTCCCG GGTATATCAG GACGGGTAGC
CAGGGAAGCG GCTTGATAGA AGGGCCATAG
 
Protein sequence
MKLRFIRSAG WLFLLFCLAC NARADLPPLS SKPEQGRATP SGEGDDKPVV IDTERIRGHH 
EYESGTRSES ELRSRSTIST DQIKKPNQKA DPAAKDTPSA PQQNYTLSPA IKTDSRTGTS
AQESEKAESM VLPGGVERLP GPAAEEGEPR LRTRTQSAPR TLSAQKRGEK PAKTPAPAEA
DQDRPGFAEG ERIGGHREEA GDEKLRLAGE TEPEAIEQKL AEAEAETDKQ SPVFVVADRL
QGHVEEEIEA IGKAELSAGP QFISAERMKY NQGTNDAEAQ GNVRVEKEGD ILEGSDLKFN
LLSKTGQLSE PSYRLKDASS RGYAGMLLFE GENQYRLQKA SYTTCPVGDD SWVLQVADLK
LDNDKKVGTA KNVKLTFKDV PILYTPWMNF SYSGERKSGL LAPTYGTGSR TGLELAVPFY
WNIAPNYDAT FSARLMSKRG LAINNEFRFL GQNSSSNLLA DIVPRDLDTQ TTRWRTSFWH
NHYLGAGFSA RLDYNRVSDA TYFRDFGNNL NLTSRTNLLQ QGLLSYNRGL GDDGTFNVTS
LVQSFQTIQD PLAAIVVPYK RLPQVGLNAN KPDVFGTGVD VNLSGSWTNF SHPTLVNGSR
TVLFPSMSYP LRNSFGFITP KVGMHYTRYS LGEGAGVSEE NPTRTLPIFS LDSGLAFDRK
MSLGGESFTQ TLEPRVFYVY VPFRAQDQLP NFDSAKTDFS FAQMLAENRF SGSDRINDAN
QVTFALTTRL LESSTGRERL RLAVGHQLSF IDRRITLETP QTIDRRPDFI AAVSGFLTPT
ISTDTSFQFD QTRLLADVVR SGVSYRPEPG RVLNFGYRFT RDVLHQVDAS SQWRWSERWQ
TVARLNYSLQ DKRILEGLAG VEYNACCWSL RFVLQHLTLA TQKSTTAAFL QLELNGLMQI
GSNPLTVLQR SIPGYIRTGS QGSGLIEGP
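
The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. The sketch below is a minimal illustration rather than a full validation: it builds the codon table by hand (translation table 11 assigns the same amino acids to codons as the standard code and differs only in which start codons it permits), strips the spacing used in the listing, and translates only the first and last lines of the gene sequence. The helper name `translate` is hypothetical.

```python
from itertools import product

# Codon table in TCAG order; amino-acid assignments are shared by
# translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base spacing and newlines
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First and last lines of the gene sequence listing. Each full line holds
# 60 bases, so every line starts in frame.
FIRST_LINE = "ATGAAATTGC GTTTTATCCG TTCCGCTGGC TGGTTGTTTT TGTTGTTTTG CCTTGCCTGC"
LAST_LINE = "CAGGGAAGCG GCTTGATAGA AGGGCCATAG"

if __name__ == "__main__":
    print(translate(FIRST_LINE))  # MKLRFIRSAGWLFLLFCLAC
    print(translate(LAST_LINE))   # QGSGLIEGP
```

The two outputs match the first 20 and last 9 residues of the protein sequence listing; the final TAG codon is the stop and contributes no residue.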