Gene Nmul_A1641 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A1641 
Symbol 
ID3785583 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp1879350 
End bp1880789 
Gene Length1440 bp 
Protein Length479 aa 
Translation table11 
GC content54% 
IMG OID637811729 
ProductOuter membrane efflux protein 
Protein accessionYP_412333 
Protein GI82702767 
COG category[M] Cell wall/membrane/envelope biogenesis
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG1538] Outer membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTATCCCG GAGTACTAGT TGATCGTGAG CAAGTTGATG TGCTTAATCT TCTGAGCAAT 
GTGCTGCCTG CAGTACGAAT TGCTCTGGGG ATCATGCTGT TTTCGAGTAT ATCGGCAATG
CTTCATGCAG CTTCTTTCAC GGTTCAGGCA GAACCTGCAA AAGAGGAATT TTCCATCACC
GAACAGCAAG CGATCGCACT GTTCTATCAG CGCAACCTCG GCTTGATCGC AGCCAGCCTC
AACATCGATA ACGCCAGGGC CCAGGAAATC ATCGCCGCAG CGATTCCCAA TCCGGTATTC
AGTTTTACAG TTCACGAACT GGCCCCAAAG GCATTCGCGC CGGAAAGCCG TCACCTGGCA
GTCCCCGCGT ATTTACCGCA GATTCAGCAG CTCATAGAAA CGGCCGGAAA ACGGCGTTTA
CGGATAGAAA GCAGCGAGCT GGCTACCGAG GCTGTGAACT TCGACGTGCA AGACGTTGCA
CGCGTGCTCA CAAACACCGT GCGGCGCAGT TTTTACAACC TCCTGTTGGC CCAGAAGACG
ATCAAGGTTG CACGGGATAA CCTCGAGCAT TACCGGGAAA TTCTGAGGGT AAACGAAATA
CGGCTCAAGG TAGGCGATGT TGCGGAGATG GATTTCGTTC GTATCGAGGT TGAAAGCCTC
AAGGTTCAAA GCGATCAGGA TCAGGCAAGG GCTGCATTGA ATCAGGCACG GGCCGACCTG
CTATTGCTGC TGGGCTGGCC TGAAAACAGC ATAGAAATCA ATGCCGCCGA AACCTGGCCT
CAAGCAACGC CCGAGATTGC GCTGGCGACG CAGGATCAAT TGGTTGAACG CGCGCTGGAA
CGACGTCCGG ATATGCGCGC TGCAAGGATA CGCATCGCCC AGGCGCGAAA AGTGCTCACA
CTCGCGCAAC GGCAGGTTAT TCCCGATGTG ACAATAAGCG CGTTCTACGA TCGGGATCAG
GGTAACCAGT TTCCGCGTAC TGGCGGCGTG GGTATCAGCA TACCAATCCC TTTGTTCTAC
CAGCAAAAAG GTGAAATTTC CCAGGCTCGC GTAGGTTTGA CTTCCAGCGA ACTGGCATTA
AGGCAGGCCG AGTATGACGT GCGCGCTGAA GTCATGAAGG CCTCGGCAGC TTGGCAAAGC
GCCGACGCCA TAGCCCGGCG CTTTGAAACT TACGTGGTCA AAAAAATCGA GGCATTGCGC
AAGGCACAGG AAATTGCTTA TCAAAAAGGG GCAGTGGGAG TGCTGGATCT GATCGATGCT
GAGCGAAGCT ATCGGACAAT TATGCTGGAT TATTATGCCG CGCTGGCAAA CCGCAGCAAA
GCCTGGGCTG ATTTGCTGAT GGCATATGGC GAGGAAACCG GAAATCCGCG CTATCAATCC
GGCAGCAACC AGGATGATTG GCGGTCCGCG CGTTCCCACC GGGTGAATTT CGGTAAATAA
 
Protein sequence
MYPGVLVDRE QVDVLNLLSN VLPAVRIALG IMLFSSISAM LHAASFTVQA EPAKEEFSIT 
EQQAIALFYQ RNLGLIAASL NIDNARAQEI IAAAIPNPVF SFTVHELAPK AFAPESRHLA
VPAYLPQIQQ LIETAGKRRL RIESSELATE AVNFDVQDVA RVLTNTVRRS FYNLLLAQKT
IKVARDNLEH YREILRVNEI RLKVGDVAEM DFVRIEVESL KVQSDQDQAR AALNQARADL
LLLLGWPENS IEINAAETWP QATPEIALAT QDQLVERALE RRPDMRAARI RIAQARKVLT
LAQRQVIPDV TISAFYDRDQ GNQFPRTGGV GISIPIPLFY QQKGEISQAR VGLTSSELAL
RQAEYDVRAE VMKASAAWQS ADAIARRFET YVVKKIEALR KAQEIAYQKG AVGVLDLIDA
ERSYRTIMLD YYAALANRSK AWADLLMAYG EETGNPRYQS GSNQDDWRSA RSHRVNFGK