Gene Nmul_A2106 details

Gene Information

Locus tag: Nmul_A2106
Symbol:
ID: 3784677
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 2399177
End bp: 2400592
Gene Length: 1416 bp
Protein Length: 471 aa
Translation table: 11
GC content: 58%
IMG OID: 637812194
Product: Outer membrane efflux protein
Protein accession: YP_412791
Protein GI: 82703225
COG category: [M] Cell wall/membrane/envelope biogenesis; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG1538] Outer membrane protein
TIGRFAM ID:
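
The coordinate and length fields above can be cross-checked with a little arithmetic. The following Python sketch assumes that Start bp and End bp are 1-based and inclusive, and that Gene Length counts the terminal stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2399177
end_bp = 2400592
gene_length_bp = 1416
protein_length_aa = 471


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = expected_protein_length(computed_length)

print(computed_length == gene_length_bp)       # True
print(computed_protein == protein_length_aa)   # True
```

Under these assumptions the listed coordinates, gene length, and protein length agree with one another.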


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.343125
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACTTTC ATGCGTTGTT CATCATCAGC CGGTGTGTGC GTGTGGGAGC CATGCTTGTC 
CTTGCGATGA CTTGCTTGCC CTCAGCAGGT GCTGACAGTT CCTTTACGGG CATCCGCACG
CTTGAGGACG CGCACTCTCC CAGTGCTGCC GGCCCTGAGG TCAAGAGCAA TCTCACCCTG
CGGGATGCGG TGCGGCTTAC GCTTCAGCAC AACCCCGAAC TGTCCTCCTT CGATAAGGAG
ATGCGCGCTC TGGAGGGTGT TACGTTGCAG GCCGGGTTGT TGCGCAACCC TCAGTTGTCG
GTGGACGTCG ACAATGCCGG AAACATGGGA GGCGTAAGTG GACAAGGAGC CATCAAGCAA
AATGTCGAGC AGCAGGATTT GATCATTCGC ATCAGCCAAT TGGTCGAGTT GGGAGGAAAA
CGTGCAGCGC GGGTAAATGC TGCGTCGCTC GGGCAGGCAC TGGCGGGCAA GGACTTCGAA
ACCAAACGGC TCGAACTCGT GGCACGGGTA GCGAACGTAT TTACAGAGGT GCTGGCGGGG
CAGGAGCAGT TGCGGCTGGC CGAGGAGAGT CAGCAGCTGG CTCAGCGCGT GGTGGATACT
GTCAAGCGCC GGGTGCAGGC GGGAAAAGTG CCGCCCATAG AAGAGACTAA AGTGGGAGTA
GCATTTTCCA CGACGCGAAT TGCCCTGGGC CAGGCGCAAC GCGAGCTGGC CGCCGCGCGC
AAACGCCTTG CGCTGCTATG GGGTGACAAT TCGCCCCAGT TTGGGGAAGC GCTAGGAGCT
CTGGAATCGA GGATCGTCCT GCCCGATTTG GCCGCATTGA CCGAGCGAGT CTTGTCGAGT
CCCATGGCGG ATCGCGCCAG AAAAGGCATA GAACATCGCC AGGCGCTGCT CGAAGTGGAG
CAATCCCGCC GCATTCCCGA TATCACCCTT GCGGGCGGCA TGATCAAGCA TTGGGAATCA
GGGGGAACGA CTGCGATCGT AGGCGTCTCC ATGCCGCTGC AATTCTTCGA CCGGAACCAG
GGAAACCTGC GGGAAGCCTA TCAACGCCTG GATAAGGCAC AGGATGAGCA AGCCGCGACC
GACCTGCGCC TCAAGGCGGA ACTGGTACAG GCCTACGAAT CGTTGACCGC AGCCGAGAAC
GAGATATCGA TATTGCGCGG GGAGATATTG CCTGCGGCCC GAAGTGCTTT CGATGTGACG
AACAAGGGTT ATGAGCTCGG CAAATTCGGC TTTCTTGAAG TGCTCGACGC ACAGCGCACC
TTGTTTCAGA ACCAGGTTTT ATATGTGCGT GCGCTCGCCA ATTACCACCG CCTTGTCAAT
GAAATCGAAC GTTTGATTGC AGCCCCCCTC GATGGGAGGG CGAGACAGGA CACCGATGAA
CCGGCCTATA CCGATTTTAC GGATGATAAG GAGTAG
 
Protein sequence
MNFHALFIIS RCVRVGAMLV LAMTCLPSAG ADSSFTGIRT LEDAHSPSAA GPEVKSNLTL 
RDAVRLTLQH NPELSSFDKE MRALEGVTLQ AGLLRNPQLS VDVDNAGNMG GVSGQGAIKQ
NVEQQDLIIR ISQLVELGGK RAARVNAASL GQALAGKDFE TKRLELVARV ANVFTEVLAG
QEQLRLAEES QQLAQRVVDT VKRRVQAGKV PPIEETKVGV AFSTTRIALG QAQRELAAAR
KRLALLWGDN SPQFGEALGA LESRIVLPDL AALTERVLSS PMADRARKGI EHRQALLEVE
QSRRIPDITL AGGMIKHWES GGTTAIVGVS MPLQFFDRNQ GNLREAYQRL DKAQDEQAAT
DLRLKAELVQ AYESLTAAEN EISILRGEIL PAARSAFDVT NKGYELGKFG FLEVLDAQRT
LFQNQVLYVR ALANYHRLVN EIERLIAAPL DGRARQDTDE PAYTDFTDDK E
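
As an illustrative check, the gene and protein sequences can be compared by translating the DNA. The sketch below is a minimal Python example, not the database's own method. It assumes the gene sequence is given 5' to 3' on the coding strand, and it uses the codon assignments of NCBI translation table 11, which match the standard genetic code (table 11 differs only in its permitted start codons). The helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 and last 36 bases of the gene sequence, copied from the record.
first_block = "ATGAACTTTC ATGCGTTGTT CATCATCAGC"
last_block = "CCGGCCTATA CCGATTTTAC GGATGATAAG GAGTAG"

print(translate(first_block))  # MNFHALFIIS
print(translate(last_block))   # PAYTDFTDDKE
```

The two translated fragments match the start and end of the protein sequence listed above. Passing the full gene sequence to `translate` and `gc_percent` would allow the whole protein and the GC content field to be checked in the same way.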