Gene Nmul_A1422 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A1422 
Symbol 
ID3786620 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp1633332 
End bp1634411 
Gene Length1080 bp 
Protein Length359 aa 
Translation table11 
GC content55% 
IMG OID637811510 
Productsecretion protein HlyD 
Protein accessionYP_412117 
Protein GI82702551 
COG category[V] Defense mechanisms 
COG ID[COG1566] Multidrug resistance efflux pump 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0497268 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGGCCAT CCGTCTCAAA TAAAGCAGCG CAACCGCATC TGACATCATC GGTGAATAAA 
AAACTCGTCC TTATAGGAAT AGCAATAGGA CTTGCTCTGA TTTCGGTGGG AACGGTAAGC
TGGTTTTTTA CCCATAAGAA AAGCAATGGC GAGTTCCTGA CTCTCTTTGG CAATGTGGAT
ATCCGCCAGG TTTCTCTCGC CTTCAACGGA AACGATCGGA TCGCTGAAAT GCGAGTGGAG
GAAGGAGACC GGGTCAGGGC CGGACAAGTT CTGGCAAAGC TGGATACCCG CATTCTCACG
TTGCAAATTG CGCAAGCCGA AGCCCAGGTT GCCGCCCAGG AGCAAGCTCT GTTACGGCTT
GAGAACGGTA CCCGTCCCGA GGAAATAGCA CAGGCCAAAG CCGAAGTTGC TTCCGCTCAG
GCCGATGCCG ATCTCGCCCG GCAGTTTCTC GGCCGCTTGA TGGAGATTGA AAGTGACTCG
GGGGCGGCCG TCAGCCAGCA GGATCTCGAC AATGCCAGGT CTCGCCGTCG GGTGGCCGTA
GCGCAACTCG AAAATCGTAA AAAGGCACTG CAACTGGCAT TGATCGGGCC GCGCAAGGAA
GATATTGCGC AGGCGGAGGC GCAGTTGAAC GTTTTTCGTG CTGAGCTGGC CTTGCTGCGG
CACCAGCTTG ATTTGGCCGA ATTGAAATCC CCTATTGATG CTGTCATACG CTCACGTCTT
CTCGAACCGG GAGACATGGC TTCGCCACAA CGTCCGGTTT ATGCGCTGGC CATAACCGAT
CCAAAATGGG TCCGAGCCTA CGTATCCGAG ATCGATCTAG GCCGAATCAA GCTTGGCATG
AGGGCAGAGG TTGTTACCGA CAGTCATCCG GAGGAGTCCA TTCATGGTCG TATTGGCTAT
ATCTCGTCGG CTGCCGAGTT CACCCCAAAG CCTGTACAAA CCGAGGAGCT GCGCACCAGC
CTTGTCTATG AGATACGGGT GTATGTGGAA GACGCGGAGG ACAGGCTGCG TCTGGGTATG
CCCGCCACCG TGCATATCGC TCTCAGAAAT AATGGAAATT CCAGCGAAGT GAAGCATTGA
 
Protein sequence
MRPSVSNKAA QPHLTSSVNK KLVLIGIAIG LALISVGTVS WFFTHKKSNG EFLTLFGNVD 
IRQVSLAFNG NDRIAEMRVE EGDRVRAGQV LAKLDTRILT LQIAQAEAQV AAQEQALLRL
ENGTRPEEIA QAKAEVASAQ ADADLARQFL GRLMEIESDS GAAVSQQDLD NARSRRRVAV
AQLENRKKAL QLALIGPRKE DIAQAEAQLN VFRAELALLR HQLDLAELKS PIDAVIRSRL
LEPGDMASPQ RPVYALAITD PKWVRAYVSE IDLGRIKLGM RAEVVTDSHP EESIHGRIGY
ISSAAEFTPK PVQTEELRTS LVYEIRVYVE DAEDRLRLGM PATVHIALRN NGNSSEVKH