Gene Nmul_A1407 details


Gene Information

Locus tag: Nmul_A1407
Symbol:
ID: 3786437
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 1614147
End bp: 1615769
Gene Length: 1623 bp
Protein Length: 540 aa
Translation table: 11
GC content: 57%
IMG OID: 637811495
Product: secretion protein HlyD
Protein accession: YP_412102
Protein GI: 82702536
COG category: [M] Cell wall/membrane/envelope biogenesis; [S] Function unknown
COG ID: [COG0845] Membrane-fusion protein; [COG5569] Uncharacterized conserved protein
TIGRFAM ID: [TIGR01730] RND family efflux transporter, MFP subunit
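
The coordinates and lengths listed above are mutually consistent, which a little arithmetic confirms. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon. Neither convention is stated in the record, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for a CDS whose length includes one terminal stop codon (assumption)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the stop codon


length = gene_length_bp(1614147, 1615769)
print(length)                     # 1623
print(protein_length_aa(length))  # 540
```

Both values agree with the Gene Length and Protein Length fields of the record.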


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAAGCTTG GTGTCAAATT AACAGCTTTT GTTGCCCTGG CTGCAATCTT GTTTGTCGCC 
GGCTATTGGT GGGGCGCAAT CCGGCCTTCG ACTTCTGCCG CCAAGGCCTC GAGTGTGGTT
TCAGTCGGCG GTAAGCCTGA AAGGAGGATT CTCTATTATC GCAACCCCAT GGGGCTGCCG
GACACTTCGC CGATACCTAA AAAAGATCCC ATGGGCATGG ACTACATACC TGTTTACGAG
GACGAGGAAT CCACTTCCGC CGATGCGGCA GTCAAGATCG GCGTCGAGAG GATACAAAAA
CTGGGAGTAA GAACCGAGCG GGTCGGCATG CGCGAACTCA GCCGCACCGT AAGGGCGGTA
GCGACAGTGC AGGCGGATGA ACGACGGCTC TACGTCGTGG CCCCCAGGTT CGAGGGATGG
ATCGAGCGGC TCCACGTCAA CACGACGGGA CAGATGGTCA GAAAGGGTGA TCCATTGATG
GACGTGTATA GTCCCGACCT GATCACAGCC CAGCATGAAT ATCTTATCGC CCGGAGAGGA
ACAAAGGACG TGGAGGATGG CGGCCTTGAG GTTCGGGCGG GGATGGAACG ACTAGCTGAG
AGTGCCTTGC AGAGACTGCG CAATTGGGAC ATATCCGCGG CCGATCTGAA GACACTCCAG
CAGGAAGGCA AGTTTACACA TTCCGTCATA TTTCGTTCTC CGGCAAGCGG CGTGGTTTTG
GAGAAACGGG CGATTCAGGG ACAGCGCTTC ATGGCGGGAG AGATTTTGTA TCAGATTGCG
GATCTCTCCA GAGTATGGGT GCTCGCAGAC GTGTTTGAGC AGGATCTTGC CACGATACAA
CCGGGGCAGG CTGCCGCCAT TCGGGTTGAG GCCTATCCGG ACAAGGTGTT CCCAGGCGAG
GTGACGTTCA TATACCCCAC GGTCAATCCG GAAACCCGCA CAGCCAAGGT TCGCATGGAG
CTGCCCAATC CACAGGAGTT GCTGAAACCG GCGATGTACG CGAAAGTGGA GTTTGGTTCA
ATCCAACGTA AGGACAGAGT GCTCGCCGTC CCCGAATCGG CCGTACTCGA TGCCGGTACC
AGGCAGTCTG TGCTGGTGGA TCTCGGCGAA GGCCGCTTTG AACCGAGACT CGTGAAGCTG
GGCAAGCACG CGGACGACTA TGTGGAAGTT CTGGGGGGAC TGAAGACTGG GGAAATGATT
GTGGTGAAAG CCAATTTTCT CATAGATGCC GAAAGCAACC TCAAGGCTGC ACTGAGCAGT
TTTACCCGCA GCAGCCAAGC CTCGATGCCT GGTGAAGAGA GGGAGGGAGG CCGGACCGCC
TCTGGCTCAT CCGGAACTGT ACCTTCATCC CCGTCCCCAT CTTCATCGAG CCATCGCGGG
GAAGGCATCA TCGAAGCGAT CGATGTGGCC AATACAACCC TGACTCTTGC CCACCGCCCG
ATCGCAAGCC TCGCCTGGCC GGAGATGTTG ATGGACTTCA AGGTTCTGGA CCCAGCGTTA
CTACGGATGT TGAAACCCGG CCAGAAGGTT GCTTTCGAGA TAACGGAAGC ATCGCCGGGC
GAATACGTTA TTGTGCGTAT ACAGCCCCAG GCCAGCGCTG CCGATCACGG AAAGAAGCCC
TGA
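
A GC content figure like the one in the record can in principle be recomputed from the sequence as printed. The following is a minimal sketch, assuming the figure is simply the percentage of G and C bases over the full gene sequence; the record does not say how it was calculated or rounded. The helper names are hypothetical.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for 10-base block formatting."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in a (possibly block-formatted) DNA sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence only; paste all lines to reproduce the gene-wide value.
first_line = "GTGAAGCTTG GTGTCAAATT AACAGCTTTT GTTGCCCTGG CTGCAATCTT GTTTGTCGCC"
print(len(clean_sequence(first_line)))
print(round(gc_percent(first_line), 1))
```

The printed value covers only the first 60 bases, so it is not expected to match the gene-wide figure.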
 
Protein sequence
MKLGVKLTAF VALAAILFVA GYWWGAIRPS TSAAKASSVV SVGGKPERRI LYYRNPMGLP 
DTSPIPKKDP MGMDYIPVYE DEESTSADAA VKIGVERIQK LGVRTERVGM RELSRTVRAV
ATVQADERRL YVVAPRFEGW IERLHVNTTG QMVRKGDPLM DVYSPDLITA QHEYLIARRG
TKDVEDGGLE VRAGMERLAE SALQRLRNWD ISAADLKTLQ QEGKFTHSVI FRSPASGVVL
EKRAIQGQRF MAGEILYQIA DLSRVWVLAD VFEQDLATIQ PGQAAAIRVE AYPDKVFPGE
VTFIYPTVNP ETRTAKVRME LPNPQELLKP AMYAKVEFGS IQRKDRVLAV PESAVLDAGT
RQSVLVDLGE GRFEPRLVKL GKHADDYVEV LGGLKTGEMI VVKANFLIDA ESNLKAALSS
FTRSSQASMP GEEREGGRTA SGSSGTVPSS PSPSSSSHRG EGIIEAIDVA NTTLTLAHRP
IASLAWPEML MDFKVLDPAL LRMLKPGQKV AFEITEASPG EYVIVRIQPQ ASAADHGKKP