Gene Shewmr4_1940 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewmr4_1940 
Symbol 
ID4252514 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. MR-4 
KingdomBacteria 
Replicon accessionNC_008321 
Strand
Start bp2312172 
End bp2315339 
Gene Length3168 bp 
Protein Length1055 aa 
Translation table11 
GC content51% 
IMG OID638118551 
Producthydrophobe/amphiphile efflux-1 (HAE1) family protein 
Protein accessionYP_734071 
Protein GI113970278 
COG category[V] Defense mechanisms 
COG ID[COG0841] Cation/multidrug efflux pump 
TIGRFAM ID[TIGR00915] The (Largely Gram-negative Bacterial) Hydrophobe/Amphiphile Efflux-1 (HAE1) Family
[TIGR01131] ATP synthase subunit 6 (eukaryotes),also subunit A (prokaryotes) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0454025 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00180133 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCGCAGT TTTTTATTAA TCGGCCGATT TTTGCCAGCG TTATCTCCAT CGTTATCGTG 
CTGCTCGGGG TGATCGCTAT GTTTAAGCTG CCCGTCGATC AGTATCCTTA CATCACGCCG
CCACAGGTGA CGATTTCGGC CTCTTATCCG GGCGCCAGTT CTACTACGGC CGCCGAGTCC
GTCGCCACAC CGCTCGAACA GGAAGTGAAC GGTGTGCCCA ATATGATTTA TATGAGTTCA
AAGAGTACGA ACTCGGGCAG CACCAGTGTG ACTATTACCT TCGATGTGGG TACTAATGCG
GACTTGGCTG CGGTGGATGT GCAAAACTCC GCTCAACAAG CTTCGGGTGG ACTGCCAATC
GACGTGCAAA CCGAAGGGGT GACTGTCTCT AAAGATGCGT CGGTGGAATT ATTAAAACTG
GCGTTAACCT CGAACGATGA AAGGTTCGAT GAGATTTACC TCAGTAATTA CGCCACCATC
AATATCGAAT CGGCGCTCAA GCGGATCCCC GGCGTGGGCC GCACCCGTAA CACGGGTTCA
CGCAGTTATG CGATGCGTAT TTGGTTAAAA CCCGATGCCA TGGCGGGGTA TAGCTTAACC
ACCACAGATG TGATCAACGC CATCAAGGCA CAAAATAAAG AATCCCCCGC AGGCACCATT
GGTACTCAAC CCAATAATGA CGATATCAGT TTAACCTTGC CCATCAGTGT GGCGGGCAGA
TTAAGCTCGG TGCAAGCATT TAACGAGATC ATAGTGCGCG CCAACCCCGA CGGCTCAATT
ATCCGGCTAC GGGATATTGC TGGCGTAGAA CTCGGCTCCT CCGCCTACAC GTTGCAATCC
CAGTTAAACG GTGAAAATGC CACCATCTTA CAGGTGTATT TGCTCCCCGG CGCCAATGCC
CTCGAGGTTA CCCATAAGGT GAAACAGGCG ATGGCCGAAC TGTCGCAAAA GTTTCCCCAA
GGGATGAAAT GGGAAGTGTT TTATGACGCA TCCATCTTTA TTCAAGAATC TATCGATGAG
GTGATCCACA CCCTCATCGA GGCCTTAGTG CTTGTGGTGT TGGTGGTTTA CTTGTTCCTG
CAGAATGTGC GCGCCACTTT GATCCCCGCC ATTGCGGTAC CAGTTTCCTT GATAGGCACC
TTGGCGGCCA TGCTGGCCTT TGGTTTTACC ATTAATACTG TGAGTTTACT CGCGCTGGTT
CTCGCCATTG GGATTGTGGT CGACGATGCC ATTGTGGTGG TCGAAAACGT CGAGCGTTTA
ATTCACGAGA AAGGCATGAG CGCCATCGAT GCGACTCGGA TTGCGATGAA AGAATTGTCG
GGTGCACTGG TTGCCACCAG TTTAGTGCTC TGCGCGGTAT TCGTCCCCGT GTCATTCTTA
GCGGGGATCA CAGGCATCAT GTACCGCGAG TTTGCGGTGG CAATTACGGT TGCCGTGTTG
ATTTCTACTC TAGTGGCGTT GACCCTAAGT CCCGCACTCT GCGCCTTACT ATTAAAACCG
AGTAAGGCAC CACAGCGGGG ATTCTTTCAC TGGCTGAATC GCAAACTCGA CTTGGGGACC
AATCAATATG TCGGTTTAGT CGCCTTAACC AATAAATACG CCAAACGCAG TTACTTAGCT
TTTGCAATTA TGTTTGGTGG TACCTATTTC ATCATGTCCC ACTTACCGAG CAGCTTTATG
CCCGACGAGG ACCAAGGTCG ATTCTTTATC GATATGACAT TACCCGATGG CTCTACGGTT
AATCGTACCG AGGCGATTTT AAAGAAAGCC GAGCAATATG TGCGCGCCAA TCCCGCTGTT
GCCTATTCAT TTACGCTGGC GGGGGAAAAC CGCCGCTCGG GGGCAAATCA GGCTAACGGC
CAATTTGAAG TGGTGCTTAA GCCTTGGGCT GAGCGCGAAG CCAGTCATGC CACGGTACAA
TCCGTGATGA AGGCGATAGA TAAGGATCTG AAAAACGTCT TAGAAGCTGA GTTTAACCTG
TATTTGCCAT CGGCCGTCCC CGGTTTAGGC AATGGATCTG GGGTGGAAAT GCAGCTCCAA
GATACCTCAG GCACCCATTT TGATGGCTTG ATTGAAACCG CCAACGAATT AGTCGAGCAG
CTAAAGCTGC AGCCTGAGGT TGCCAGTGCC AGCGTGTCGT TGCAGAGCGC AATCCCGCAA
TTGCACTTAA CGGTCGATGA GGCAAAGGCG ATGGCGATTG GGGTGAATGT GGGTGATATC
TACAGCACAA TCAAAACCTT AACCGACTCA TCAACCGTGA ATGACTTTAA CCTCTTTGGC
CGGGTGTATC GGGTGAAGAT CCAAGCGGAA GAGAGTTACC GTCAATTCCC ACACCAAATC
AAAGATTATT ATGTGCGCTC ATCGAACGGC GCTATGGTGC CGATTGGTGT GCTGGCTAAG
TATGATTATA CGGTGGGACC ATCTTCCGTG ACCCACTACA ATTTATTCTC CAGCGCCTCC
ATCAATGTGA CGCCCGCAAC GGGCTACGCG ACAGGCGATG TTATTCAAGC CATCGAGCGG
GTCGCAACGC CGATCCTGCC CGATGAGTTT AAATACGAAT GGACAGGCAT TACCTACCAA
GAGGTGCAGT CGGCAAACCA AACCGGCATT GCCATTGGGT TAGCCCTGTT ATTTGTATTC
CTGTTTTTAG CTGCGCTCTA TGAGAGTTGG AGTATCCCTG TGGCGGTATT GCTGATCGCG
CCCATCGCAC TCTTAGGCGC GGCGGTCACG ACCTTAATCA GTGGTATGCA GAGTAACTTA
TTCTTCCAAG TGGCATTTAT TGCCTTAATC GGCATGGCCG CGAAGAATGC GATTTTAATT
GTCGAGTTTG CCAATCAGTT GCACCAGCAG GGGCGTACGC GAATTTCGGC GGCGCTCGAA
GCGGCGACCA TGCGCTTTAG GCCGATTTTA ATGACCTCAA TGGCGTTTAT TCTTGGGGTG
TTACCGCTGG TGTTATCCGA AGGGCCAGGG GCCGTGAGTC GGCAATCAAT TTCGTTACCT
ATCCTGGGTG GAATGGTGTT GGCAACCACC ATAGGTATAG TGTTTGTGCC GCTATTCTTT
GTGACGACCG CAGGCTGGGT AAAAAAGAAA ACCGTTAAAC AACCTATTAA AACAAAAGAG
GAATTACTCC TAGATCAAGA AACCGTTGAG GAGGTGAGCC GTGGTTAA
 
Protein sequence
MAQFFINRPI FASVISIVIV LLGVIAMFKL PVDQYPYITP PQVTISASYP GASSTTAAES 
VATPLEQEVN GVPNMIYMSS KSTNSGSTSV TITFDVGTNA DLAAVDVQNS AQQASGGLPI
DVQTEGVTVS KDASVELLKL ALTSNDERFD EIYLSNYATI NIESALKRIP GVGRTRNTGS
RSYAMRIWLK PDAMAGYSLT TTDVINAIKA QNKESPAGTI GTQPNNDDIS LTLPISVAGR
LSSVQAFNEI IVRANPDGSI IRLRDIAGVE LGSSAYTLQS QLNGENATIL QVYLLPGANA
LEVTHKVKQA MAELSQKFPQ GMKWEVFYDA SIFIQESIDE VIHTLIEALV LVVLVVYLFL
QNVRATLIPA IAVPVSLIGT LAAMLAFGFT INTVSLLALV LAIGIVVDDA IVVVENVERL
IHEKGMSAID ATRIAMKELS GALVATSLVL CAVFVPVSFL AGITGIMYRE FAVAITVAVL
ISTLVALTLS PALCALLLKP SKAPQRGFFH WLNRKLDLGT NQYVGLVALT NKYAKRSYLA
FAIMFGGTYF IMSHLPSSFM PDEDQGRFFI DMTLPDGSTV NRTEAILKKA EQYVRANPAV
AYSFTLAGEN RRSGANQANG QFEVVLKPWA EREASHATVQ SVMKAIDKDL KNVLEAEFNL
YLPSAVPGLG NGSGVEMQLQ DTSGTHFDGL IETANELVEQ LKLQPEVASA SVSLQSAIPQ
LHLTVDEAKA MAIGVNVGDI YSTIKTLTDS STVNDFNLFG RVYRVKIQAE ESYRQFPHQI
KDYYVRSSNG AMVPIGVLAK YDYTVGPSSV THYNLFSSAS INVTPATGYA TGDVIQAIER
VATPILPDEF KYEWTGITYQ EVQSANQTGI AIGLALLFVF LFLAALYESW SIPVAVLLIA
PIALLGAAVT TLISGMQSNL FFQVAFIALI GMAAKNAILI VEFANQLHQQ GRTRISAALE
AATMRFRPIL MTSMAFILGV LPLVLSEGPG AVSRQSISLP ILGGMVLATT IGIVFVPLFF
VTTAGWVKKK TVKQPIKTKE ELLLDQETVE EVSRG