Gene MCA2231 details

Gene Information

Locus tag: MCA2231
Symbol:
ID: 3102333
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylococcus capsulatus str. Bath
Kingdom: Bacteria
Replicon accession: NC_002977
Strand:
Start bp: 2420397
End bp: 2423510
Gene Length: 3114 bp
Protein Length: 1037 aa
Translation table: 11
GC content: 49%
IMG OID: 637171376
Product: HAE1 efflux family protein
Protein accession: YP_114650
Protein GI: 53803743
COG category: [V] Defense mechanisms
COG ID: [COG0841] Cation/multidrug efflux pump
TIGRFAM ID: [TIGR00915] The (Largely Gram-negative Bacterial) Hydrophobe/Amphiphile Efflux-1 (HAE1) Family
            [TIGR01131] ATP synthase subunit 6 (eukaryotes), also subunit A (prokaryotes)
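
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based and inclusive (the record does not state its coordinate convention) and that the CDS is complete and ends in a single stop codon. The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2420397
end_bp = 2423510
gene_length_bp = 3114
protein_length_aa = 1037


def span_length(start: int, end: int) -> int:
    """Length of an interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS: number of codons minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for the values listed in this record.
assert span_length(start_bp, end_bp) == gene_length_bp
assert expected_protein_length(gene_length_bp) == protein_length_aa
```

Under these assumptions the listed gene length (3114 bp) and protein length (1037 aa) agree with the coordinates: 3114 bp is 1038 codons, one of which is the stop codon.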


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.601813
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGGTTTC TGTCGATCAA ATCATTGCCG GTATCACAGT TTCCGGACAT TGCCCCTCCC 
CGCGTAACCG TAACTCTGTC ATTCCCAGGA GCGAGTGCGG ACGTATTGGT GAAATCGTCC
ATCATCACAA TCGAACGTGC GATCAATGGC GTTCCCGGCA TGAAATACAT GGTTTCCGCC
GCGACGAGCG CTGGGGAGGC TACGATCCAG ATCCTGTTCG ATCTGGGAAT CGATCCACTG
CAAGCCATGA TCGACGTCAA GACTCGGCTG GATACGACCA TGAGCCGGCT CCCCCAGCTG
GTGACGCTCG AAGGCGTCTG GGTCCAGCGC ATCCAGCCAA GCATGCTGAT GTATATCAAT
CTGTTCAGCA AAGACAAGAA CGCAGATCAA AAATTCCTTT TCAATTACGC CTACGTCAAC
GTCGTTCCAG AGATCCAACG TGTAAACGGC ATAGCACAGG CTCGGATATT GGGCAGCCGC
CAATATGCCA TGCGAATATG GCTGAATCCG GACAGGATGA GAGCCTATAA GGTTTCAACA
GAGGAAGTAC TCAAAGCGAT CGAAGAGCAA AGCATTATCG GCCGGCCTGG TAGATTGGGA
CAGAGCACGG GCGTCACTGC CCAATCGAAA GAATATGTTC TGGTGTATGA AGGCTGGTAC
AACAAGCCGG AGCAATATGA GGATATCATC ATAAGGGCCA ACTCGGAAGG CGAACTGTTG
CGTATAAAGG ATATCGGCAC CGTCGATCTG AACAGTGAAT TCTGGAATAT CTATTCGGAT
AAGGATGGCC TTCCTTCCGC CTCGATCGTC CTCAATCAAA ACTACGGAAC CAATGCCAGC
AAAGTAATCG AGGACGTAAA GAAGAAACTG AAAGAGTTGA AGGAGACATT TCCTGAAGGA
ATGGATTATG AAATCAATTA CGACGTCTCC CGGTTTCTGG ATGCATCCAT CCACAAGGTT
CTCCATACGT TGATGGAAGC CTTCGTGCTA GTCGCACTGG TTACGTTTGT TTTCCTGGGT
GACTGGCGCT CGACGTTGAT ACCGATCCTT TCGATTCCAG TCTCGTTGAT CGGTACGTTT
GCAGTCATAC AGGCTTTCGA ACTATCGATC AACCTGATCA CTCTATTCGC TCTGGTGCTC
GCCATCGGCA TTGTGGTCGA CGATGCTATC GTGGTAGTCG AAGCCGTACA TGCCAAGATG
GAGGCCGAAC ACCTCTCCCC CTATCAGGCA TCCAAGAAGG TTTTGGGCGA AATCGGTGGC
GCCATCATTG CCATCACCTT GATCATGATA TCGGTTTTCG TGCCCATTTC CTTCATGAGC
GGACCTGTCG GTGTGTTCTA TCGGCAGTTT GGCATTACTA TGGCGTCCGC GATTCTCATT
TCCGCACTGG TCGCGCTGTC ACTTGCACCT GTACTCTGCG GCATGATCTT GAAAAATACT
CACGGACAAC CAAAAAAGCG AACCCCAGTC CGCATCTTTA TTGATACATT CAATTATTTT
TTTGAAAAGA TCACCGGCAG ATACGTCACG ATTCTAGAAA AGATCATTAC TCGCCGCCTG
GTGACCATTC TTGCCTTTGC GGGTTTCTCT ATTGGTATTT ATGCGGTCAA TCTTGAACTA
CCGGCGGGCT TCATTCCTGG CGAAGATCAA GGAATGATAT ACGCGATCAT TCAAACCCCT
CCGGGATCGA CAATAGAAAC CACCAACAAG GTCGCTCGTG AACTCGAGGA ACTGGCTAAA
GGAATCGAAG GAGTCAAGTC CGTTTCGTCA CTTGCCGGAT ATGAGGTTCT GACCGAGGGA
CGAGGATCTA ACGCTGGCAC CTGCATCATC AACCTGAAAG ATTGGGAAGA GCGCAAGAAT
ACCGTGCAGG ATATCATCAA GGAGCTAGAG GAGAAGTCCA AGGATTTTGG GGCGATTATT
GAATTTTTCG AACCTCCTCC CGTACCCGGT TACGGCGCGT CATCAGGTCT AGCGTTACGA
TTGCTTGACA AGTCTGACGT CACCGATTAT CAGCAATTTG ACAAGATCAC CCAAGATTTT
CTCATGCAAT TGAAAAAGCG CAAGGAATTA ACCGGCTTGT TTACTTTCTA TGCAGCCAAT
TTTCCCCAAT ATGAGCTTGT AATAGACAAC AAGCTTGCGA TGCAAAAACA TGTATCGATA
GCCAAAGCCA TGGAAAATCT TGACATCATG ATCGGGAGCA CCTATGAACA AGGGTTTATT
CGGTTCAACA ACTTTTTCAA GGTATACGCA CAGGCACTTC CAGAATACCG CAGATACCCT
TCAGATATTC TAAATTATTT TGTCAAAAAC GAAGAAGGAG AAATGGTTCC TTACTCGTCA
TTCATGACGA TGAAAAAAAC CCAGGGGCCG AATGAAATAA CCCGTTACAA TCTCTACAAC
TCTGCGGTAA TTCGGGCTGC TCCGGCAAAA GGATATACTT CGGGAGACGC CATTGAAGCC
GTCAAAGAGG TCGCAGCAGC GACTCTTCCA CGTGGTTACG ACATCGCGTG GGAAGGATTG
TCGTTCGACG AAGCGGCCCG AGGCAATGAA GCGGTCGTCG TTTTCATTGT CGTCATCGTG
TTCGTTTATC TCGTTCTCGC AGGCCAATAT GAGAGTTTCA TCTTGCCATT GGCTGTGCTG
TGTTCGCTGC CTGCAGGCAT CTTTGGCTCC TATTTCTTAC TGAAATTGTT CGGACTCGCC
AACGATGTGT ATTCACAAAT AGCCTTGATC ATGCTTATCG GGCTATTAGG AAAAAACGCC
GTACTGATCG TAGAGTTTGC CGTTCAACGG CACGGTCAGG GGTTGACCGT CAAGGATGCC
GCAATCGAAG GAGCAAAAGC GCGCTTCCGT CCCATTCTGA TGACTTCGTT TGCTTTTATA
GCAGGTCTGA TACCATTGGT TAAAGCCACG GGCGCAGGCG CTATAGGAAA CCGCACCATC
GGAACCGCCT CAATGGGCGG AATGGTGTTC GGAACCGTCG TAGGCGTCGT TTTGATACCA
GGGCTCTATT ACCTATTCGG CAAAATGATC GAAGGTAAAT CCCTGATCAG AAACGAAGAT
AAAGAACCTT TTACCGAAAC TTACCAGTAC AGCAGGGACG ACCTTGATGA CTAA
 
Protein sequence
MGFLSIKSLP VSQFPDIAPP RVTVTLSFPG ASADVLVKSS IITIERAING VPGMKYMVSA 
ATSAGEATIQ ILFDLGIDPL QAMIDVKTRL DTTMSRLPQL VTLEGVWVQR IQPSMLMYIN
LFSKDKNADQ KFLFNYAYVN VVPEIQRVNG IAQARILGSR QYAMRIWLNP DRMRAYKVST
EEVLKAIEEQ SIIGRPGRLG QSTGVTAQSK EYVLVYEGWY NKPEQYEDII IRANSEGELL
RIKDIGTVDL NSEFWNIYSD KDGLPSASIV LNQNYGTNAS KVIEDVKKKL KELKETFPEG
MDYEINYDVS RFLDASIHKV LHTLMEAFVL VALVTFVFLG DWRSTLIPIL SIPVSLIGTF
AVIQAFELSI NLITLFALVL AIGIVVDDAI VVVEAVHAKM EAEHLSPYQA SKKVLGEIGG
AIIAITLIMI SVFVPISFMS GPVGVFYRQF GITMASAILI SALVALSLAP VLCGMILKNT
HGQPKKRTPV RIFIDTFNYF FEKITGRYVT ILEKIITRRL VTILAFAGFS IGIYAVNLEL
PAGFIPGEDQ GMIYAIIQTP PGSTIETTNK VARELEELAK GIEGVKSVSS LAGYEVLTEG
RGSNAGTCII NLKDWEERKN TVQDIIKELE EKSKDFGAII EFFEPPPVPG YGASSGLALR
LLDKSDVTDY QQFDKITQDF LMQLKKRKEL TGLFTFYAAN FPQYELVIDN KLAMQKHVSI
AKAMENLDIM IGSTYEQGFI RFNNFFKVYA QALPEYRRYP SDILNYFVKN EEGEMVPYSS
FMTMKKTQGP NEITRYNLYN SAVIRAAPAK GYTSGDAIEA VKEVAAATLP RGYDIAWEGL
SFDEAARGNE AVVVFIVVIV FVYLVLAGQY ESFILPLAVL CSLPAGIFGS YFLLKLFGLA
NDVYSQIALI MLIGLLGKNA VLIVEFAVQR HGQGLTVKDA AIEGAKARFR PILMTSFAFI
AGLIPLVKAT GAGAIGNRTI GTASMGGMVF GTVVGVVLIP GLYYLFGKMI EGKSLIRNED
KEPFTETYQY SRDDLDD
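
The gene and protein sequences above are laid out in blocks of ten characters, so they need the whitespace stripped before use. The sketch below shows one way to do that and to translate the opening codons of the gene sequence. It is an illustration under stated assumptions, not the database's own method: the helper names `clean` and `translate` are hypothetical, and the codon table is built from the standard amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in its permitted start codons, which this sketch does not model).

```python
from itertools import product

# Standard codon-to-amino-acid assignments in TCAG order.
# Translation table 11 uses the same assignments; alternative start
# codons are NOT handled here (this gene starts with ATG).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(blocked: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(blocked.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence in this record.
first_blocks = "ATGGGGTTTC TGTCGATCAA ATCATTGCCG"
print(translate(first_blocks))  # MGFLSIKSLP
```

The output matches the first ten residues of the protein sequence listed in this record. Passing the full gene sequence to `translate` would be the natural next step for checking the whole 1037-residue product.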