Gene Acid345_3094 details

Gene Information

Locus tag: Acid345_3094
Symbol:
ID: 4072658
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 3676184
End bp: 3677782
Gene Length: 1599 bp
Protein Length: 532 aa
Translation table: 11
GC content: 60%
IMG OID: 637985113
Product: EmrB/QacA family drug resistance transporter
Protein accession: YP_592169
Protein GI: 94970121
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID: [TIGR00711] drug resistance transporter, EmrB/QacA subfamily
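
The coordinate and length fields above are related by simple arithmetic. The sketch below checks that relationship, assuming 1-based inclusive coordinates and an unspliced CDS whose final codon is a stop codon. The function names are hypothetical and are not part of the source database.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3676184
END_BP = 3677782


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count, assuming an unspliced CDS that ends in one stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # the stop codon encodes no residue


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1599
print(protein_length(length_bp))  # 532
```

Both values agree with the Gene Length and Protein Length fields of the record.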


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTCGGCAA CAACTGCGCC TCCGGGCGTT TTGGCAACAG AGGCAGGCGG CGTTCGCGAA 
ATCAATCCGT GGTGGGTAAC CGCCGCGGTG ATGATGGCCG TTTTCATGGA GGTGCTCGAC
ACCACCATCG TCAACGTGTC GCTGCCGCAC ATTGCGGGAA ATCTCTCCGC CAGCGTTGAC
GAGTCAACGT GGGTCCTCAC CTCGTATCTC GTCGCCAACG CCATCATCCT TCCCATGACG
GGATGGCTTG CGAACCACTT CGGCCGCAAG CGCATCCTCA TGTCTTCCAT CATCGGCTTC
ACGCTCGCTT CCGTCGCGTG CGGCATGGCG CCAAACCTGT CATCGCTGAT CTTCTTCCGC
GTCGTGCAAG GCGCCACCGG TGGAGGTCTC CAACCGCTCT CCCAAGCCAT CATGCTCGAG
GCCTTCCCTG GCAAGAAGCG CGGGAAGGCG ATGGCCATCT GGGCCCTCGG CATCGTCGTC
GCTCCCATGC TTGGCCCCAT GCTCGGCGGC TGGATCACCG ACAGTTATAG CTGGCGCTGG
ATCTTCTACA TCAACTTGCC CGTCGGCTTG CTCGCCACCA TGATGTCGCA GTGGTTCATC
TTCGACCCGC CCTACATCAA ACGCTCGTCT GACGTCGTGG ATTACTGGGG AATCGGCTTC
TTAATCGTCG GCATTGGTTC ACTTCAGGTC ATGCTCGACA AGGGCCAGGA AGACGACTGG
TTCGGCTCGC ACTTCATCAC CACCCTCGCC GTGCTCACGG TTGTCGGCTT GATCTTGTTC
ATCATGCGCG AACTCATGGC GGAGCACCCC ATCGTAGACC TGCGTGTCTT TAGAGTGCGA
ACCTATGCCA CCGGTGTCTT TCTCATGACG ATTCTCGGCT TCGTGCTCTA CGGCAGCACG
GTGCTGATCC CCATCTGGCT ACAAACACTC ATGGGCTATA GCTCGCTCGA AGCCGGATTC
GCCGTCCTCC CAAGAGGTCT CGGGTCGTTC CTCTTCATGC CTCTAGTCGG CGTGCTCATG
GGATTTGTCG AGCCACGCAA GCTCTTAGCC ACTGGCCTGA TCACCGCCGG CGGCAGCCTC
TACTTCTTAT CGTTGTTGAA TACCCAGGCC GGATATTGGG ACTTCTTCTG GCCGCAACTC
ATCCAGGGCG CCGCCATGGG CCTCCTCTTC GTCCCGCTCA CGACGATCAC CAACGATCCC
ATCGCCCCCG AAAACATGGG CAACGCCACC AGCATCTTCA ACCTCATGCG CAACATCGGC
GGCAGCATCG GGATCGCCAT GACGACCACG ATCGTCGCGC GCAGCCAGCA GCTCCACTAC
AACAACCTCG TCCACAACAT GAGCTCGTAC AACCCCAAGG TCCAGCAAAT GATGGCCGGC
GCACGTGGCA TGTTCATGTC GAAAGGCATG GACGCCCATT CCGCCGGCCT GCAGGCCTAC
CACTCCCTGT GGGGCATGGT CATGCAGCAG GCCATGATGC TCTCGTTCAT CCGCGCCTTC
CAAATCCTCT CCGGGCTCTT TTTCATTTGC CTCCCGCTAA TCCTCCTAAT GCGCAAACCC
AAGCACAACG AAAAAGGCGG CGGAGGCATG GCCCACTAA
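
The GC content field can be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted as plain text in the 10-base blocks shown above. The helper names `clean_sequence` and `gc_percent` are hypothetical. Only the first line of the sequence is included here to keep the example short, so the printed figure describes that line only, not the whole gene.

```python
def clean_sequence(seq_text: str) -> str:
    """Remove the spaces and line breaks used for display; upper-case the bases."""
    return "".join(seq_text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above; paste all lines to check the whole gene.
first_line = "GTGTCGGCAA CAACTGCGCC TCCGGGCGTT TTGGCAACAG AGGCAGGCGG CGTTCGCGAA"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```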
 
Protein sequence
MSATTAPPGV LATEAGGVRE INPWWVTAAV MMAVFMEVLD TTIVNVSLPH IAGNLSASVD 
ESTWVLTSYL VANAIILPMT GWLANHFGRK RILMSSIIGF TLASVACGMA PNLSSLIFFR
VVQGATGGGL QPLSQAIMLE AFPGKKRGKA MAIWALGIVV APMLGPMLGG WITDSYSWRW
IFYINLPVGL LATMMSQWFI FDPPYIKRSS DVVDYWGIGF LIVGIGSLQV MLDKGQEDDW
FGSHFITTLA VLTVVGLILF IMRELMAEHP IVDLRVFRVR TYATGVFLMT ILGFVLYGST
VLIPIWLQTL MGYSSLEAGF AVLPRGLGSF LFMPLVGVLM GFVEPRKLLA TGLITAGGSL
YFLSLLNTQA GYWDFFWPQL IQGAAMGLLF VPLTTITNDP IAPENMGNAT SIFNLMRNIG
GSIGIAMTTT IVARSQQLHY NNLVHNMSSY NPKVQQMMAG ARGMFMSKGM DAHSAGLQAY
HSLWGMVMQQ AMMLSFIRAF QILSGLFFIC LPLILLMRKP KHNEKGGGGM AH
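
The protein sequence can be cross-checked against the gene sequence by translation. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code but permits alternative start codons such as GTG, which is read as methionine at initiation. The sketch below assumes the gene sequence is given 5' to 3' on the coding strand, as listed above. The names `translate_cds` and `COMMON_STARTS` are hypothetical, and `COMMON_STARTS` lists only the most common bacterial start codons rather than every start codon table 11 allows.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code, shared by table 11).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Assumption: a subset of the start codons permitted by table 11.
COMMON_STARTS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in COMMON_STARTS:
            residue = "M"  # an initiating GTG or TTG is read as methionine
        else:
            residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate_cds("GTGTCGGCAA CAACTGCGCC TCCGGGCGTT"))  # MSATTAPPGV
```

The result matches the first ten residues of the protein sequence above. Passing the full gene sequence allows the whole protein to be compared in the same way.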