Gene Acid345_4021 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_4021 
Symbol 
ID4071158 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4754579 
End bp4755616 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content60% 
IMG OID637986049 
Productsecretion protein HlyD 
Protein accessionYP_593095 
Protein GI94971047 
COG category[V] Defense mechanisms 
COG ID[COG1566] Multidrug resistance efflux pump 
TIGRFAM ID[TIGR02971] ABC exporter membrane fusion protein, DevB family 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.127081 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAGA GAATATTTAC GATCGGCGCT GCAATCCTCG TTACCGCGTC CACCGCTACC 
GTCATTTGGG GCCACTCGGC GCGCCAAGCC CGCGAGAGCG TGGTTCAGAA GAACACCGCG
GATTGGATCG TTGCCGGCGC CGGCCGCGTC GAGCCCGAGT CAGAAGACAT TAAGCTCGGA
GCCGAGTTGA GCGGAAAGCT CAAATCGGTC TACGTGGAAG AAGGCGATCG CGTGAAGCAA
GGCCAACTCC TTGCTGAATT GGAGAACGCG GACTATCACG CGCAGGTTGC GTCCGCGCAG
GCCGAGGTGC ACGAGACGGA AGCGCAGCTT CGCAAAGTGG TGAACGGCGC TCGGCGCGAA
GAGCGGCGTG AGGCGCTCTC CACCGTCGAG CAGGCGCGCG CGGTGATGAA CAACTCCGAA
AGCGAAATGC TGCGTCGCCA GAAGCTCTAT GAAGCGGGAG TCATCTCGCG CGAAGAAGCC
GAGCAGTATG CGAAGGAATA CGACGTCGCC AAAGCGAAAT TCCAGGAAGT CTCCGAGCAC
CACAAGCTGG TAGACGAAAC CGCTCGCGAA GAAGATCGCG ACATCGCGAC GGCCAACTTG
CTTGCGGCGC GAGCACGGCT GGAGCAGGCG CAAGCGATGT TGGCGAAGAC GTTCGTTCGC
TCTCCCATCG AAGGCACGGT GCTGCGTAAA CATCATCGTC TCGGTGAAAG CGTTTCTAAC
GGTTCGACGG TCCCGGATCC CATCGTGACG ATTGGAGCAA CAGAACGGCT GCGCGTCCGC
GTTGATGTGG ACGAGGCCGA TGTCAGCAAG CTTACGGTCG GGCAGAAAGC GTATGTGACC
GCCGATGCGT ATGGCGATAA ACGCTTCCGG GGGCACATCG TGCGCATTGG CCAGGAGCTC
GGACGCAAGA ACGTTCGCAC CGATGAGCCC ACCGAGCGGG TAGACATGAA GATCCTCGAG
ACCATGATCG AACTCGATGG CGGAGACGAA TTGCCAATCG GACTGCGCGT GAATACGTTT
ATCGTTCCGA AGAGCTAG
 
Protein sequence
MKKRIFTIGA AILVTASTAT VIWGHSARQA RESVVQKNTA DWIVAGAGRV EPESEDIKLG 
AELSGKLKSV YVEEGDRVKQ GQLLAELENA DYHAQVASAQ AEVHETEAQL RKVVNGARRE
ERREALSTVE QARAVMNNSE SEMLRRQKLY EAGVISREEA EQYAKEYDVA KAKFQEVSEH
HKLVDETARE EDRDIATANL LAARARLEQA QAMLAKTFVR SPIEGTVLRK HHRLGESVSN
GSTVPDPIVT IGATERLRVR VDVDEADVSK LTVGQKAYVT ADAYGDKRFR GHIVRIGQEL
GRKNVRTDEP TERVDMKILE TMIELDGGDE LPIGLRVNTF IVPKS