Gene Acid345_1699 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1699 
Symbol 
ID4070482 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2060074 
End bp2063208 
Gene Length3135 bp 
Protein Length1044 aa 
Translation table11 
GC content57% 
IMG OID637983707 
Productacriflavin resistance protein 
Protein accessionYP_590774 
Protein GI94968726 
COG category[V] Defense mechanisms 
COG ID[COG0841] Cation/multidrug efflux pump 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.920783 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0913661 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGCGAT TCGCTATCAA GTATCCGTTT TTCATCATTA TGCTCTGCCT GGTCATTCTT 
GTGGTGGGAA CTACGATGAC GGCACGCATG CCGGTCGATC TCTTTCCTGA GATCAAGATC
CCCGTGGTGG TGGTTGCGAC GTTTTATTCC GGAATGCCTC CGGAGCAAAT TGAAACCGAC
ATCACCGGAC GCTTCGAGCG CTTCTTCACC TTGGGCAGCG GGATTGATCA CATGGAATCG
CGATCGTTGC CGGGTGTAAG TCTCATCAAG GTCTATTTCC AACCGGGCAC GGATCCAAAC
GCGGCCGTCA GCACGATTTC GAATCTTGCC ATGGCAGATT TGCGCAAACT TCCGCCGGGA
ACCCTGCCGC CGGTGATCTT GAAGTTCGAC GCCTCGAGCT TGCCGGTCTG CCTGATCACA
CTGAAGGGCG CGGGACTTAA CGAGACGCAA CTGCGCGATA TTGGCCAGTA CAACGTACGC
AACCAGGTGG CGAACGTGCC TGGCGCTTCG GTTCCGCAGC CTTTCGGTGG AAAGTACCGG
CAGATCCAGG TGTATGTGGA TCCGGTGAAA TTGCAGGCGG CGCAACTGAG CGTGATGGAC
GTGGTTCGGA CTGTCAACAA TTCGAACATG ATCCTGCCCG CAGGCGATGT TCGTATTGGG
CCGAAAGACT TCAACCTGTA CACCAACAGT CAACTTCCGG ACATCGAGGA GATCAATCGC
CTTCCGCTGA AAACTGTAGG CAACGCTTCG CTACTGGTGG GAGATGTTGG GCACGCGCAA
GACGCAGCGC AGATCCAAAC GAGCATGGTG CGCGTCGACG GACAAAAGTC CGTGTACTTG
CCGGTGCTGA AACAAGGCGG CGACAGCAAC ACGATCGCGA TTGTGGATGG CGTTGAGAAA
TCGCTGAAAG ACCTGGTAGA CGTACCGAAG AGTTTGACCG CGAAAGTGGT GTTCGACCAG
TCGGTGTACG TGAAGACGGC GATTCGGAAC CTGATCAACG AAGGTGGCAT TGGCCTGGTG
TTGACCGCGT TGATGATTCT CATTTTTCTC GGTAGCGTAC GCGGGACAGT TGCCGTCATG
TTGTCGATCC CGCTGTCAGC TCTTGCTGCG TTCCTCGCGA TTAACGCGAG TGGCGGAACC
ATCAACACGA TGGTGCTCGG CGGACTGGCT TTGGCGTTCT CGCGCTTGAT TGATAATTCG
GTCGTGGTGC TGGAGAACAT CTTCCGGCAC CTTGAGAACG GAGAGCCAGC GGAGATCGCA
GCGGAACGCG GTGGACGCGA AGTAGCGTTG CCGGTACTGG CCGCGACGTT TACAACAACG
ATTGTGTTCT TCCCGGTCGT GTTCCTTTAC GGCGTGAGCC GGTTCTTGTT CACAGCATTG
GCGGCGGCGG TGGTGTTCTC GCTCTTTGCG TCGTACATGG TGGCAATGAC GGTCGTACCG
TTGTTCTGCG CACAGTTCAT CAAGAACGCG CACAACGAAG CAGGGCACGT CGGAGCTCAC
GCGAACTGGT TCAGCCGCTT CGTAGGCAAG TTCAACCACG CCTACAACCG TATGCTCATG
CGTTACGACA TTGCGGTGGG CAAGACGTTG CTGCGGCCGA TTGCGACGAC GATCAGCATT
CTCGGTGTCT GTGCGTTCAG CCTGGCCATC TACCCTCTTC TTGGCGTCTC GTTCTTTCCG
CGCACAGATC CTGGGCAGTT CGTTTTGAAC CTGAAGGCGC CTACGGGAAC TAGGCTTGAA
CTGACGGACG CGTATGCGCA GCGCGTCGAA AAGGACATTC GCGAGATTGT CCCGAACGAT
GACCTCGGGA TGGTCGTTTC GAACATCGGC GTGACGCCGG GATTTTCATC GATGTACACG
AGCAACTCGG GCCAGCATAC GGCGACGATC CAGGTGAGTT TGAAAGAAGG CCACAAGGTC
GGAAGCTACG AATATATGCG CCGGGTGCGG CACAAATTGG CAGACGATTT GCCGGAGCTT
TCGACGTACT TACAATCGGG CGGACTGGTG GATGCCGTAA TCAATCTCGG CTTACCGGCG
CCGATTGATA TCCAGGTCAG CAGTAATCAT TTGCATGATG CGTACAACGT CGCACAGGAA
CTTCAGGGGA AGATTGCGAA GGAGCGTGGA GTGAGTGATG TGCTCATACC GCAGGACCTC
GACTATCCCG CATTGAAGCT CAATGTGAAC CGCGAGATGG CGTCGCGCCT CGGATTGAAT
TCGCGCGAGG TCGTGGATAA CGTGATCACC GCGCTCACTT CGAACCAGAT GATCGCCCCA
AGTTTCTGGG TAGATCCAAG ATCGGGCAAC GATTACATGC TCACCGTGCA GTATCCGGAT
TCGCAAGTGA AGTCGTTGAA CGATCTGAAA CAGATCCCGC TGCGTTCAGA CCATGGGGAA
GAGACCACGC AACTCGGTGC GGTAACGGAC GTGAAAGTGG TGGATTCGCC GACGGAGGTG
GACCACTACC AACTGCGACG GGTGATCGAC ATCTACGTGG CGCCGTCGGG ACAGGACCTC
GGCTCGCTTG CAACCCGAGT AAACAAGATC ATTGCGGAAA CGAAGGCGCC GGAGGGGGTG
CGGGTGACGA TGCGCGGCTC GGTCGAAGGC ATGAACCAGT CGTTCAAGAG TTTCGGCCTT
GGACTGATTC TTTCTATCGT GCTGGTGTAC CTCATCCTGA TGGCCCAGTT TGCATCGTTC
CTCGATCCGT TCATCATTCT GCTGGCGATC CCGCCGGGAA TAACGGGAGT GCTGCTGTTC
TTGTGGGCTA CACGCACTAC TTTGAATGTC ATGTCTTTGA TGGGCGTAAT CATGATGACT
GGCATCGTGG TGTCGAACAG CATTCTGATC GTCGACGTGG CGCGCGAACT GCGGAAGACG
GGAATGCCAA TTGCCGAAGC GGTCGCGACG GCTTGCCGAA TGCGGCTGCG TCCGGTGCTG
ATGACGTCTC TAGCGACGAT CCTCGGCATG ATCCCCATGG CATTGGCGCT GGAGGCAGGA
AGTGAGCAAT ATGCACCGCT GGCCCGCGCG ATCATTGGCG GATTGACGCT GTCGGTGATC
GTGACGGTGT TTCTCGTACC CGCAGCGTAC CTGTGGTTGC ATCGTCGCGA AGAAAGGCCG
GTGGTGCAGG CATGA
 
Protein sequence
MSRFAIKYPF FIIMLCLVIL VVGTTMTARM PVDLFPEIKI PVVVVATFYS GMPPEQIETD 
ITGRFERFFT LGSGIDHMES RSLPGVSLIK VYFQPGTDPN AAVSTISNLA MADLRKLPPG
TLPPVILKFD ASSLPVCLIT LKGAGLNETQ LRDIGQYNVR NQVANVPGAS VPQPFGGKYR
QIQVYVDPVK LQAAQLSVMD VVRTVNNSNM ILPAGDVRIG PKDFNLYTNS QLPDIEEINR
LPLKTVGNAS LLVGDVGHAQ DAAQIQTSMV RVDGQKSVYL PVLKQGGDSN TIAIVDGVEK
SLKDLVDVPK SLTAKVVFDQ SVYVKTAIRN LINEGGIGLV LTALMILIFL GSVRGTVAVM
LSIPLSALAA FLAINASGGT INTMVLGGLA LAFSRLIDNS VVVLENIFRH LENGEPAEIA
AERGGREVAL PVLAATFTTT IVFFPVVFLY GVSRFLFTAL AAAVVFSLFA SYMVAMTVVP
LFCAQFIKNA HNEAGHVGAH ANWFSRFVGK FNHAYNRMLM RYDIAVGKTL LRPIATTISI
LGVCAFSLAI YPLLGVSFFP RTDPGQFVLN LKAPTGTRLE LTDAYAQRVE KDIREIVPND
DLGMVVSNIG VTPGFSSMYT SNSGQHTATI QVSLKEGHKV GSYEYMRRVR HKLADDLPEL
STYLQSGGLV DAVINLGLPA PIDIQVSSNH LHDAYNVAQE LQGKIAKERG VSDVLIPQDL
DYPALKLNVN REMASRLGLN SREVVDNVIT ALTSNQMIAP SFWVDPRSGN DYMLTVQYPD
SQVKSLNDLK QIPLRSDHGE ETTQLGAVTD VKVVDSPTEV DHYQLRRVID IYVAPSGQDL
GSLATRVNKI IAETKAPEGV RVTMRGSVEG MNQSFKSFGL GLILSIVLVY LILMAQFASF
LDPFIILLAI PPGITGVLLF LWATRTTLNV MSLMGVIMMT GIVVSNSILI VDVARELRKT
GMPIAEAVAT ACRMRLRPVL MTSLATILGM IPMALALEAG SEQYAPLARA IIGGLTLSVI
VTVFLVPAAY LWLHRREERP VVQA