Gene Hoch_4864 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_4864 
Symbol 
ID8547271 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp6715506 
End bp6718610 
Gene Length3105 bp 
Protein Length1034 aa 
Translation table11 
GC content68% 
IMG OID646389537 
Productacriflavin resistance protein 
Protein accessionYP_003269246 
Protein GI262198037 
COG category[V] Defense mechanisms 
COG ID[COG0841] Cation/multidrug efflux pump 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.120441 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGAGG GCGGGCAGCG CGGGCCCATC GCCTGGATGG CGAACAACGC CATCGCCGCC 
AACCTGCTGA TGGTACTGCT GCTCGGCGGC GGCATCTGGA CGGCGCGCTC GATGCAAAAA
GAGGTCTCGC CGCAGTTTCA GCTCGACGTC GTCGAGGTCA ACGTGGCCTA TCCGGGCGCG
GCGCCGGCCG AGGTCGAGAC CGGCATCCTC TTGCCCATCG AGGAGGCCGT GCGCGGCGTC
CAGGGCATCA AGGAGATCGT GTCCACGGCC CGGGAGGGCC GGGGCGAGGT CACCATCGAG
CTGGTCGCCG GCACCGAGCG CATGACGGCG TTCCAGGATA TCGACCAGGC GGTCGCCCGC
ATCCGCACCT TCCCCGACGA CATCGAGGAG CCCGAGGTAC GGCTGCAGTC GGACCAGCGC
GGGGTGCTCG ATATCAATCT GTTCGGCGAT GTCGATGTGT GGACGCTGCG TAAGCTGGCC
GAGCGCCTGC GCGACCGGCT GCTCAGCGAT CCCACCATCA CCCAGATCGA GCTGGGCAAC
GCGCCCGAGT ACGTCACCCA CGTCGAGATC CCGATGGCGC GGCTGCGCGA GCACGGGCTC
ACGCTCGGAC AGGTGGCCGA TATCATCCAG GAGTCGAGCG AGGACGTGCC CGCGGGCTCG
GTCGAGACCC ACGCCGGCGA GATCCTGCTG CGCATGAAGG AGCGCAAGCA GTGGGCCGAG
GAGTTCGGCG ACATCGTGGT GCTAAACTCC AGCTCGGGCG CCACGCTGCG GCTCGCCGAC
ATCGCCGAGA TCAGCGATGG CTTCGAGGAG ATCGGCTTCT ACGGCCAGTT CAACCGCCAG
CCCGCGATCG GCCTCGAGAT CTACCGCATC GGCGCGCAGT CGCCGCTCGA GATCGACGCC
GCCGTCAACG CGATCATCGC CGAGTTCGAG CCCACGCTGC CGCCCGGGGT GTCGATCCGC
ATCGACGGCA ACAGCGCCGA GGACTACCGC GACCGGCTGT GGCTGCTGAT CGAAAACGGC
ATCATGGCCG TGATCATCGT GCTGGTCATC CTGTCGCTGT TCCTCGAGCT GCGGCTGGCC
TTCTGGGTGA TGATGGGCAT GTCGATCTCG TTCGTCGGCG GCCTGCTGCT GCTGCCGCTG
GTCGACGTGA GCATCAACAT GATCTCGATG TTCGGCTTCC TGGTGGTTCT CGGCGTGGTG
GTCGACGACG CCATCGTGGT CGGCGAGAAC GTCTACGAGC ACCGCGAGCG CGGCGAGGGA
TCGCGCATGG ACGCGGCCGT GCGCGGCAGC CGCGAGGTCG CGCGCCCGGT CGTCTACAGC
ATCCTCACGA CGATCATGGC CTTTGTGCCG CTGCTGTTCA TTCCCGGCAC CACGGGCAAG
TACTGGTGGC CGCTGCCGGC CGTGGTCATC GCCGTGCTGC TGGTCTCGCT CGCCGAGGCG
CTGTTCATCC TGCCGGCGCA TCTCGGACAC ACGTCCCGCT TCGGCGTGAG CGCGCCCGAG
CGCTGGCTGG GCGAGCGGCA GCGCAAATTC GCCGGCCTGG TTCAGCGCCT CATCGAGCGC
TACTACGGCC CGCTGCTGGC CGCCTGCCTG CGCTACCGCT ACGTCACCCT GAGCGCGGCG
GTGGCGCTCC TGGCCGTGAT CGGCAGCTAC GGCTACAGCG GTCACATGGG CATGATCATG
ATGCCCGAGG TCGCCGCCGA CGAGATCGAG GCCGGCGTGC GGCTGCCGGT CGGCACCACG
CCCGCGCAGG CGGCCGCGGT GGCGCGTCAG ATCACCGACT CGACCTTCGA GATGTTCGAC
AAGCACGACC TCTTCGAGGT CGCCGATGGC GTCAAGACCA ACGTCCGCGG CGAGAGCTTC
ATCGACGTCG AGATCGTCAT GAAACCGCCG GACGAGCGCG ACATGAGCGC CAACCAGGTG
ATCGCGCTGT GGCGCGACGA GATCGGCGAC ATCGAGGGCG TCGACCAGAT CAGCTTCGAG
GCCGAACGCG GCCCGGGCGG CTACGCGCAG GACATCAGCG TGGACCTCAG CCACGACGAC
ATCGAGGTGC TCGAGAAGGC CAGCCGCGCG TTCATCGAGC GCCTCGAGAG CTTCGAGGCC
ACGCGCGACG TGAGCGACAA CTACCAGAAG GGCAAGACCC AGTTCGACCT CGAGCTCTTG
CCCGAGGGCC GCAACCTCGG CCTCAGCTCG AACTATGTCG GACAGCAGGT CCGCGACGCC
TTCTTCGGGG CGCTGGCTCT GCGCCAGCTC CGCGGCACCA ACGAGATCGA GGTGCGCGTC
AAGCTGCCCA AAGCCGAGCG CGAGGACATC CGCTTCTTCG ACGACTTCGT GGTGCGCACG
CCGGCCGGCG TCGAGGTGCC GCTGCGCGAG GTGGTGCGGG TCAACCGCAG CGAGGCGTTC
AATAGCATCG CGCGCCGCGA CGGCCGCCGC GTGGTCTCGG TGAGCACCGA CGTGGAACCC
AAGAGCGCGG TCACGCGGGT CATCGACTCG CTACAGCGCG AGGAGTTGCC GGCCCTGCGC
GCCGACTACC CGGGCCTCAC CTGGAGCTTC GAGGGCAGCC AGGCCGAGAT GCGCGAGTCC
ACGCAGGCGC TGTGGGGCGG CTTCGCGTTC GCCCTGGGCA TCATCTACTC GCTGCTGGCG
ATCGCGTTTC GCAGCTATCT GCAGCCGCTC ATCGTGCTCA GCGCCGTGCC CTTTGGCGTC
ATCGGCGCAG TGATCGGCCA CATCCTCTTT GGCTACGATC TGTCGCTGGT GAGCCTCATG
GGCATCATCG CCCTGGCCGG CGTGGTGGTC AACGACGCCC TGATCATGCT CGACTTCGCC
AACCGCAACC GCGGCCGGGA TTCGGCCTTC GACGCCATCC ACAGGGCCGG TCTGCGCCGC
TTCCGGCCCA TCATGCTGAC CACGCTCACC ACCTTTGGCG GCCTGGTGCC GATCATCTTC
GAGACCTCCA ATCAGGCCAA TCACCTCATC CCCATGGCCA TCTCGCTCGG CTTTGGCATC
CTGTTCGCCA CCGGGCTCAT CTTGCTGCTG GTACCGTGCC TGTACCTCAT CCTCGAAGAC
CTCGCCGGCG CGTTCGGCGC CAAGTCACCG AGCGCTCAGC CCTGA
 
Protein sequence
MSEGGQRGPI AWMANNAIAA NLLMVLLLGG GIWTARSMQK EVSPQFQLDV VEVNVAYPGA 
APAEVETGIL LPIEEAVRGV QGIKEIVSTA REGRGEVTIE LVAGTERMTA FQDIDQAVAR
IRTFPDDIEE PEVRLQSDQR GVLDINLFGD VDVWTLRKLA ERLRDRLLSD PTITQIELGN
APEYVTHVEI PMARLREHGL TLGQVADIIQ ESSEDVPAGS VETHAGEILL RMKERKQWAE
EFGDIVVLNS SSGATLRLAD IAEISDGFEE IGFYGQFNRQ PAIGLEIYRI GAQSPLEIDA
AVNAIIAEFE PTLPPGVSIR IDGNSAEDYR DRLWLLIENG IMAVIIVLVI LSLFLELRLA
FWVMMGMSIS FVGGLLLLPL VDVSINMISM FGFLVVLGVV VDDAIVVGEN VYEHRERGEG
SRMDAAVRGS REVARPVVYS ILTTIMAFVP LLFIPGTTGK YWWPLPAVVI AVLLVSLAEA
LFILPAHLGH TSRFGVSAPE RWLGERQRKF AGLVQRLIER YYGPLLAACL RYRYVTLSAA
VALLAVIGSY GYSGHMGMIM MPEVAADEIE AGVRLPVGTT PAQAAAVARQ ITDSTFEMFD
KHDLFEVADG VKTNVRGESF IDVEIVMKPP DERDMSANQV IALWRDEIGD IEGVDQISFE
AERGPGGYAQ DISVDLSHDD IEVLEKASRA FIERLESFEA TRDVSDNYQK GKTQFDLELL
PEGRNLGLSS NYVGQQVRDA FFGALALRQL RGTNEIEVRV KLPKAEREDI RFFDDFVVRT
PAGVEVPLRE VVRVNRSEAF NSIARRDGRR VVSVSTDVEP KSAVTRVIDS LQREELPALR
ADYPGLTWSF EGSQAEMRES TQALWGGFAF ALGIIYSLLA IAFRSYLQPL IVLSAVPFGV
IGAVIGHILF GYDLSLVSLM GIIALAGVVV NDALIMLDFA NRNRGRDSAF DAIHRAGLRR
FRPIMLTTLT TFGGLVPIIF ETSNQANHLI PMAISLGFGI LFATGLILLL VPCLYLILED
LAGAFGAKSP SAQP