Gene Hoch_5623 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_5623 
Symbol 
ID8548037 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp7718701 
End bp7721571 
Gene Length2871 bp 
Protein Length956 aa 
Translation table11 
GC content69% 
IMG OID646390294 
ProductWD40 domain protein beta Propeller 
Protein accessionYP_003269996 
Protein GI262198787 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0823] Periplasmic component of the Tol biopolymer transport system 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.101899 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCGCTGT GCTTGGCCGC GCTGCAGGCC GCGTCCGCGT GGGCCGGTGA CCCGAAGCTG 
CGCTGGCGCA CCATCGAGAC TGAGCACTTT GTCATCCACT ATCACGAGCC GCTCGGCGAG
GTCGCCCAGC ACGTGGCCGC GGCTGCCGAG CGCAGCCACG AAGTCCTGAG CCCGACCTTC
GAGCACGCGC CCGACGACAA GACCCAGATC GTCATCACCG ACGACACCGA CAGCGCCAAC
GGCTTCGCCA GCGTGATCCC GCGCAATCGC ATTCGACTAT TCGCGACCGC GCCGACCAGC
CTGTCCTCGC TCAACGATCA CGACGACTGG CTGTATCTGC TCGTCGCCCA CGAGTACACG
CACGTGCTGC ATCTCGACTC CATCGGCGGG ATCGCGCGCT GGGTCAACCG GGTCTTCGGC
AAAGTGTGGG CGCCCAATCA GGTGCAGCCG CGCTGGGTGA TCGAGGGAAT CGCCACGTAT
CAGGAGTCGG AGCAGAGCGC GGGCGGACGC ACGCGCAACG CGGTCTTCGA CATGGATCTG
CGCGCCGCGG TGCTGGCCGA GGAGGAGCAC GATCTCGACG CGGTGACCCA TCTGCCGCGC
GAGTGGCCGC ACGGCAACGC CGCCTACCTG TACGGCTCGC ATTTCCTCAA GTACGTATTC
GACCGCCACG GCAGCGATGC CCTGCGCCAT CTGAGCTGGG CCTACGGCTC GCAGCCGATT
CCCTACGGCC TCAATCGCGC GATCCGCGAA GCCACGGGCC ACACCTTCGA AGACCTGTAC
GAGAGCTGGC GCCGCCACCT GCGCGACAAG TACAGCAGCC AGCTCGAGGC GATCGAGCGC
CGCGGTCGCC GCGAAGGGCA CCGGCTGAGC TTCACCGGCG AGACCAACCG CAACCCGCGC
TTCTCGCACG ACGGCCGCTA TCTCTACTGG CATCAGAGCG ACGGTCTGCG CCCGGGTCAC
ATCCGCCGGG TGGCGGTGGG CGAGCACGTC GGCGAGGCCG AGGACGTGAC TGACGCCGAC
CGCCTGGGCG ACTTCGTCGT GCTCGACGAT GGTTCCCTGG TGTTCGAGCA GACGACCTCG
TACCAGGGCA ACTACAGCTT TCAGGACATC TATCGCTGGG ATGTGCGCGA GACCGCGCCG
CAGCCGATCA CCCACGGCCT GCGCGCGCGC CAACCCGCGG TGTCACCGGA CGAGCGGACG
GTCGCCTTCG TGCTGAGCGG GGAGTCACGC AGCCGCCTGG CGCTGATGCC GCTCGAGCCC
TACGCCGAGC ATCGCGTGGT GTGGAGCGGC GAGCACCGCT TCGACCAGGT AGCGACGCCG
GCGTGGTCGC CCGACGGCCA GCGCATCGCG TTCTCCGCCT GGGAGAGCGG CGGCTATCGC
GATATCTGGA TCTTTGATGT CGCCAGCGAG CGGGCTACCC GCCTCACCCG CGATCGCGCC
CTCGACGTGA GCCCGGTGTT CAGCCCCGAC GGCGCGTACC TGTTCTACGC CAGCGACCGC
AGCGGCATCT TCAACATCTA CGCGCACGAG TGGGCCACTG GGGCGCTGTA CCAGGTGACC
AACGTGATCG GCGGCGCGAT GGCGGCCGAG ATCTCGCCCG ACGGCACGCG CCTGGTGTAT
CAGGGCTTCG GCGTCGGCGG CTACGATCTC TACGAATTGC CCCTCGAGCG CTCGCGTTGG
CTCGAGCCGC TGCCGTATGT GGACATCCGT CCGAATTCTG TCGAGATCCG CGACGACGAG
GTGGCGGTGA CGCCGCCGCG GCCGTACCGC GCGCTCGAGA CGCTGGCGCC CGAGAGCTAC
GGCCTGGAGC TTATCGTCGG CAGCTTCGGC AGCGCGCTCA GCGTCACCAC CAGCGGCGGC
GACGCGGCCG GCCTGCACGC CTACAATCTG GCGGCCACCA TCGGCCTCGC CCGCCCCGGC
GTGGATCTCG GGGTCTCGTA CGGCTACGCG GGGCTGTGGC CGTCGCTACG ATTGTCGGCC
GCGCGTACGA CTTCGCGCCG CAGCGGGTTC ATTGTCGACG GGGTCAACAC CGTCTACGAG
CGCGATGCCT ACGGCCTGAC GGCGAGCGTG GGACTGCCGG TGCTCGCCAC CGCGAGCGGC
GGCGGCACGC TGTCTCTGGA CTACGACCTC GACTACTACC GCGTGGTCGA CGCGCCGGAC
GAAATGCCCG ATCCCAACGC GCTGTTGCCG CGGCCGGTCG GCGATACCTT GCTGGCCGGT
CTGGCCCTGC GCTTCAGCTA TTCGGACGCG CGCGGCACCG TCTACTCGCT CGGCCCGGTC
GAGGGCACAT CGTTCTCGAC CTCGCTGCGG CTCGATCACC CGGGCTTGGG CTCGGACAGC
CACGCCCTGT CGCTGGGCTA TCGCTGGGCC ACCTACCGCG CGCTCGACTG GTCGCCGACC
TCGTCGCTGT CGCTGCAGAT CGCGGGCGGA CTGCGCATCG ACGCCGACGG CACGCCCTCG
GGGTTCTCGC TCGGCGGCGT GCCCGAGCAG GACGTGGTCG GCTCGCTGAT CGATAGCGCT
CGTTTCAGCT CGAGCGGCTA CTTGCGCGGC TACGCGCGCG GCTCGGTTTT CGGACGCCAG
TACCATCTGG CCAACCTCGA GTATCGACAG GAGCTGTTCG ACTTCGAGAG CGGTCTGGCG
ACGCTGCCCT TCTTCGTGCG CCGCATGCAC GTGGCCGGGC TGCTCGACGT CGGCAACGCC
TTCTACGGCG ACTTCGACCC GCGCGATTTT CGCGTCGGCG TGGGCGGCAG CGTGCGCCTC
GACGTGGTAG TCGGCTATTA TCTCCCGGGC TCGCTGGACC TCGGCTACGC GCGCGGGCTC
ACGGGCGAAG GCATCGACGA ATTCTGGATG CTCCTCACCG GCACGATCTG A
 
Protein sequence
MALCLAALQA ASAWAGDPKL RWRTIETEHF VIHYHEPLGE VAQHVAAAAE RSHEVLSPTF 
EHAPDDKTQI VITDDTDSAN GFASVIPRNR IRLFATAPTS LSSLNDHDDW LYLLVAHEYT
HVLHLDSIGG IARWVNRVFG KVWAPNQVQP RWVIEGIATY QESEQSAGGR TRNAVFDMDL
RAAVLAEEEH DLDAVTHLPR EWPHGNAAYL YGSHFLKYVF DRHGSDALRH LSWAYGSQPI
PYGLNRAIRE ATGHTFEDLY ESWRRHLRDK YSSQLEAIER RGRREGHRLS FTGETNRNPR
FSHDGRYLYW HQSDGLRPGH IRRVAVGEHV GEAEDVTDAD RLGDFVVLDD GSLVFEQTTS
YQGNYSFQDI YRWDVRETAP QPITHGLRAR QPAVSPDERT VAFVLSGESR SRLALMPLEP
YAEHRVVWSG EHRFDQVATP AWSPDGQRIA FSAWESGGYR DIWIFDVASE RATRLTRDRA
LDVSPVFSPD GAYLFYASDR SGIFNIYAHE WATGALYQVT NVIGGAMAAE ISPDGTRLVY
QGFGVGGYDL YELPLERSRW LEPLPYVDIR PNSVEIRDDE VAVTPPRPYR ALETLAPESY
GLELIVGSFG SALSVTTSGG DAAGLHAYNL AATIGLARPG VDLGVSYGYA GLWPSLRLSA
ARTTSRRSGF IVDGVNTVYE RDAYGLTASV GLPVLATASG GGTLSLDYDL DYYRVVDAPD
EMPDPNALLP RPVGDTLLAG LALRFSYSDA RGTVYSLGPV EGTSFSTSLR LDHPGLGSDS
HALSLGYRWA TYRALDWSPT SSLSLQIAGG LRIDADGTPS GFSLGGVPEQ DVVGSLIDSA
RFSSSGYLRG YARGSVFGRQ YHLANLEYRQ ELFDFESGLA TLPFFVRRMH VAGLLDVGNA
FYGDFDPRDF RVGVGGSVRL DVVVGYYLPG SLDLGYARGL TGEGIDEFWM LLTGTI