Gene Hoch_3317 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_3317 
Symbol 
ID8545705 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp4576802 
End bp4579237 
Gene Length2436 bp 
Protein Length811 aa 
Translation table11 
GC content71% 
IMG OID646387984 
ProductProtein of unknown function DUF1592 
Protein accessionYP_003267712 
Protein GI262196503 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG5297] Cellobiohydrolase A (1,4-beta-cellobiosidase A) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.0474144 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACTGGAC ATGTGTCCCA AAAACAAACT GATTTCGCGA CCGCTGCGCC CGCGCGCGGC 
GGCCTGCGCT TCCCCTCTCT GACCATGACG CTGGCGCTGG GCCTGGGCCT GAGCGCTGCG
GCGCTGAGCG GCTGCGACGG CGCCGCGGAC GACGGCCCGG GCGCTGCGCC CGAGAGCCCG
TGCCCGCCCG ATCAGGAGTA TTTCGTCGAG AACATCTGGA CGCCGATCGT GTCGCTCTCG
TGCATCTCGT GCCACAGCGA CACCGGCGCG GCCAAGAGCT CGCGCCTGGT GCTGCGCGAG
CCCACCGAGC CCGACTTCCT CAGCGAGAAC TTCGCCATCA TGCGCGCGCT CGCGGCCGAG
GAGGAGGGCG GCACCTCGAT CCTGCTGTCG CGGCCCAGCG GCCGCCACCC GAGCGGCCAC
CCCGGCGGCG TGCTGTTCGA CATCGGCACC ATCGACTACG AGGCCATGGC CGCGTTCGTC
GGACGCGTGG TCGGTGACCC CGAGGCGTGC GAGAGCGCGC TCGATAGCTG CGAGAGCGGC
ACGCCCGGCC CGCGCATGCT GCGCCGGCTG AGCCGCTCGG AGTACGATCG CACCATCGTC
GATCTGCTCG GCATCGAGGG CACATACGGC AAGAGCTTCG CCGCCGATAC CGTGGTCAAC
GGTTTTGACA ACAACGCCGC GGCCTTGACC GTGACCCCGC TGCTCGCCGA CCAGGTGCGC
AAGGCGGCCG AGTCGCTGGC CGCCGAGGCC ATGGCCAACC CGGGCGCGAT CGTGCCCTGC
GCGGCCAGCG AGGGCCGCGC GTGCGCGCGC AGCTTCCTCG AGAGTTTCGG CGAGCGCGCG
TTCCGCCGGC CGCTGAGCGA GGACGAGATC ACAGGCTATC TCGGCATCTA CGATCTCGGC
GACGAGGACG GCGACGGCGC GGTGTCCGAG GGCGAGTTCC CGGGCGCCAT GGAGGTGGTG
CTGTCGGCGC TCTTGCAGTC GCCGAGCTTC CTCTACCGCC CCGAGCTGGG CACGAGCGTG
GGCGACGGCG CCTACGCGCT GTCGTCATAC GAGATCGCGT CTGAGCTGTC GTACTTCCTG
TGGGGCTCGA TGCCCGATGA GGAGCTGCTG GCGGCCGCGC GCGAGGACGC GCTGCGCGAT
CCGGCCGAGA TCGAGAGCCA GGCCCGGCGC ATGCTGGCCT CGCCCAAGGC GCGCTTTGCC
ATCGATCGCT TCACCGAGCA GTGGCTGGGC ATCGATCAGC TCGCCACGGT GCCCAAGGAC
ACCATGCTGT TCCCCGAGCT CACGCCCGAG CTGCGCCAGT CGATGCTGGT CGAGGCGCAG
AGCCTGGTGG CCGATATCAT CGCCGAGGGC GGCTCGCTGG GCGAGCTCTT GCGCGCCTCG
CACACCTTCC TCGACCAGCG CCTGGCCGAC TTCTACGGTC TGCCGGCGCC CGCCGAGGCC
GGCGCGGGCG GTTTCGGACG CGTGGACCTG GGCGGCAGCG AGCGCGGTGG TCTGCTCACC
CTGGGCGCGA TCCTCACCAC CCACGCGCGC TCCAACGGCA CCTCGCCCAT CCACCGCGGC
AAGCTGGTGC GCGAGCGCCT GCTGTGCCAG CACCTGCCGC CGCCGCCGCC GGGCGTCAAC
GCCGAGCCGC CGGCCCTCGA CCCCGGCCTG ACCACGCGCG AGCGCTACCG CCAGCACTCG
GTGGACGAGG CCTGCGCCGG CTGTCACGAG CTCATGGACC CCATCGGCTT TGGCTTCGAG
CACTTCGACG GCATCGGCCG CTTCCGGGCC GACGAGGGCG GGCTGGCCAT CGACGCCAGC
GGCTACGTGA GCGGCGTGGG CGAGAAGAAT CTGGAATTTG ACGGCGTGGA CGATCTCGCC
GCGCAGCTCG CCGGCAGCCC CGAGGCGCAC GCCTGCTTTG CCCTGCAGTG GACGCGCTTT
GCCTACGGCG TGCGCGAGAA CAGCCAGCTC TCGTGCCTGG TCGACGACGT CGCCGCGCAG
CTCACGCCCG ACACCCGGCT CGACGACTTC ATCGTGTCGC TGGCGCTCAG CTCGCACTTC
ACCGCCCGCG TCGGCAAGGA CGCGGCGCCC GGCGAGGAGC CCGGCGATGG TGGCGAGGAC
CCGGGCGACG GCGGCGAGGA CCCGGGCGAC GGCGGCGAGG ACCCGGGCGA CGGCGGCGAG
GACCCGGGCG ACGGCGGCAG CTCTGACGAT CTTGGCGTCG CCGTGGTCAC CGACTCGATG
TGGGCGACCG GCGCGTGCTA CTCGGTCACC GTGACCAACG AGAGCGACGC CGAGCTCGAC
TGGCAGATCA CGCTGAGCGT GGCCGGCGAG ATCAACAACC ACTGGAACGC CACGCTCACG
CAGACCGGCA ATCAGGCGCA GTTTGGCGGC GTGGACTTCA ACGACCGCAT CGCGCCCGGC
GCCACGGCCT CGTTCGGCTT CTGCGAGGCC TTCTGA
 
Protein sequence
MTGHVSQKQT DFATAAPARG GLRFPSLTMT LALGLGLSAA ALSGCDGAAD DGPGAAPESP 
CPPDQEYFVE NIWTPIVSLS CISCHSDTGA AKSSRLVLRE PTEPDFLSEN FAIMRALAAE
EEGGTSILLS RPSGRHPSGH PGGVLFDIGT IDYEAMAAFV GRVVGDPEAC ESALDSCESG
TPGPRMLRRL SRSEYDRTIV DLLGIEGTYG KSFAADTVVN GFDNNAAALT VTPLLADQVR
KAAESLAAEA MANPGAIVPC AASEGRACAR SFLESFGERA FRRPLSEDEI TGYLGIYDLG
DEDGDGAVSE GEFPGAMEVV LSALLQSPSF LYRPELGTSV GDGAYALSSY EIASELSYFL
WGSMPDEELL AAAREDALRD PAEIESQARR MLASPKARFA IDRFTEQWLG IDQLATVPKD
TMLFPELTPE LRQSMLVEAQ SLVADIIAEG GSLGELLRAS HTFLDQRLAD FYGLPAPAEA
GAGGFGRVDL GGSERGGLLT LGAILTTHAR SNGTSPIHRG KLVRERLLCQ HLPPPPPGVN
AEPPALDPGL TTRERYRQHS VDEACAGCHE LMDPIGFGFE HFDGIGRFRA DEGGLAIDAS
GYVSGVGEKN LEFDGVDDLA AQLAGSPEAH ACFALQWTRF AYGVRENSQL SCLVDDVAAQ
LTPDTRLDDF IVSLALSSHF TARVGKDAAP GEEPGDGGED PGDGGEDPGD GGEDPGDGGE
DPGDGGSSDD LGVAVVTDSM WATGACYSVT VTNESDAELD WQITLSVAGE INNHWNATLT
QTGNQAQFGG VDFNDRIAPG ATASFGFCEA F