Gene Hoch_3020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_3020 
Symbol 
ID8545408 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp4176617 
End bp4178248 
Gene Length1632 bp 
Protein Length543 aa 
Translation table11 
GC content69% 
IMG OID646387692 
Producthypothetical protein 
Protein accessionYP_003267420 
Protein GI262196211 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0806492 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.54712 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACGAT TGCGACAGGT ATCGGCCTTC CTGGGGCTGA GCGCCATCTT CCTGGGCGGC 
TGCACGCTGC CGGCGGACGA CGCTATTGCG CGCGATGAGG CGCCGCTCGC CGCCGACTGC
ACGGCGCTGG GCGCGAGCAT CATCAGCCAC GTCTGTTTCC ACGCCAACTA CGGGCCGTTC
CAGAGCGTCA CGGGCAGCTC GAGCACCAGC TTCAGCGGCA GCTCGCCCAA CGTCAACGCG
ACCCACACGC ACCACACTGT GTCGCTGCCG GGCTCCCTCG GCAGCAATCA GGGCACGGTG
AAGTTCACGC CCGGCTCGAG CGGTGACTGG GCCATCTTCC TCGAGCCCGA TGTATCGCTC
CAGGTGCTCG ACGCCAGCGG GTCGGCGCTC TCGGTGCTGC TCACGCACAC CATCCCCTCG
GGCGACTGCA GCGAGCTCTC GCGCGTGGCC GTGGTCAACA TGAGCGCGAG CAACACCTAT
CGCCTGGTGC TCGGGCCCTC GTCTTCGGTG TCGAGCGTGG GTGTCGTTCT CGAACGAGTG
GACGATTTCA ACGCCTTCTA CTTCGAAGAC GCCGATGGCG ATGGCTACGG TGACACCGAC
GAGCTGTTGC TGACCGCGTG CGAGCCACCG GCCGACTACG TCAGCGACGA CACTGACTGC
GACGACAGCG ACGACGAGGT CTACCCGGGC GCGGCCGAGG TCTGCGACGG TCTCGACAAC
GACTGCAACG GCTCGGTCGA CGACGGCATC GGCACCTGCA GCGCGGGCGC GCGCTCGGCG
CACGACGCAG CCGCGGCCGC GGTCGCGCAC TCGGACGCCA GCGGCGACCA GCAGCTATCC
TGCTTCTGCG ACTGCGAGGG CGATCCGGAT AGCTGCAGCG TGTATTACCG CGACCGCGAC
CCCGCCATCC ACCCCGAGGC CGAGGAGATC TGCGACGGCG TGGACAACGA CTGCGACGGC
ACGCTCGACA TTCTGCCGGA CCCGCTGGAC GAGGTCATCG AGCATAGCTG CGACCACGCC
GAGCTCGGCC CCTTCGTCCA GGTCAGCGCC AGCGCGGTCG GCGCCGCCAG CTCGCCCAAT
GTCAACGCAG CGCACACGGG CTACGTCATC AGCTTGCCGT CGGCGAGCGG CGGGTTCGCG
GGCCAGGTTC GCTACCGCCC GGTGGAGAGC ACCGATTACG CGCTGATGAT CGACCCCGGC
GTGTCCGTGA GCGTGTTCGA CGCCAGCGGT GTCGAGGTCG AGATCGAGCT GCAGCGCGAC
GCCACCGCCT GCCCTGCCCT CACCCGGCTG GATCTCGTCG AGCTCGAGGA GCTGGTGAAC
TACCGGCTGG TGTTCGAGTC CGCGAGCGCG GCCGAGGTCC TGCTGGTCGT CGAGGAAGCC
GCCCACGAAC ACGGGGACGA CGAGGATGAG GACAGCGGCG CGCTCGAGTT CTTCGCCGAT
GAGGACGGCG ACGGTTTCGG CAGCCCGGAC GAGGTCGTCG ACGCCTGCGT CGCACCGCCC
GGCCACGTCG CCGATGACGG CGACTGCGAC GACGCGGACG CAAGCGTCCA CCCGGGGGCG
AGCGAGCTGT GCGACGGCAT CGACAACGAC TGCGACGGCG TCATCGACGC CTTCTGCGAA
TCGTCGCGAT GA
 
Protein sequence
MKRLRQVSAF LGLSAIFLGG CTLPADDAIA RDEAPLAADC TALGASIISH VCFHANYGPF 
QSVTGSSSTS FSGSSPNVNA THTHHTVSLP GSLGSNQGTV KFTPGSSGDW AIFLEPDVSL
QVLDASGSAL SVLLTHTIPS GDCSELSRVA VVNMSASNTY RLVLGPSSSV SSVGVVLERV
DDFNAFYFED ADGDGYGDTD ELLLTACEPP ADYVSDDTDC DDSDDEVYPG AAEVCDGLDN
DCNGSVDDGI GTCSAGARSA HDAAAAAVAH SDASGDQQLS CFCDCEGDPD SCSVYYRDRD
PAIHPEAEEI CDGVDNDCDG TLDILPDPLD EVIEHSCDHA ELGPFVQVSA SAVGAASSPN
VNAAHTGYVI SLPSASGGFA GQVRYRPVES TDYALMIDPG VSVSVFDASG VEVEIELQRD
ATACPALTRL DLVELEELVN YRLVFESASA AEVLLVVEEA AHEHGDDEDE DSGALEFFAD
EDGDGFGSPD EVVDACVAPP GHVADDGDCD DADASVHPGA SELCDGIDND CDGVIDAFCE
SSR