Gene Hoch_5025 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_5025 
Symbol 
ID8547435 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp6933643 
End bp6935334 
Gene Length1692 bp 
Protein Length563 aa 
Translation table11 
GC content67% 
IMG OID646389701 
ProductPatatin 
Protein accessionYP_003269407 
Protein GI262198198 
COG category[R] General function prediction only 
COG ID[COG1752] Predicted esterase of the alpha-beta hydrolase superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.71929 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.155032 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGACAC GGAAAATCGC GCTGGTTCTG GCTGGCGGAG TGTCGCTCGG CAGCTACGAG 
GCCGGCGTCC TCGCCGAGCT GCTCTACGCT CTCGACTGGC TCAATCGCCC CGAAACCCTC
GACGGACGCG ACCCCTTCGA GCTCGACATC ATGACCGGCG CCTCGGCCGG TGGCATGACG
GCCGCCCTGG TGGCCCGCAT CATGATGTAC GACCTACCCG GGCGGCGCGA CCACCTGCGC
AGAGTATGGG TCGACGGCGT GGACATCGTC GGCTTACTCG ATCAACAAGA CGTGCCGCTC
AACGCGCTGC TGTCCAAACG CGTGCTCGCG CAGCTCGCGC ACGACTACGT GGTCAAGGGC
AGCGACGCGC CGCCCATCGC GCCGGCCTCC TTCGCGCCCG AGCGCCTGCA CCTGAGCCTA
ACGCTCAGCA ACATGCACGG CATCAGCTAC GAGATTCCGT ATTTCGCCTC TACTGACAGC
GAGAGCCAGT TCGTGACCAC GATGTTTTCG GACATGGCGA ACTTCACCCT CGAGCGCGCA
GACCTCCCGG GGCGGCGCGT CTGGGACACC ATCGTCCAGT CCGCGCTGGC CTGCGGCAGC
TTCCCCTTCG CGTTTCAGCC GCACCCGCTG CGGCGCAGCA GCAGCGACTA CCGCGGCTCG
ATGCAGGAGC ACGACACGGC GCTCTTCGAC CGCGACATGA TGTTCATCGA CGGCGGCATG
TTCAACAACG AGCCGCTGGG CGAAGCCATC GACACCGCCA GCGATGTCGA CGGCGGCACC
ATCGAACCCG ACCGCATTTT CCTGCTCGTC GATCCCAACA TCAACGCGTC GAACCACGTG
AGCGAGATCC TGCTCGACGA CAGCATCACC AAGCACGCCC TGCGCATGGG CCAGATGCTG
CTCGGCGAGA GCACGGCGCG CGAGTGGCTG CGCGCCAACC GCACCAACGT CGACATCGAG
TGGCGCGACC ATCTGGTGAG TCAGGTCGGC CGCATGCTGC GCGAGGCTGA GCTGGCCGAT
GCCGCCGCCA TGGCCGCGGC CATGAAAGCC CTGGCGTCCG AAATCGTCGC CCGGCGGCGC
GCGCTCCTGG GCGCGGACGA GGTGCCCGAT AGCTACCTCG AGAGCCGCAG CGCGCGCACG
ATGAAGGGCG AGCCATTCGC CGCGCTCTAC GCTTCGCTGG GCGCCTCCTC GGGCGAGGAT
GGCGGCGCGC AGCCGACGCA CAAACAGGAG CTGTTCCGCT ACCTGGTGTT CGTGCTCAAC
ACGGTCTCCA ACCTCAAGAA CAAGCAGAAG ATCCACCTGT CGCTGATCGG CGCCGACAAG
CGCCAGGTCG CCGGCGACAC GCTCAGCGGC TTCGGCGGCT TCTTCGAGCA TAGCTGGCGC
CTGCACGACT ACCGCGTGGG CCGGCGCAAC GCCCACGCGC TGTTGCCCAC GATGCTCGGT
GTGCCCGCCT ACGCCAAGGA GCCGGGCACG CATGAGGGCG CGCACGAGGA CTACCACCTC
CCGCCCGAGT GGGCCGACTA CCCCAACATC ACCATCGAAA AGGCCTCGCA CGCGCGCCGC
ATCGCGTTCT CCGAGGTGGT GCTGGCGCGG GCTGACACAG TGATGAGCGA GTTTGGCATA
CCCAAGCCGG TGCGCTGGGT GGCGCGCACG GTGTTCCTCA AGCGCTGGCT CGCGGGCAAG
CTGGGCCTGT AA
 
Protein sequence
MKTRKIALVL AGGVSLGSYE AGVLAELLYA LDWLNRPETL DGRDPFELDI MTGASAGGMT 
AALVARIMMY DLPGRRDHLR RVWVDGVDIV GLLDQQDVPL NALLSKRVLA QLAHDYVVKG
SDAPPIAPAS FAPERLHLSL TLSNMHGISY EIPYFASTDS ESQFVTTMFS DMANFTLERA
DLPGRRVWDT IVQSALACGS FPFAFQPHPL RRSSSDYRGS MQEHDTALFD RDMMFIDGGM
FNNEPLGEAI DTASDVDGGT IEPDRIFLLV DPNINASNHV SEILLDDSIT KHALRMGQML
LGESTAREWL RANRTNVDIE WRDHLVSQVG RMLREAELAD AAAMAAAMKA LASEIVARRR
ALLGADEVPD SYLESRSART MKGEPFAALY ASLGASSGED GGAQPTHKQE LFRYLVFVLN
TVSNLKNKQK IHLSLIGADK RQVAGDTLSG FGGFFEHSWR LHDYRVGRRN AHALLPTMLG
VPAYAKEPGT HEGAHEDYHL PPEWADYPNI TIEKASHARR IAFSEVVLAR ADTVMSEFGI
PKPVRWVART VFLKRWLAGK LGL