Gene Hoch_3022 details

Gene Information

Locus tag: Hoch_3022
Symbol:
ID: 8545410
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 4179162
End bp: 4180454
Gene Length: 1293 bp
Protein Length: 430 aa
Translation table: 11
GC content: 71%
IMG OID: 646387694
Product: hypothetical protein
Protein accession: YP_003267422
Protein GI: 262196213
COG category:
COG ID:
TIGRFAM ID:
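
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; both assumptions are consistent with the listed values but are not stated in the record. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length_bp(4179162, 4180454)
print(length)                     # 1293
print(protein_length_aa(length))  # 430
```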


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.46728
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.519638
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAACTAC CCTCGCCATT CTGGAAGCAT CCCGACCTCC GTGTCTCTGG CCCTGTCGCC 
GGTGTTCTCT CTGGCCCCGG CCCCACGCGT ACCGGTTCGG CGCGCCGCCG CCCCCGGCTC
GCGCTCGCGT CCGTCTCGCT GGCGTCGCTG GTCGCGGTGG CGTGCGGGGA CAATCACGCC
CCCGCCACCC CCGACGCCAG CGTGCCCGAT GCGCTCCACC CGGTCGATGC AGCGCCCGAG
TGCGAGCAGA GCGTGGGCGC GTTCGTCGCC GAGCACGCGT GCTTGCACGC CATGCACGGT
CCCTTCGCCA CGCTCAGCGC CAGCCCACCA TCCCAGCCGC CGAGCGCCGA CGCGAGCCAG
CCGCACACCG CCTTTCAGGT CGCCTTGCCC GCGCTCGGCG GCGGCGACGG CTACCGGGGC
GTGCTGCGCT ACCTGCCGCG GCGGAGCGGC GAGTTCGCGT TCTTCACCGC CCCCGGCGTG
ACCGTGTCCA TCGCGGACGA CATGGGCCGG CCCACGCCGG CGCGGCTGGC TCACGATCTC
ACGGTCTGCG ATGAACTCCA GCGCGTCGAT ACCATTGTAC TCGACGCCGA GACCAGCTAC
TCGCTCACCC TCGCCAGCCC GGACCGTGCG GATACCGTCC TGGTGGTCGA AATGCTGGAC
GAATTCCTGC CCAGCGAAGC GTATGACTCA TTCTGTGAGC AGCCATCCGA TGCTGGCGTG
CCTGACGCAT CGGTCCCGCC CGCCGACGCC GCGCCCGCCG ACGCGCCACC TATCGACGCC
GCGCCCGTCG ACGCGACGCC GCCGGCGGAC GCCGACTGCA CGCCCAGGCT CGGCGCCGCC
GTCGTCGAGC ACACCTGCTT GCACGCGACC CACGGCCCCT TCGGCACCGT GACCGCGCAG
CCCGATCCGG CCACGGCGAC CGTGAACATC AACGGCTCGC ACACCTACTA CACGGTCGAG
CTGCTGCCCG CCGACGCGCT CTACCACGGC GTGGTGACCT ACCGGCCGAC CGCGACCGGT
CTCTACACCT TGTTCCTCGA TCCCGACCTC GCCGCGCAGG TGCAGCAGCC CGACGGCACG
CCGGTACCGA GCGTTCACGA AGAGATCGTC ACCACCTGCC CAGGTCTCAC GCGCGCGCTG
GTCGTCGATC TCGATATCGC CGTGCGCTAC CGGCTGGTGT TCGCAGCCAC CGCCGAACCC
GAGGCTCACG TCGCCGTGGA GTTTCTGGAC AGCTTCGCCC CGGGCGAGCG CTGGGACGAT
CCGTGCTCGG AGACAGACTG GGCGGACTTC TAA
 
Protein sequence
MELPSPFWKH PDLRVSGPVA GVLSGPGPTR TGSARRRPRL ALASVSLASL VAVACGDNHA 
PATPDASVPD ALHPVDAAPE CEQSVGAFVA EHACLHAMHG PFATLSASPP SQPPSADASQ
PHTAFQVALP ALGGGDGYRG VLRYLPRRSG EFAFFTAPGV TVSIADDMGR PTPARLAHDL
TVCDELQRVD TIVLDAETSY SLTLASPDRA DTVLVVEMLD EFLPSEAYDS FCEQPSDAGV
PDASVPPADA APADAPPIDA APVDATPPAD ADCTPRLGAA VVEHTCLHAT HGPFGTVTAQ
PDPATATVNI NGSHTYYTVE LLPADALYHG VVTYRPTATG LYTLFLDPDL AAQVQQPDGT
PVPSVHEEIV TTCPGLTRAL VVDLDIAVRY RLVFAATAEP EAHVAVEFLD SFAPGERWDD
PCSETDWADF
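
The sequences above are printed in blocks of ten characters, so they need the whitespace removed before use. As a minimal sketch, the following joins the blocks, computes the GC fraction, and translates the coding sequence. The codon-to-amino-acid assignments of translation table 11 are the same as those of the standard code (the tables differ in permitted start codons, which this sketch does not handle; that does not matter here because the gene starts with ATG). The helper names are hypothetical and not part of the source database.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (NCBI translation tables 1 and 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(blocked: str) -> str:
    """Remove the spaces and line breaks used for display."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First display line of the gene sequence from this record.
first_line = "ATGGAACTAC CCTCGCCATT CTGGAAGCAT CCCGACCTCC GTGTCTCTGG CCCTGTCGCC"
print(translate(clean(first_line)))  # MELPSPFWKHPDLRVSGPVA
```

Passing the full gene sequence through `clean` and `translate` should reproduce the listed protein sequence, and `gc_fraction` on the same string can be compared against the reported GC content.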