Gene Hoch_1406 details


Gene Information

Locus tag: Hoch_1406
Symbol:
ID: 8543788
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 1892179
End bp: 1893369
Gene Length: 1191 bp
Protein Length: 396 aa
Translation table: 11
GC content: 65%
IMG OID: 646386118
Product: hypothetical protein
Protein accession: YP_003265853
Protein GI: 262194644
COG category:
COG ID:
TIGRFAM ID:
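
The coordinates and lengths listed above can be cross-checked with a few lines of arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1892179
end_bp = 1893369

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of 3 bases each.
codon_count = gene_length // 3

# Assumption: the last codon is a stop codon and encodes no residue.
protein_length = codon_count - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the computed values agree with the listed Gene Length (1191 bp) and Protein Length (396 aa).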


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0332746
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGTCCT ACAGGTTAGT GCTTCTATGC CTCGTCTACG TGTCTATGCT GGCCGCATGC 
CAGTCGGGCG CAAACGAGGA GGTCGCCGCA GCGAACCCGG CGACTCACAC CGCGCTTGCC
GCGCACAGCG AAGCGGGAGC GGCAGACGGC GGCGATGGCG AGTGCGTGGC GGCGCTGAGC
GACGTGTCCG TTGCCGAGGA CGGCGAGTGC ACGGGCAGCG TGGGCGATTC CGCGTACTGC
GATAGCTGCG AATGCGGCTA TGGCGAGGGC GACTGCGACG GCGACTCTGA ATGTGCGCCC
GGGCTGTATT GCTACAAGGA CGTGGGCGCG GACTTCGGCT ATGCGTCCGA CGTCGACGTG
TGTCTGGGCG AGTGCGAGGA TTACAGTGGT CACTGGGAGG CATCCTGCAC CAGCGAGTGC
CCCTGCGAGG CCGGCGAGGG CGACTGCGAC ACCGACGCCG ACTGCGCGGA AGGCTACCGC
TGCATGCACG ATACCGGTGC GCTCTACGGC ATGGATCCGG AGCTCGATAC CTGCGAGCTT
CTGTGTTCTT CGGTCGGCCT GGGCACGACC ACGTTCTGCT CGTCGGAGTG CCTGTGTGAC
GCGGGCGAGG GCGATTGCGA CTCCGATTCC GACTGTCTGC CGGGGCTGCG CTGCGCGCGC
GATGTCGGCG CACGGTACGG GTTCGACCCG GAGACCGACG TCTGTGAGGG CATCTCGACC
TGCGATCTCA CCGACGGGGC CGGCGACGAA CCCGCGCTGG CCTACCCCTT GAGCTTCGAT
TACGTGAATT CCTGGGCCGC CGGGCTGCGC ATGGACAAAG GCTTTCTGTG CAGTGGCGAA
GACGATTGGT ACCGTTTTCC CAACTCGGCG GCAGGATTTA CCCCGGTATT TCTGGCGGTC
CGCGCCGAGG CCCGCGGCGC CGATTATTGC GGCCCGGGTT GCGATGGCCT GACCTTGCCC
GATGCGCCCG AGAACAGCAT CACCATCGAA GTCTACGACG AGTCCATGAA CACCTTGTTG
TCCACGCGCT CCGATACCGA TGGAACGCTT TTCTTCGATG TCTCGCTCGA GGGCATGGAG
TACACGGACG ACCTGCTTAT CCACGTGACC GGCGCCGCGG AAGCCACCTA TCCGTATGAG
CTGACCCTGT GGGTGCAGCC GTTCGACGGT GAAGACGAAT GCGAGTGTTG A
 
Protein sequence
MMSYRLVLLC LVYVSMLAAC QSGANEEVAA ANPATHTALA AHSEAGAADG GDGECVAALS 
DVSVAEDGEC TGSVGDSAYC DSCECGYGEG DCDGDSECAP GLYCYKDVGA DFGYASDVDV
CLGECEDYSG HWEASCTSEC PCEAGEGDCD TDADCAEGYR CMHDTGALYG MDPELDTCEL
LCSSVGLGTT TFCSSECLCD AGEGDCDSDS DCLPGLRCAR DVGARYGFDP ETDVCEGIST
CDLTDGAGDE PALAYPLSFD YVNSWAAGLR MDKGFLCSGE DDWYRFPNSA AGFTPVFLAV
RAEARGADYC GPGCDGLTLP DAPENSITIE VYDESMNTLL STRSDTDGTL FFDVSLEGME
YTDDLLIHVT GAAEATYPYE LTLWVQPFDG EDECEC
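
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It builds a codon table from the amino acid assignments of NCBI translation table 11 (the table named in this record), with codons in T, C, A, G order, and translates until the first stop codon. The function name `translate` and the two test fragments (the first 30 and last 21 bases of the gene sequence, with the spacing removed) are choices made for this example, not part of the record.

```python
from itertools import product

# Amino acid assignments for NCBI translation table 11,
# with codons enumerated in T, C, A, G order at each position.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)

# First 30 bases and last 21 bases of the gene sequence above.
head = translate("ATGATGTCCT ACAGGTTAGT GCTTCTATGC")
tail = translate("GAAGACGAAT GCGAGTGTTG A")
print(head)
print(tail)
```

The two fragments translate to the first ten and last six residues of the protein sequence listed above, with the trailing TGA read as the stop codon.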