Gene Hoch_3647 details


Gene Information

Locus tag: Hoch_3647
Symbol:
ID: 8546037
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 5021949
End bp: 5023082
Gene Length: 1134 bp
Protein Length: 377 aa
Translation table: 11
GC content: 65%
IMG OID: 646388316
Product: protein of unknown function DUF362
Protein accession: YP_003268042
Protein GI: 262196833
COG category: [S] Function unknown
COG ID: [COG2006] Uncharacterized conserved protein
TIGRFAM ID:
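
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 5021949
end_bp = 5023082
gene_length_bp = 1134
protein_length_aa = 377

# Assumption: coordinates are 1-based and inclusive.
span = end_bp - start_bp + 1

# A coding sequence should be a whole number of codons.
codons, remainder = divmod(span, 3)

# Assumption: the last codon is a stop codon and is not translated.
expected_protein_length = codons - 1

print(span, remainder, expected_protein_length)
```

Under these assumptions the span matches the listed gene length, and the codon count minus the stop codon matches the listed protein length.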


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.01191
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGCGAG CCAAAGTCTC CGTCGTCCGC GTCTCTCCGG CCACGATTCT CGAGGATATC 
GACCGGCTGC ACGAACTAGC CGGTGTCGAG CAGGCGCTGG ACAGCGCCGC GTCCACGATC
CTCAAAGACA ACATCTCGTG GCACTTCCCC TTCCCGGGCG CCAACACCAC GCCGTGGCAG
CTCGAGGGGA CCATCCGGGC GCTGCGCAAG CGCGGCTACG ATGACCTGGT CTGTGTGCAG
AACAAGACCG TGGTCACCGA CGCCTTCAAG GGCGAAGATC TCAACGGCTA CGTGCCCATC
TTCGACGCCT ACGGGGTGCC CGTTCGCTAC AACTTCAAGC CCGAGGAGAT GAGCTGGTCG
GTGTATCGGC CGAAGGCCAA GATGAACGTC CTCGATACGA TCTTCCCCGA GGGCATCTAC
ATCCCCGACG CCTTTCACGG CACCAACGTG GTGCATCTGC CGACGGTGAA GTGCCACATC
TACACGACCA CGACCGGGGC GATGAAGAAC GCCTTCGGCG GTCTGCTCAA CACCAAGCGG
CACTACACGC ATTCGTGGAT CCACGCGACC CTGGTCGACC TGCTGGCCAT CCAGAAGGAG
ATCCACAGCG GCCTGTTCGC GGTCATGGAC GGCTCCACCG CCGGCGATGG CCCGGGACCG
CGGACCATGC GGCCGGTGGT CAAGAACGTC ATGTTGGCGT CCGATGATCA GGTCGCGATC
GACGCGGTGG CGGCGTCGAT GATGGGCTTC GACCCGATGA GCATCGAGTA CATCCGGCTG
GCGCACGAGC AGGGTCTGGG CAAGGGCAAG CGCGACGAGA TCGAGGTCGT CGGCGACACT
GCCCTGGCCG ACGAGCGCTG GGGCTTCTCC GTGGGCGACA ACGGCGCCTC GATGGTCGGC
GACATCATGT GGTTCGGGCC GCTCAAGCGG CTGCAGAAGC TGTTCTTCCA CACGCCGCTG
GTGAACCTCT TCGTCTTCGG CTCGGAGGCG TATCACGACT ACTACCGGTG GCCGCTGCGC
GACCGCAAGG TGTTCGAGGC CTGGTGCGGC GAGACCCAGT GGGGCCAGCT CTTCCAGCGC
TATCGCGCCA GCGGCACGCT CGCCGAAGAC CAGCGCGACA CCCGCAGCGC GTAG
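
The gene sequence above is printed in space-separated blocks of ten bases, so it needs to be cleaned before any computation such as GC content. The following is a minimal sketch; the helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first line of the sequence is used here as a short demonstration, so the printed percentage is not expected to equal the whole-gene value listed in the record.

```python
def clean_sequence(text: str) -> str:
    """Drop the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, used only for demonstration.
first_line = "ATGTCGCGAG CCAAAGTCTC CGTCGTCCGC GTCTCTCCGG CCACGATTCT CGAGGATATC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full gene sequence through the same two helpers would give a figure to compare against the GC content field in the record.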
 
Protein sequence
MSRAKVSVVR VSPATILEDI DRLHELAGVE QALDSAASTI LKDNISWHFP FPGANTTPWQ 
LEGTIRALRK RGYDDLVCVQ NKTVVTDAFK GEDLNGYVPI FDAYGVPVRY NFKPEEMSWS
VYRPKAKMNV LDTIFPEGIY IPDAFHGTNV VHLPTVKCHI YTTTTGAMKN AFGGLLNTKR
HYTHSWIHAT LVDLLAIQKE IHSGLFAVMD GSTAGDGPGP RTMRPVVKNV MLASDDQVAI
DAVAASMMGF DPMSIEYIRL AHEQGLGKGK RDEIEVVGDT ALADERWGFS VGDNGASMVG
DIMWFGPLKR LQKLFFHTPL VNLFVFGSEA YHDYYRWPLR DRKVFEAWCG ETQWGQLFQR
YRASGTLAED QRDTRSA