Gene Hoch_3800 details


Gene Information

Locus tag: Hoch_3800
Symbol:
ID: 8546193
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 5218998
End bp: 5220062
Gene Length: 1065 bp
Protein Length: 354 aa
Translation table: 11
GC content: 71%
IMG OID: 646388470
Product: hypothetical protein
Protein accession: YP_003268193
Protein GI: 262196984
COG category:
COG ID:
TIGRFAM ID:
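
The start, end, and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 5218998
end_bp = 5220062

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)  # prints "1065 354"
```

Both values agree with the Gene Length and Protein Length fields.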


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00611211
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.136347
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTCAAGC ATGTTCGTGA GTCTTTGGTT TTCGGCCGCG TGGCCGATCT TCTCGTTTCC 
AATGGTTTTT CGTTCCCGGC TTCGGTGCCG GGTAGCCGCC GCGTCGCCGG CGCCGCGCTG
CTCTTCGGCG CCGCCGCGCT GCTCGGCGCC TGCGGCGACA ACGACAGCGT TGGCGCCGAC
CCCGACGCGG CCCCGCCGCC CATCGAAGTC GACGCCGCGC CCGTCGAGGT CGACGCCGGC
GTCCGCGAGC TGAGCATCGA GTTTGCCGCC CAGGTCGGCG ATCTCGCGGC CGCGTGTGGC
GCGGGCCCCT ACGCCGGCCT CGGCACCGAC GGCGCCGAGG TCTCGATCGC CGATCTGCGC
TTCTACGTGG CCAACCCGCG GCTGCTCCGC GACGGCGTCG AGGTGCCGAT CACGCTCGTG
CAAGATGAAG TCTGGCAGTA CCAGGACGTC GCCTTGCTCG ACTTCGAGGA CGGCACCGGC
GCGTGCTCGA CCAACGGCAA CAGCGCCATC AACACCCGCA TCGTCGGCAC GGTCCCCGAG
CCCGAAGACG GCGGCGGCGT CGGCGACGAC GAGTACCAGG GCCTGGTCTT CGACCTCGGC
GTGCCCTACG AGCTCAACCA CCTCGACGTC ACCGCCGTGC CTTCGCCGCT CAACGTCATG
GCCATGTACT GGGCCTGGGC CATCGGCCAC AAGTTCGTGC GCATCGACCT GGTTACCGGC
GACGGCGCCT GGAACATCCA CCTCGGCAGC CAGCAGTGCG ACAGTCCGGA GGCGCCCAGC
GCGCCGCCCG CGAGCGGCTG CGCCAAGCCC AACCGGCCGC GCATCGCGCT CGCCGACTTC
CGCCCCGGCG AGGACACCGT GGTGCTCGAC ATCGCCGCGC TGGTGGCGGG CTCGAACCTG
AGCGCCGACG CCGGCGCGCC CGGCTGCCAG TCCTTCCCCG GCGACGTCGA CGAGTGCACC
GAGCTGTTTC CCGCGTTCGG CCTCAGCTTC GAGAGCGGCG ACTGCGAGGA CGATTGCGCG
GGCCAGAGCG CGTTCTCGGT CGCCGCCGGG AGCTCGCTGC TGTGA
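
The GC content field can be recomputed from the gene sequence as the fraction of bases that are G or C. The sketch below is one way to do that for text laid out in space-separated 10-base blocks as above; `gc_fraction` is a hypothetical helper name, and the example call uses only the first block of the sequence rather than the whole gene.

```python
def gc_fraction(seq_text: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence.

    Whitespace (spaces between blocks, line breaks) is ignored.
    """
    seq = "".join(seq_text.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First 10-base block of the gene sequence shown above.
first_block = "ATGCTCAAGC"
print(gc_fraction(first_block))  # prints 0.5
```

Passing the full gene sequence text to the same function gives the value to compare with the GC content field.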
 
Protein sequence
MLKHVRESLV FGRVADLLVS NGFSFPASVP GSRRVAGAAL LFGAAALLGA CGDNDSVGAD 
PDAAPPPIEV DAAPVEVDAG VRELSIEFAA QVGDLAAACG AGPYAGLGTD GAEVSIADLR
FYVANPRLLR DGVEVPITLV QDEVWQYQDV ALLDFEDGTG ACSTNGNSAI NTRIVGTVPE
PEDGGGVGDD EYQGLVFDLG VPYELNHLDV TAVPSPLNVM AMYWAWAIGH KFVRIDLVTG
DGAWNIHLGS QQCDSPEAPS APPASGCAKP NRPRIALADF RPGEDTVVLD IAALVAGSNL
SADAGAPGCQ SFPGDVDECT ELFPAFGLSF ESGDCEDDCA GQSAFSVAAG SSLL
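
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this for the first 30 bases, assuming the record's translation table 11 assigns the same amino acids to codons as the standard genetic code (to my understanding the two differ in which codons may act as start codons, which does not matter here because the gene begins with ATG). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; "*" marks a stop codon.
# Assumption: amino acid assignments of table 11 match the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq_text: str) -> str:
    """Translate a nucleotide sequence, stopping at the first stop codon.

    Whitespace between sequence blocks is ignored; a trailing partial
    codon, if any, is dropped.
    """
    seq = "".join(seq_text.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence shown above.
print(translate("ATGCTCAAGC ATGTTCGTGA GTCTTTGGTT"))  # prints "MLKHVRESLV"
```

The output matches the first ten residues of the protein sequence listed above.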