Gene Hoch_3310 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_3310 
Symbol 
ID8545698 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp4567386 
End bp4568792 
Gene Length1407 bp 
Protein Length468 aa 
Translation table11 
GC content75% 
IMG OID646387977 
Producthypothetical protein 
Protein accessionYP_003267705 
Protein GI262196496 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.480473 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.347566 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTATGGTG CCGCCGTGGA TCGCGTGATC GCTCAGTACC TGGCGCGCTA TGCCGCGCCC 
GAGGCCCAGG TGGCCACCCG GCTCGCGCGC GATTACGCCC AGGTCTGCGT GGTCCCGGCG
CTCGACGAGT CGCCGGCGCT GCTCGAGCGG CTCGGCAGCC CGCGCAGCGA TGGGCCGTGG
CTGCTCATCG TCGTGGTCAA CGCCAGCGAT GAAGTCGCCG AGCCGGCGCA CGCGCGCAAC
GCCGCGCTGC TGCGCGCGCT GGCCGAGTGC GATGACGCTC GCGCGCCCGC GCGGCTCAGC
CAATCGCCGC CGATGACCCT GGCGCGCCTC GGCCACGGCG ACGTGCTGTG CGTCGACCGC
GCCAGCGCCG GACAGCGTCT GCCGGCCGGT CAAGGCGTGG GTCTGGCCCG GCGCATCGGC
GGCGATATCG CCCTGGCCGC GTTTGCGGCG GGCCGGCTGC GCTCGCGCTG GATTCACATG
AGCGACGCCG ACGTGGCCCT GCCCGACGAC TACTTCGCGG CCGCCGAGCG GGCGCCAGCG
GCCGCGCGCG CGCTGGTCTA CCCGTTCTGG CACGAGGACA GCGGCGAGCG CGAGGTGGAT
CTCGCCACCG GACTGTACGA GCTGTATCTG CGCTATCACC GCCTGGGTCT GCGCTGGGCC
GGCTCGGCCT ACGCCGTGCA CACGGTGGGC AGCACCATCG CCGTGGACGC CCGCGCCTAC
GCCCAGGTGC GCGGCGTGCC CAATCGCCAG GCCGGCGAGG ACTTCTATAT GATGAGTAAG
CTGGTCAAGC TCGGCCCGGT GCACGAGCCC GCGTGCGCGC CGCTGCGCAT CCGCGCGCGC
CGCTCGCAGC GCGTCCCCTT CGGCACCGGC GCGGCCACCA CCGAGATCGT CGCCGCGCGC
GCGGCCGGAC GCGCCTACGC CGTCTACGAC CCGCGCGTCT ACACCCTGCT CGGCGCCTGG
CTCGACGCCC TCGCCGCGCT CGCCGATGGC GATCCCGCCG CCGACATCTC CGCCGAACAG
GCCTGGGGCG ACGCCCTGGC CAGCGCCGCG CGCGCGCGCT CGCTCGGCCC CGGCGAACGC
GACGCCCTCG ATCGCGCCCT GCACGCGCTC GCCGCCCCGG CCGCGATCGC CGAAGCCCGG
GCGCGCACGC GCTCGCCGCG CGCGCGGCGC AAACGCCTCG ACGACTGGTT CGATGCGCTG
CGCACGCTCA AACTCATCCA CGCGCTGCGC GACATGCTCC TGCCCTCGCT CCCGTGGCAC
CAGGCGCTGA CGCGCGCGCC CTTCCTCGCG CTCGACCACG CCGCCAGCGC GTCGCTAGCG
GCGCAGACAC CGACGCCCAC GATCGCCGAG CTGCGCGCGC TGCGCCGGCT GCTGCTGCAA
CACGAAGGCG CGCCCCGTCC TCCGTGA
 
Protein sequence
MYGAAVDRVI AQYLARYAAP EAQVATRLAR DYAQVCVVPA LDESPALLER LGSPRSDGPW 
LLIVVVNASD EVAEPAHARN AALLRALAEC DDARAPARLS QSPPMTLARL GHGDVLCVDR
ASAGQRLPAG QGVGLARRIG GDIALAAFAA GRLRSRWIHM SDADVALPDD YFAAAERAPA
AARALVYPFW HEDSGEREVD LATGLYELYL RYHRLGLRWA GSAYAVHTVG STIAVDARAY
AQVRGVPNRQ AGEDFYMMSK LVKLGPVHEP ACAPLRIRAR RSQRVPFGTG AATTEIVAAR
AAGRAYAVYD PRVYTLLGAW LDALAALADG DPAADISAEQ AWGDALASAA RARSLGPGER
DALDRALHAL AAPAAIAEAR ARTRSPRARR KRLDDWFDAL RTLKLIHALR DMLLPSLPWH
QALTRAPFLA LDHAASASLA AQTPTPTIAE LRALRRLLLQ HEGAPRPP