Gene Hoch_5112 details

Gene Information

Locus tag: Hoch_5112
Symbol:
ID: 8547523
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 7044958
End bp: 7046088
Gene Length: 1131 bp
Protein Length: 376 aa
Translation table: 11
GC content: 71%
IMG OID: 646389788
Product: hypothetical protein
Protein accession: YP_003269493
Protein GI: 262198284
COG category:
COG ID:
TIGRFAM ID:
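
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive positions and that Gene Length counts the stop codon, which the listed sequence (ending in TAG) appears to support. The function name `record_is_consistent` is hypothetical.

```python
def record_is_consistent(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Check that coordinates, gene length and protein length agree.

    Assumes 1-based inclusive coordinates and a gene length that
    includes the terminal stop codon (an assumption, see lead-in).
    """
    span = end_bp - start_bp + 1          # inclusive span on the replicon
    whole_codons = gene_length_bp % 3 == 0
    coding_codons = gene_length_bp // 3 - 1  # drop the stop codon
    return (
        span == gene_length_bp
        and whole_codons
        and coding_codons == protein_length_aa
    )


# Values copied from the Gene Information fields above.
print(record_is_consistent(7044958, 7046088, 1131, 376))  # True
```

With the values in this record, 7046088 - 7044958 + 1 = 1131 bp, and 1131 / 3 = 377 codons, one of which is the stop codon, leaving 376 amino acids.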


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.690635
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTGCCCT ACTCGCTCGC ACGTGCGAGC CGACACGCCT CAGCGCTCGC GCTCGCGCTG 
CTGGTGGGAT GCACCTTCGA CCCCGCCCCG CTCGATCCCG TCCGCGGGGT CGATGCCGGT
CCCGATCTGC CGCAATGCGG ACGGCCTTCC GATCTCTGTG AACACGGCCT GCGCCGCGCG
CTCGAGCTCG ACCGCAGCGG CGTCACCGAG ACCCTGCTCG AGGTGCCCGT GCTGGTGCGC
CTCGACCCCG AGCGCATCGA CTACGCCAAG CTGCGCGAGG ACGGCCGCGA CCTGCGCTTT
CGCTGGGGCG AGGAGCAGCG CGATCTGGCC TATGACATCG AGCGCTGGTC GCCCGGCGGC
AGCTCGCTGA TCTGGGTGCG TGTGCCCGAG GTGGCGGCGG CCGAGGCCGA GGCCACGCCG
CTGTGGATGT ACTACGGCAG CCCCGAGGCC GAGGCGGCCG ACGCCCACCC CAGCGCGGTG
TGGAAGCCGC AGTACCGCAG CGTCCATCAC CTGGGCGCCG ACCTCAAAGA CGCCAGCCTC
AGCGGCCACA ACGGCCACAG CCCGTCGCCG CCGCTCGAGG TCGAGGGCCA GCTCGGCGGC
GCCCGCGCCT TCGACGGCGA GAGCACGGTC ATCGTCCTGC CCAACGAGAC CGGCTACGAC
TTCGCGACCA CCATGAGCCT GTCACTGTGG ATGCGCTCGG CCGCGGCCGC GCATCCCTTC
GAGACCATCA TCGCCAAGGG CGACAGCGCC TGGCACCTGC GCCGCGACGC CAGCCAGCAG
CACATCGAAT TCCGCACCAC CTCGCTGGGC CGCGACAGCA CCAAGGTCGG CACGGTGACG
GTCAATGACG GCGCCTGGCA CCACGTCTTC CTGGTGCTCG ACGGCGAGCG CAAGCTGCTG
TACATCGACG GCGAGCTCGA CACCGCCGGC GACTACAGCG GCCGGCTCGA CAACACCGCC
GATCTCGTCC GCGTGGGCGA AAACAGCACC GTGCCGGGGC AATCCTACCG CGGCGAGCTC
GACGAGCTGC GCCTGTCCGA GGCTCCGCGC TCGGCCTCGT GGGTGCGCTT CCAGTACCGG
GCCGGCAGCG GCGCGGGCGT GGTCGCCTTC GGCCCCGAAG AGTCACTGTA G
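
A figure such as the GC content reported in the Gene Information fields could be recomputed from the gene sequence above. The sketch below is illustrative only and is not necessarily the method the source database uses. The function name `gc_content` is hypothetical, and the function strips the spaces and line breaks of the 10-base block layout before counting.

```python
def gc_content(dna):
    """Return the percentage of G and C bases in a DNA string.

    Whitespace (the 10-base block layout used above) is ignored.
    """
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First 10-base block of the gene sequence, as a small worked case.
print(gc_content("ATGTTGCCCT"))  # 50.0
```

Passing the full gene sequence as one string (blocks and line breaks included) would give the overall percentage for comparison with the reported value.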
 
Protein sequence
MLPYSLARAS RHASALALAL LVGCTFDPAP LDPVRGVDAG PDLPQCGRPS DLCEHGLRRA 
LELDRSGVTE TLLEVPVLVR LDPERIDYAK LREDGRDLRF RWGEEQRDLA YDIERWSPGG
SSLIWVRVPE VAAAEAEATP LWMYYGSPEA EAADAHPSAV WKPQYRSVHH LGADLKDASL
SGHNGHSPSP PLEVEGQLGG ARAFDGESTV IVLPNETGYD FATTMSLSLW MRSAAAAHPF
ETIIAKGDSA WHLRRDASQQ HIEFRTTSLG RDSTKVGTVT VNDGAWHHVF LVLDGERKLL
YIDGELDTAG DYSGRLDNTA DLVRVGENST VPGQSYRGEL DELRLSEAPR SASWVRFQYR
AGSGAGVVAF GPEESL
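
The relationship between the gene sequence and the protein sequence can be sketched with a small codon-by-codon translation. This is a minimal illustration, not the database's own pipeline. It builds the codon table from the NCBI base order TCAG, using the amino acid assignments that translation table 11 shares with the standard code. It does not apply table 11's special handling of alternative start codons, which does not matter here because the gene starts with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# NCBI layout: codons enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace is ignored; a trailing partial codon is not translated.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence above (60 bases, 20 codons).
first_row = "ATGTTGCCCT ACTCGCTCGC ACGTGCGAGC CGACACGCCT CAGCGCTCGC GCTCGCGCTG"
print(translate(first_row))  # MLPYSLARASRHASALALAL
```

The output matches the first 20 residues of the protein sequence listed above.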