Gene Hoch_1661 details


Gene Information

Locus tag: Hoch_1661
Symbol:
ID: 8544043
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 2263799
End bp: 2265829
Gene Length: 2031 bp
Protein Length: 676 aa
Translation table: 11
GC content: 65%
IMG OID: 646386369
Product: hypothetical protein
Protein accession: YP_003266104
Protein GI: 262194895
COG category:
COG ID:
TIGRFAM ID:
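The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below assumes 1-based, inclusive coordinates and a CDS length that includes the stop codon; the record states neither, but these conventions reproduce the listed values. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2263799
END_BP = 2265829
GENE_LENGTH_BP = 2031
PROTEIN_LENGTH_AA = 676


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one, assuming the CDS includes its stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_length = span_length(START_BP, END_BP)
computed_protein = expected_protein_length(computed_length)
print(computed_length, computed_protein)  # prints "2031 676"
```

Under these assumptions, 2265829 - 2263799 + 1 = 2031 bp and 2031 / 3 - 1 = 676 aa, matching the Gene Length and Protein Length fields.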


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.773537
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.657918
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGACCC GAGCGAACTT ACTCACGCTA TTGCCGGCAG CCTTGGCTAT TCCTCTGCTG 
TCCGGACTTG CGGGACAGGC GCACGCCGAC CCGCTGCCCT TACTGGACTG GGAGACCACG
CGCTCGCCGA TGGCGGCGGG CTGTGAGGGA CAACCGGACA GCGCACCGGC GCCGCTCGAT
CCGCGCACCC TGGTGGTGCC GGGTGTCAAC GATCCCGGCG CAGCGGTGCA GTTCAACGCG
TTCTGGGTCG ATCTGCACGA TCCGCCGGCG CCGTATGTGA CCACGCTGGA GCCCAACCCG
AGCACCTGCG GTGAGTTCCG GGCCAGCGTC GAGCGCGGCC GCCAGAACAT GATGAACCGG
GCCTACTTCC AGCCCTTCAG CACCTCGCTC GGCTACTACG AGCTGTACAA GCTCTGGGGC
TATCTGTTCC GTCCTTCCGA CTTCGACGCG CAGGTGCGCA AGCGTTACGG CATGTTCGAG
GCGCCGTTTC GCAATCCCTA TCCCTATCCC TGGGAAGATC CCAATCGGAC CAACGGCGGC
AGCGGGCAAT TGCCGCTCGG CCTGGTGCAG GTCAAAGATG ACGACGGTAA TTACACCGGC
TTGATCTCGT CGAGCTGCTC GGGCTGTCAC GATTCGCGTC TGGGCACCAC CTCCGAGACC
TCGTTCCTGT GGGGCCGCTC CAACGACGCC AATGACGCCG GTTTGCTGCA ATCCGATCTG
TTTCGCGCCA ATGGGCCGCT GATCGTCTTT CAGTTGGCGC CCATCCCGTG GAGTACGGGC
CGCGGTTCGA CCGACGCCAT CGGTATCGTC GACCTGCTGC CGGCGCTGTT CGACATGGAC
TCGCTCGGCC TGGTGCCGAG CTTGATGGAG TGGTTCCCCA CGCACGCCGG CGGCATGGCC
AAGGCCCCGC ACTGGTGGCA TCGCTCGTTC AAGACCCGGC AGTTCTGGGA TGGCGCGCTG
ACCTCGGACA ACGTGCGCTC GGAGATGGCC TTCGGTATCG CTAACCTGTT CCGCAATGGT
CCTGAGCGGC GCGCGCTCAC GGCCGAGTTC GAGGACAACG ACAACTTCTT CCTGTCGCTG
TCGCCGCCGC CCTACGAGCA GGCTATCGAC ACCGACTTGG CCGAGGACGG CGCGGTGCTG
TTCCACGAGC GCGACCTCTG GGCCAACGGC GCCAACGCCG ACATTCCGCG CGCCGCCGGC
AACGGCTCGT GCGCGAGCTG CCACGGCGTG TACTCGCCGC GCTACGCGGC GGATCCCGCG
TATCTGCCCG ACCCGCGGCT CAAGGGCATC TCCGGCGTGA TCACGCCGAC CGAGACCATC
GGCACCGATC CGGCGCGCGT CGATCTGATG GCCGACGAGC GCAAGCGCCG CTCCTGGAAC
ACCTCGTTCC TGGCGTACAA CGACATGCAT CCCGACCATC CGGGCTTCTT CGATGACCCC
ATCACCAGCG CCCTGCGCCG CATCGGCCGC CGCGCCTACG ATATCGGCGA GGGCCCAGTG
TACTCGCCGA CCGGCCCCAA CGAGTGGATC GAGCCCTTCG GCTACATCGC GCCGCCGCTG
TACGGCGCCT GGGCCAGCGC GCCCTACTTC CACAACGGCA GCGTGCCCGA TATCTGGGGC
GTGCTGAAAC CGTCCGCGCG TCCGGATGTC TGGCGTCGGC AGTACACCTC GACCAACATC
TTCGGCACCA ACGCCGGCTA CGATCACAGC CTGTCGTCGT ATGACTTCGA CAAACTGGGC
TGGAAGTACA CCGAGATCCA GTGCCAGGAT AATCCCTTCA ACTTCATCTT CCTGCCGTGC
AGCGAGTACA TGGCGACCAT CGACATCGTG TTCGCCAACA TCGCCGAGAA GGTGGCCGAG
TTCAACTCGC TGGCCTACCA GTCGCCGCCG CCGATCACGC AGAAGCAGAT CCGCTCGCGC
ATGAACTACA ATTCCCATCT CTACGGCAAT GGCAACGAGG GCCACGAGTT CACGCAGTCG
CTCAGCGACG CCGAGCGCTA CGCGATCATG GAGTACCTCA AGACCCTGTA G
 
Protein sequence
MTTRANLLTL LPAALAIPLL SGLAGQAHAD PLPLLDWETT RSPMAAGCEG QPDSAPAPLD 
PRTLVVPGVN DPGAAVQFNA FWVDLHDPPA PYVTTLEPNP STCGEFRASV ERGRQNMMNR
AYFQPFSTSL GYYELYKLWG YLFRPSDFDA QVRKRYGMFE APFRNPYPYP WEDPNRTNGG
SGQLPLGLVQ VKDDDGNYTG LISSSCSGCH DSRLGTTSET SFLWGRSNDA NDAGLLQSDL
FRANGPLIVF QLAPIPWSTG RGSTDAIGIV DLLPALFDMD SLGLVPSLME WFPTHAGGMA
KAPHWWHRSF KTRQFWDGAL TSDNVRSEMA FGIANLFRNG PERRALTAEF EDNDNFFLSL
SPPPYEQAID TDLAEDGAVL FHERDLWANG ANADIPRAAG NGSCASCHGV YSPRYAADPA
YLPDPRLKGI SGVITPTETI GTDPARVDLM ADERKRRSWN TSFLAYNDMH PDHPGFFDDP
ITSALRRIGR RAYDIGEGPV YSPTGPNEWI EPFGYIAPPL YGAWASAPYF HNGSVPDIWG
VLKPSARPDV WRRQYTSTNI FGTNAGYDHS LSSYDFDKLG WKYTEIQCQD NPFNFIFLPC
SEYMATIDIV FANIAEKVAE FNSLAYQSPP PITQKQIRSR MNYNSHLYGN GNEGHEFTQS
LSDAERYAIM EYLKTL
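The sequences above are printed in groups of 10 characters, 60 per line, so they need cleaning before any analysis. The following is a minimal sketch of how that might be done, with basic sanity checks on the result. The helper names are hypothetical, and the start-codon set is an assumption: translation table 11 permits alternative starts such as GTG and TTG in addition to ATG, and only those three are checked here.

```python
START_CODONS = {"ATG", "GTG", "TTG"}  # assumed subset of table 11 start codons
STOP_CODONS = {"TAA", "TAG", "TGA"}   # stop codons in translation table 11


def clean_sequence(block: str) -> str:
    """Strip spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """Check length is a multiple of 3 with plausible start and stop codons."""
    return (
        len(seq) >= 6
        and len(seq) % 3 == 0
        and seq[:3] in START_CODONS
        and seq[-3:] in STOP_CODONS
    )


# First line of the gene sequence from this record, as printed.
first_line = "ATGACGACCC GAGCGAACTT ACTCACGCTA TTGCCGGCAG CCTTGGCTAT TCCTCTGCTG "
cleaned = clean_sequence(first_line)
print(len(cleaned), cleaned[:3])  # prints "60 ATG"
```

If the record is internally consistent, running `clean_sequence` over the full gene sequence block should give a 2031-character string that passes `looks_like_complete_cds`, with a `gc_fraction` close to the listed 65%.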