Gene Hoch_1040 details

Gene Information

Locus tag: Hoch_1040
Symbol:
ID: 8543422
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 1327385
End bp: 1330087
Gene Length: 2703 bp
Protein Length: 900 aa
Translation table: 11
GC content: 54%
IMG OID: 646385792
Product: protein of unknown function DUF450
Protein accession: YP_003265527
Protein GI: 262194318
COG category:
COG ID:
TIGRFAM ID:
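
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are made up for this example.

```python
# Values copied from the Gene Information fields above.
start_bp = 1327385
end_bp = 1330087
gene_length_bp = 2703
protein_length_aa = 900

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count = span_bp // 3
expected_protein_aa = codon_count - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("span is a whole number of codons:", span_bp % 3 == 0)
print("protein length is consistent:", expected_protein_aa == protein_length_aa)
```

Under these assumptions all three checks agree with the listed values.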


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0444477
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 44
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCATGAAA TTGACCAGGG GCACGAAGCG CTCAACGGCA TTACCCGAAC CTGGGGCCAA 
CTCAGTCTCG CTCAGGCGAA CGAGGCCGAA ACCCGCCTCA AAGTAATCGA CGAGGTTCTG
TTTAATGTAC TCGGTTGGTC AAAAGACGAT GTGAGCGTCG AAGAGAGGGT GTCGGAAGAT
GGGGAAACTA CTTTTGCGGA CTACATAATA CGTACAGCTA CGGTGCAGGT TCTTGTTGAG
GCGAAGAAAG TTGGAGTCGC CTTCGAACTG CCGACGACGC GAAAGGCGCT GCGCCTGGGT
GGGGTGCTGA GCGAAGGCGA AGTCGGCGAT GCTATAAGGC AAGCTCGTGA CTATTGCCGC
AAAAAATCGA TTCCTTTTGC CGCCGTAACC AACGGCAATA CTTGGGTCAT CTTTCCCGCC
GTTCGCACAG ATGGAATAGA GTTTGAGAAG AGTGACGCTC ACACATTCAG TTCACTCGCA
ATTGTCTCAG AACGATTCGT GCAATTCTGG GAATTGCTCT CAAGGCAGCG GGTCCTAGAA
GGGAGTCTCA CTAGTGAGTT ACTCGGCAGC GAGGACCAGA ACCGCCTTCG TCGCTGCGTT
CGAGAACTCC ATAGGGAGCC GGGCTTCCGG CTCGGGCGAA ACGCCCTTCA CAGCTATCTC
GAGCCGGCAA TTCATCAGGC GTTGACTGAT GAAGCAATCC TCAGAGACCG TGATGCACTT
GAAGCCTGCT ACGTTAAGAC TAGCAATCGC GTCAAATACG ACACGCGTCT GCGTGTATTT
TTCGGACACG GGAGGGCACC ACTGGGTCAC GCGCCAACGC GGCTAGGCAA CAAAGGCCGG
GGAAAATTCA CAACCGCTGT CACGAAGACT GTAACTGATA CCCCCCCTAG ATTCGTCGTT
CTTCTCGGAA AGGTAGGGGC CGGCAAGACC ACATTTCTTC ATTACACCCG GCTAGTGTCT
GCCGTGGACG CCATTGAAGG AAAAGTGTTG TGGCTTTATA TTGATTTCAA GGCTGCGACC
AAGGCAGACC GTCCGAGAGA TTTCATCTAT CGTATGTTGC TTGAGCTTAT AGAGAGCGAC
GAGCAGTTCG AACTTGGTGA CTGGGAGCAC TCCGTGCGCC CCGCTTACCG CGATGAAATC
GAGAAGCTAA AACGGGGTCC GCTGAATCCA CTGTTTCGAC AGAAACCAGA TGAGTTTGAG
CTCAAAATTG CAGAGCAAAT CACGCGAGAG CGCGACGAGG GAAAACCCTA CGTGGACCGA
GTCCTTCGAA ATGCAGCTTC GCGTTTGCCA GGTTTCCTCA TCCTCGATAA TCTTGACCAG
ATCGAGGACG ATGACTTCCA GGGCCAAGTT TTCCTGGAAG CCCAGGCACT TGCTCGTATT
GTTGGCTTTA ACGTTATCGT TTCGATGCGA GAGTCAACTT ATCTGCGCCA CAAAGAAAGC
CCTGCATTCG ATGCCTTTCA ATTTGATTCC TTCTACCTGG ATGCGCCGAG CATTCTCCCC
GTGCTGTCGC ACAGATTGGC TTACGCCAAG CGTTTGCTGA GCGGTCCTGC CAAGATCCAG
ACAGAAAAAG GCATGACGGT TTCCGTTGAC GATCTTGGCG TGTTTTTCGA GATTGTTTCG
TCGTCGTTAC TCTCGGACGA GAGCGGGCAG CTACTCGAGT CTTTAGCAGG TGGCAATGTG
CGTCGTGGCC TCGAACTCGT ACGAGACTTT CTAGCCAGCG GGCACACGAA CGCCGACCGA
GCCTTGATGG CCTATGTGAA GAGGGGGGGC TACGTATTTC CAACACACGA GGTTCTTCGC
GGTTGCATCC TCGGGCCCCA GAAATACTTC GATGAGCGTT ACTCAAACGT TCCAAATATC
TACGATGCGA AGACCGGTAA CCGCGCGTCG CAGATGCTGA GGCTACGGAT CGTTCAACTC
CTCGTAGAAC ATGCCTCTCT TCCAGGGTTC GAAGGTGTCG CTACGACGGA AATCGAAAAC
GCGGCAAGCC AACTCGGGGT GGCACATCAA CTTGTCCGCA ACTGTCTTTG CTGGCTGGTT
GAAAAGGGTG TGATTCAGAC ATCTGATGGT CTATCGCCGT CGGATCAAAA CAGCCTTCTG
CCAACGCGAT TTGGCGCGTT TCTCCTGAGG GTATTATGTA AGCAGTTTGC GTATACCGAA
TTTTGTACAT TCGATTCCAT TATATATGAC GACGATGCTT GGCAAGATCT CAAGGACCTT
ACCTCTGAGA TCGAGAACGA GGGAGATATC GTAGCGCGAG TAGAATTACG AGCGGAGCGA
GCCGACCGCT TTCTCGAGTA CTGCGAACGC ACGGAAGAAC TGTTGACCGT GGAAGCTAGG
CGGCGAGCGC TGCCCGAGGA CTACGCGCAG CAGCTTGTGC CAGATCTCCG GAAGAAAGTC
GGCGACGATG CCGTCAAGGC GCTGAGTTCA GCTAGGAGGC GATATGGTTC TCTCGACAAT
GATGCCGTAG GCAGTCGAGC AAGGCCGATC GCCTCCGCTT GCAACACGGG AAAAATTGCT
AACAGCTGGT TGGACCGCGA CTATGTCTTT ATAACGGACG AGTCTGGTCG AGACTGGTTC
AGCCATCGGT CGGACTTTCT TAGCCAGGAC GAGTGGAACA AGCGGCAGAC AGGTAAGCCA
TGTGAGTTCC TCGCTGGCGA GTGGCGCGGG AAGCCCCGTG CTACTCAAGT AAAGGTGTTC
TAG
 
Protein sequence
MHEIDQGHEA LNGITRTWGQ LSLAQANEAE TRLKVIDEVL FNVLGWSKDD VSVEERVSED 
GETTFADYII RTATVQVLVE AKKVGVAFEL PTTRKALRLG GVLSEGEVGD AIRQARDYCR
KKSIPFAAVT NGNTWVIFPA VRTDGIEFEK SDAHTFSSLA IVSERFVQFW ELLSRQRVLE
GSLTSELLGS EDQNRLRRCV RELHREPGFR LGRNALHSYL EPAIHQALTD EAILRDRDAL
EACYVKTSNR VKYDTRLRVF FGHGRAPLGH APTRLGNKGR GKFTTAVTKT VTDTPPRFVV
LLGKVGAGKT TFLHYTRLVS AVDAIEGKVL WLYIDFKAAT KADRPRDFIY RMLLELIESD
EQFELGDWEH SVRPAYRDEI EKLKRGPLNP LFRQKPDEFE LKIAEQITRE RDEGKPYVDR
VLRNAASRLP GFLILDNLDQ IEDDDFQGQV FLEAQALARI VGFNVIVSMR ESTYLRHKES
PAFDAFQFDS FYLDAPSILP VLSHRLAYAK RLLSGPAKIQ TEKGMTVSVD DLGVFFEIVS
SSLLSDESGQ LLESLAGGNV RRGLELVRDF LASGHTNADR ALMAYVKRGG YVFPTHEVLR
GCILGPQKYF DERYSNVPNI YDAKTGNRAS QMLRLRIVQL LVEHASLPGF EGVATTEIEN
AASQLGVAHQ LVRNCLCWLV EKGVIQTSDG LSPSDQNSLL PTRFGAFLLR VLCKQFAYTE
FCTFDSIIYD DDAWQDLKDL TSEIENEGDI VARVELRAER ADRFLEYCER TEELLTVEAR
RRALPEDYAQ QLVPDLRKKV GDDAVKALSS ARRRYGSLDN DAVGSRARPI ASACNTGKIA
NSWLDRDYVF ITDESGRDWF SHRSDFLSQD EWNKRQTGKP CEFLAGEWRG KPRATQVKVF
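
The sequences above are displayed in blocks of ten characters, so the spaces and line breaks need to be removed before any computation. As a minimal sketch, the Python below cleans a block-formatted nucleotide string and computes its GC fraction. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first displayed line of the gene sequence is used as input, so the result is not the GC content of the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence above, exactly as displayed.
first_line = "ATGCATGAAA TTGACCAGGG GCACGAAGCG CTCAACGGCA TTACCCGAAC CTGGGGCCAA"

seq = clean_sequence(first_line)
print(len(seq))                    # number of bases on the first line
print(round(gc_fraction(seq), 2))  # GC fraction of this line only
```

Passing the whole gene sequence through the same two functions would give a value that can be compared with the GC content field in the Gene Information section.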