Gene Hoch_3520 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_3520 
Symbol 
ID8545909 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp4853946 
End bp4855196 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content68% 
IMG OID646388188 
Productprotein of unknown function DUF214 
Protein accessionYP_003267915 
Protein GI262196706 
COG category[V] Defense mechanisms 
COG ID[COG0577] ABC-type antimicrobial peptide transport system, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0642417 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.207368 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTACAGCC TCGGCATCTT CAGTCTCGAC CGCTGGCACG AGATCTTCGA CACCATCCGG 
CGCAACAAGC TGCGCACCCT GCTCACGGCG CTGAGCGTGG CCTGGGGCAT CTTCATGCTG
GTGATCCTGC TGGCCGCCGG CACCGGCGTG CGCAAGAGCA CCGAGTACGA CTTCCGCGAC
GACGCCATCA ACAGCCTGTG GCTGTGGGCC GGGCAGACAT CGATTCCGTA CGAGGGCCAC
GCCGTCAATC GCAGCATCGA GTTCACCAAC GCCGACCTCG AGCGCCTGCG CGCGACCATC
CCCGAGGTCG AGTACCTCAC CGGCCGCTTC TACCTGCGCG GCGACCGGCT GGTGAGCCGC
GGCAGCAAGA GCTCGGCCTT CGACGTGCGG GCCTGCCACC CCGACCACCA GCACATCGAG
AAGACCATCA TCATCGCCGG CCGCTTTCTC GACGATCTCG ACATCGACGA GCGCCGCAAG
GTGGCCGTCA TCGGCATCGA GGTCGCCGAG TTCCTGTTCG CCGAGGGCGA GCAGCCGCTG
GGCCAGTGGA TCGCCATCAA CGGCATCCAG TACCGCGTGG TCGGCGTGTT CGAGGACGAG
GGTGGCGAGG GCGAGCTGCG CAAGATCTAC ATCCCCATCT CGACCGCGCA GATGGCCTAC
GGCGGCGCCG AGACCATTCA TCAGCTCATG TTCACGGTCG GCGACGCCAC GGCCGAGGAG
AGCCGGGCCA TCGAGGAGGC CGTGCGCGCC AGCCTGGCCG AGCGCCACCA CTTCGACCCC
GAGGACCAGC GCGCGCTGCG CATCCGCAAC AGCGTCGAGA ACTTCGAGCA GATCAGCGGC
ATCTTCCGCG CCATCGAGCT GTTCGTGTGG TTCATCGGCG CCGGCACCAT CGGCGCCGGC
ATCGTCGGCG TGAGCAATAT CATGCTCATC TCGGTCAAGG AGCGCACCAA GGAGATCGGC
GTGCGCAAGG CCCTGGGCGC CAGCTCGGGC GACATCATCG GCCAGATCCT GCAGGAGTCG
ATCTTCCTCA CCGCGGTCGC CGGCTACCTG GGTCTGCTCG CCGGCATCGG CCTGGTCGAG
CTGTTCCGCC GCTACGCGCC CGCGCTCGAC TCGCTGCGCG ACCCCGAGGT CGATCTCGGC
GTCGCGCTCG CGGCCACGCT CATCCTCATC GTCGCCGGCG GCATCGCCGG CTACTTCCCC
GCTCGCCGCG CCGCCCGGGT CGATCCGGTG GTGGCGCTGA GGGACGCCTG A
 
Protein sequence
MYSLGIFSLD RWHEIFDTIR RNKLRTLLTA LSVAWGIFML VILLAAGTGV RKSTEYDFRD 
DAINSLWLWA GQTSIPYEGH AVNRSIEFTN ADLERLRATI PEVEYLTGRF YLRGDRLVSR
GSKSSAFDVR ACHPDHQHIE KTIIIAGRFL DDLDIDERRK VAVIGIEVAE FLFAEGEQPL
GQWIAINGIQ YRVVGVFEDE GGEGELRKIY IPISTAQMAY GGAETIHQLM FTVGDATAEE
SRAIEEAVRA SLAERHHFDP EDQRALRIRN SVENFEQISG IFRAIELFVW FIGAGTIGAG
IVGVSNIMLI SVKERTKEIG VRKALGASSG DIIGQILQES IFLTAVAGYL GLLAGIGLVE
LFRRYAPALD SLRDPEVDLG VALAATLILI VAGGIAGYFP ARRAARVDPV VALRDA