Gene Hoch_0235 details


Gene Information

Locus tag: Hoch_0235
Symbol:
ID: 8542614
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 349739
End bp: 350959
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 69%
IMG OID: 646385031
Product: protein of unknown function UPF0027
Protein accession: YP_003264769
Protein GI: 262193560
COG category: [S] Function unknown
COG ID: [COG1690] Uncharacterized conserved protein
TIGRFAM ID:
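
The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes one stop codon. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 349739
END_BP = 350959
GENE_LENGTH_BP = 1221
PROTEIN_LENGTH_AA = 406


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_length_aa(gene_length_bp: int) -> int:
    """Amino acids encoded by a CDS, assuming its length includes one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks should agree with the table if the assumptions hold.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(coding_length_aa(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, the span 349739 to 350959 covers 1221 bp, and 1221 bp corresponds to 407 codons, one of which is the stop codon, leaving 406 amino acids.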


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAAGCCT ACGAAGTCTT TCCCACCAGC AACGTTCCGG TCAAAGCCTG GGTGCGCGGC 
GTGCCTGTCG AGGACAAGGC GCGCCAGCAG CTCGAGAACA TCGCCGGGCT GCCCTTCATT
CACAAGTGGG TGGCCGCGAT GCCGGATGTT CACCTCGGCA AGGGCGCCAC GGTCGGCAGC
GTGATCCCGA CGGTGGGCGC GATCGTGCCC GCGGCCGTGG GGGTGGATAT CGGCTGCGGC
ATGATGGCGG TGCGCACCTC GCTCACGGCC GCGGCGCTGC CCGACGCGCT GCACCCGCTG
CGCACGGCCA TCGAGGCCGC GGTGCCGCAC GGCCGCACCA ACCACGGCGG ACGCGGGGAT
CGCGGCGCCT GGGGGCGTCC GCCCGAGGCC CAGCAGCAGG CCTGGGGCGA GCTGTCGCCG
ACCTTCGAGC GCATCGTGGA CAAGCACCCC AAGCTCGGCC GCGCCAACCA CATCACGCAC
CTGGGGACGC TGGGTACCGG CAACCACTTC ATCGAGGTGT GCCTCGACGA GGCGGAGCGC
GTGTGGGTGA TGCTGCACAG CGGCTCGCGC GGGATCGGCA ACCGCATCGG CAGCTACTTC
ATCGAGCTGG CCAAGGAGGA CATGCGCCGG CACTTCATCA ACCTGCCGGA TAAGGATCTG
GCGTATCTGT GCGAGGGCAC GCAGTACTTC GACGACTACG TGGAGGCGGT GGAGTGGGCC
CAGCGCTACG CCATGGAGAA CCGCCGGCTG ATGATGGCGG CCGTGCTCCG GGCGCTGGCC
GAGTCGCCGT CGCTGCCGGC GTTCACCAGC GGCGAGATGG CGGTCAACTG CCACCACAAT
TACGTCGCCC GCGAGCACCA CTACGGCAAG AACGTGTACC TCACGCGCAA GGGCGCGGTG
CGTGCGCGCG AGGGCGATCT CGGCATCATC CCGGGCAGCA TGGGCGCGCG CTCGTACATC
GTCCGCGGCA AGGGCGAGGC GCAGAGCTTT CATAGCTGTA GCCACGGCGC CGGGCGGGTG
ATGTCGCGCA GCGAGGCCAA GCGTCGTTTC ACGGCCGAGG ACCACGCGCG CGCGACCTCG
GGCATCGAGT GCCGCAAGGA CAGCGAGGTG ATCGACGAGA CGCCGATGGC GTACAAGGAC
ATCGACGCGG TGATGGCGGC GCAGAGCGAC CTGGTCGACG TGGTGCACAC GCTGCGGCAA
GTGGTGTGCG TGAAGGGGTG A
 
Protein sequence
MQAYEVFPTS NVPVKAWVRG VPVEDKARQQ LENIAGLPFI HKWVAAMPDV HLGKGATVGS 
VIPTVGAIVP AAVGVDIGCG MMAVRTSLTA AALPDALHPL RTAIEAAVPH GRTNHGGRGD
RGAWGRPPEA QQQAWGELSP TFERIVDKHP KLGRANHITH LGTLGTGNHF IEVCLDEAER
VWVMLHSGSR GIGNRIGSYF IELAKEDMRR HFINLPDKDL AYLCEGTQYF DDYVEAVEWA
QRYAMENRRL MMAAVLRALA ESPSLPAFTS GEMAVNCHHN YVAREHHYGK NVYLTRKGAV
RAREGDLGII PGSMGARSYI VRGKGEAQSF HSCSHGAGRV MSRSEAKRRF TAEDHARATS
GIECRKDSEV IDETPMAYKD IDAVMAAQSD LVDVVHTLRQ VVCVKG
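
The sequences above are printed in space-separated blocks of ten residues, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup, together with a GC-content calculation and a start/stop codon check. All function names are hypothetical, only the first and last lines of the gene sequence are included as sample input, and the stop codon set assumes translation table 11 uses the standard TAA, TAG and TGA stops.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # assumed stop codons for translation table 11


def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def has_start_and_stop(seq: str) -> bool:
    """True if the sequence begins with ATG and ends with a stop codon."""
    return seq.startswith("ATG") and seq[-3:] in STOP_CODONS


# Sample input: the first and last lines of the gene sequence only.
first_line = "ATGCAAGCCT ACGAAGTCTT TCCCACCAGC AACGTTCCGG TCAAAGCCTG GGTGCGCGGC"
last_line = "GTGGTGTGCG TGAAGGGGTG A"

sample = clean_sequence(first_line + "\n" + last_line)
print(len(sample))
print(has_start_and_stop(sample))
```

To check the reported gene length and GC content, pass the full gene sequence block to `clean_sequence` and then call `len` and `gc_percent` on the result.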