Gene Hoch_5019 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_5019 
Symbol 
ID8547429 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp6923699 
End bp6924901 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content68% 
IMG OID646389695 
ProductVWA containing CoxE family protein 
Protein accessionYP_003269401 
Protein GI262198192 
COG category[S] Function unknown 
COG ID[COG3825] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.232102 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0213025 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCTCGTCG ATTTCCTCTA CGAGCTGCGC AGCCGCGACG TAAAGGTCTC GTCGCACGAG 
TGGATGGCGC TGATGGACGC GCTGGCCCTG GGCCTGCACG ACTCCTCGCT CGACGGCTTC
TACCGGGTGG CGCGCTCCAT CTGCGTCAAG GACGTGGCCC AGTACGACGC CTTTGATGAG
GCCTTTCTCG CCTACTTCAA AGACGTGCAC GTCGACGCCC TGGCGCTGAG CGAGCAGCTC
TTGCAGTGGC TGCAGGATCC GGCCGCGCGG CGCCGGCTCA GCCCCGAGGA GCTGGCCATG
CTCGAGAGCA TGGACCTCGA GCGGCTGCGC GCGCTGTTCG AGCAGCGCCT GCGCGAGCAG
AAGGAGCGCC ACGACCGCGG CAACCGCTGG ATCGGCACCG GCGGCACCTC GCCCTTTGGC
AACGGCGGCA CCTTCCCCGG CGGCCTGCGC GTCGGCGGCA TGGGCGGCGG CCGCTCGGCC
ATGCAGGTGG CCGGCGAGCG ACGCTTTCGC AACTACCGCA AGGATCTGGT GCTCGACGTG
CGGCAGATCG ACCTGGCGCT GCGCGACCTG CGCCAGCTCG GTCGCGAGGG TGCCGAGGAG
GAGCTCGACC TCGACGAGAC CGTGGACAAG ACCTGCAGCA ACGCCGGCGA GCTCGAGCTG
GTGTTCCGGC CGCCGCGGCG CAACCGGGTC AAGCTGGTGC TGATGATGGA CGTGGGCGGC
TCGATGGACC CCTACGCCGA GCTGGTCGGT CGACTGTTCA CGGCGGCCTC GCGCGCCGGT
CGCTTCGCCA AGTTCCGCAG CTTTTATTTC CACAACTGCG TGTACGAAAA AGTCTACGAG
GACGGCCACT TTCGCGACGG CATACCGGTG GAAGAGCTGA TTGCGAATAG CGATCGCGAC
GAGAAGCTGG TGTTTGTCGG CGACGCCTGG ATGCATCCGG CGGAGCTCTT GCAGCCGGGC
GGATCGATCT TCTACGACCA CCAGAACCGC CGCGCCGGCA TCGACTGGCT GCGGCGCCTG
AGCGAGCATT TCCGCCGCAG CGTGTGGCTC AACCCCGAGG CCAAGCGCTT CTGGGCGCAG
AGCACCATCG AGATGATCGC GCGCGTGGTG CCGATGTATC CGCTGAGCGT GAGCGGCATC
GGCGACGCCG TGCGCTACCT GGTGCGCGGC GGCCGCGCTC CCGACCCGGT GGACGAAGAC
TGA
 
Protein sequence
MLVDFLYELR SRDVKVSSHE WMALMDALAL GLHDSSLDGF YRVARSICVK DVAQYDAFDE 
AFLAYFKDVH VDALALSEQL LQWLQDPAAR RRLSPEELAM LESMDLERLR ALFEQRLREQ
KERHDRGNRW IGTGGTSPFG NGGTFPGGLR VGGMGGGRSA MQVAGERRFR NYRKDLVLDV
RQIDLALRDL RQLGREGAEE ELDLDETVDK TCSNAGELEL VFRPPRRNRV KLVLMMDVGG
SMDPYAELVG RLFTAASRAG RFAKFRSFYF HNCVYEKVYE DGHFRDGIPV EELIANSDRD
EKLVFVGDAW MHPAELLQPG GSIFYDHQNR RAGIDWLRRL SEHFRRSVWL NPEAKRFWAQ
STIEMIARVV PMYPLSVSGI GDAVRYLVRG GRAPDPVDED