Gene Hoch_4011 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_4011 
Symbol 
ID8546407 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp5501392 
End bp5502708 
Gene Length1317 bp 
Protein Length438 aa 
Translation table11 
GC content72% 
IMG OID646388683 
ProductPEGA domain protein 
Protein accessionYP_003268403 
Protein GI262197194 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0496099 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAGGAC GGCGATCTGG GGGCGTCGCC GGGGGCGCGC TCTTCGCCCT GCGAACGGCG 
CTGCTGGCGC TGCTGGCGTG GGCGTGCGTC GCGGCGCTGG CGCCGCACTC GGCGTGCGCG
CAGCGCGGCG GCAAGGATGG CGAGGTCGAC GCCAAGGAGC GCGCCGAGGC CGAGCTGTTC
TTCCGCGCCG GCGAGCAAGC CTACGAAGCC GGTCAGTATC TGGTGGCCGC GCAGGCCTTC
GAACAGGCCT ATGAGACCTT GCCGCTGCCG GCGATCGCGT TCTCGACCGC GCAGGCCTAT
CGGCTGCAGT ACTTCATCGA CAAGGAGTCG GCCAACCTGC TGCGCTCGAT CGAGCTCTAC
CGCCGCTACA TCGACTCGGT GGAGCAGGGC GGTCGCCGCG ACGATGCGGC CACCAGCCTG
GCCGAGCTCG AGGTCATCCG CGATCGCTTG CAGCGTGCCG GACGGCTCGA TCCGCGGCGC
CGGGTCGCGA GCGATGAGGG GCGGACGCAG CTCATGGTCA CCACCCAGGT GCCGGGCGCG
ATCGCCAAGA TCGACGACAA GAGCGGCGAG ACGCCGCTGA TCCTCGAGGT CTCGTCTGGC
AAGCACCAGG TCGAGGTCCG CGCCGACGGC TACTTCAGCG CCAAGCAGCA GGCCGTGGCC
GTGAGTGGCC GCCTGGTGGT CGTCGAGGTC GAGCTCAAGC CGCAGGCGGC CAAGATTCAG
CTCGAGGTCG AGCCCGGCGC GACGCTGCAC ATCGATGGCC GTCCGGTGGC GCAGACGCCG
AGCGCCGAGC CGGTGTCGGT GCCGGCGGGC AAGCGTTTCG TCGCCATCAC GCGCCGCGGG
CGCCGGCCGT GGACGCGCGA GCTCGAGCTC GAGCGCAACG CCTCGCTCAC CATCCGCGCC
GATCTCGAGG CCACGACCCA GCGCTCGGCG GCGATCTGGG TGCTGGGCGC CTCGGCGCTG
GCCGCGGTGG GCGCGGGCGC GAGCGGCGTG TTTTGGTATC TGGCCGACGC CGAGGGCGCC
GATTACATCG CCTCGCACGA GGAGCTGTTT CCCGCGGACA AGGTGGCTCT CAGCGATATC
CGGGAGCGCC GCGACGGGCG CCGGGGCATC ACCCTCGGCC TGGTCGGCGC CGCGGTCGCG
ATCGGCGCCA TCGGCGGCGT GCTGTACTGG TTCGACAATC CCCACGTCGA GCAGGGCGGG
CGCTCCAAGG GCGAGGTCTG GGGCGCGGAG GACGACGAGG GCGGCGCCTC CTCGCTGCGC
CTCACGCCCA CCCTCGACGA GCGCGGCGCT GGCTTTGCGG TCTCTGGCCA CTTTTGA
 
Protein sequence
MRGRRSGGVA GGALFALRTA LLALLAWACV AALAPHSACA QRGGKDGEVD AKERAEAELF 
FRAGEQAYEA GQYLVAAQAF EQAYETLPLP AIAFSTAQAY RLQYFIDKES ANLLRSIELY
RRYIDSVEQG GRRDDAATSL AELEVIRDRL QRAGRLDPRR RVASDEGRTQ LMVTTQVPGA
IAKIDDKSGE TPLILEVSSG KHQVEVRADG YFSAKQQAVA VSGRLVVVEV ELKPQAAKIQ
LEVEPGATLH IDGRPVAQTP SAEPVSVPAG KRFVAITRRG RRPWTRELEL ERNASLTIRA
DLEATTQRSA AIWVLGASAL AAVGAGASGV FWYLADAEGA DYIASHEELF PADKVALSDI
RERRDGRRGI TLGLVGAAVA IGAIGGVLYW FDNPHVEQGG RSKGEVWGAE DDEGGASSLR
LTPTLDERGA GFAVSGHF