Gene Hoch_5100 details


Gene Information

Locus tag: Hoch_5100
Symbol:
ID: 8547511
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 7028512
End bp: 7029633
Gene Length: 1122 bp
Protein Length: 373 aa
Translation table: 11
GC content: 72%
IMG OID: 646389776
Product: protein of unknown function DUF224 cysteine-rich region domain protein
Protein accession: YP_003269481
Protein GI: 262198272
COG category: [C] Energy production and conversion
COG ID: [COG0247] Fe-S oxidoreductase
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.436212
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
GTGTCAGATC GCTTCACTGC CGACAGCGTC GCCAAACAGC TCGACTACTG CACCTATTGC 
CCCAAGATGT GCCGCCACGC CTGTCCGGTG TCGAACGCCG ACGGCCACGA GGCGCACATC
CCGCAGGCCA AGATGGACAG CCTCAACCAG CTCCGCAAAG GCAACGCGAG CTGGAGCAGC
GAATCCGCGG CGCCCTTGTG GGCGTGCACC GGCTGCCGGC AGTGCACCGT GTACTGCGAC
CACGGCAACG AGCCCGGCCT GGTGCTGCTC GCGGGCCGCG CCGAGGCCAC CGCGCGCGGC
GCCGGTCACC CCAATCTACG CGACTATCCG CAGCGCTTCG GCAAGCGCGA GAAGCGCCTG
GTCGAGCGCA TGCGCGAGCA GCTCCCGGCC GAGCACCGCG CGGCTGACGC CCTGGTCGGC
TTCTGGCCCG GCTGCGACGC GGTCGACAAG TATCCGGGCG GCATCGACGG CGCCCGCGCG
CTGCTGTCGC AGGTGAGCGG CATGGACGTG AGCGTGCTCG ACGTCGGCCA GACCTGCGCC
GGCTACCCGC TGCTGGCCTC CGGCCACCCC GACGCCTTTC GCTGGCACGC AAGCAAGGTG
GCGCACGCGC TGCAAACCCT GCGCACCCTG GTGGTCGGGT GTTCGGCCTG TGTCTACACC
TTGCGCGTGT CCTACCCGGC CGAGGGTCAG GCGCTGTCCT GCGAGATCCT ATCGACGCCC
GAATTCCTGG CGCGCTCGCA GCGCAGCGCG CCCGAGCGCC GCGAAAAGCC CGTGGTCTAC
TACCACGACC CGTGCATGCT GGCGCGCTAC ACCGGCGTCA TCGAGGAGCC CCGCCGGGTG
CTGGGACGCA TCGCCGAAGT GCGCGAGATG AGCTGGAGCG GCACCGACAC CGAGTGCTGC
GGCGGCGCCG GCATGCTGCC CAAGACCATG CCCGAGGTCG CCGACGCCAT GGCCCGGCGC
CGGCTGCGCG ACGTGGTCCG CGGCGGCGGC GGCACCGTGG TCACCTCGTG CCCCACCTGC
GCGCTGATGC TGCAGCGCAA CGCCCCCGAC GGCGTCAGCG TCCGCATGCT CACCGAGATG
CTCGAAGAGG CGCTGGCCGC GGCCCCCGAC GACGCCGAGT GA
 
Protein sequence
MSDRFTADSV AKQLDYCTYC PKMCRHACPV SNADGHEAHI PQAKMDSLNQ LRKGNASWSS 
ESAAPLWACT GCRQCTVYCD HGNEPGLVLL AGRAEATARG AGHPNLRDYP QRFGKREKRL
VERMREQLPA EHRAADALVG FWPGCDAVDK YPGGIDGARA LLSQVSGMDV SVLDVGQTCA
GYPLLASGHP DAFRWHASKV AHALQTLRTL VVGCSACVYT LRVSYPAEGQ ALSCEILSTP
EFLARSQRSA PERREKPVVY YHDPCMLARY TGVIEEPRRV LGRIAEVREM SWSGTDTECC
GGAGMLPKTM PEVADAMARR RLRDVVRGGG GTVVTSCPTC ALMLQRNAPD GVSVRMLTEM
LEEALAAAPD DAE
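
The lengths, GC content, and protein sequence listed in this record can be cross-checked against one another. The following is a minimal Python sketch of one way to do that, not the database's own method. The function names (clean_sequence, span_length, gc_percent, translate_cds) are hypothetical, and the code assumes that coordinates are 1-based and inclusive, that the pasted sequence is the coding strand, and that ATG, GTG, and TTG are read as methionine when they open the reading frame (the common bacterial start codons under translation table 11).

```python
from itertools import product

# Codon table in the conventional TCAG ordering. Translation table 11 uses the
# same codon-to-amino-acid assignments as the standard code; "*" marks a stop.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: only the most common bacterial start codons are handled here.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean_sequence(raw):
    """Strip the spaces and line breaks used to group bases in tens."""
    return "".join(raw.split()).upper()


def span_length(start_bp, end_bp):
    """Length of a feature given 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def gc_percent(raw):
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate_cds(raw):
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean_sequence(raw)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            # An initiating GTG or TTG is still read as methionine.
            protein.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# Checks against values given in this record.
print(span_length(7028512, 7029633))   # 1122, the listed gene length
print(1122 // 3 - 1)                   # 373, codons minus the stop codon
print(translate_cds("GTGTCAGATC GCTTCACTGC CGACAGCGTC"))  # MSDRFTADSV
```

Passing the full gene sequence to gc_percent and translate_cds gives values to compare with the listed GC content and protein sequence.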