Gene Hoch_5270 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_5270 
Symbol 
ID8547682 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp7247829 
End bp7248989 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content71% 
IMG OID646389944 
Productprotein of unknown function DUF34 
Protein accessionYP_003269648 
Protein GI262198439 
COG category[S] Function unknown 
COG ID[COG0327] Uncharacterized conserved protein 
TIGRFAM ID[TIGR00486] dinuclear metal center protein, YbgI/SA1388 family 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAACTCG CAGACATCCT CGCCGTGCTG CGCGAGGTCG CCCCCGAATC GCTGGCGGAG 
AGCTGGGACA AAGTCGGCCT CCAGGTCGGC GCGGAGGACG CGCGCATCCG CCGCGCGCTC
CTGTGCATCG ATCTCACCGA GCCAGTCATG GCCGAAGCCG TGGCCCGGGA ATGCGACCTC
ATCGTCGCCT ACCACCCGCC CATCTTCGCC CCGCTCACGC AGCTCACCGA GCGCACCTGG
AAAGAGCGCG TGATCATCGA GGCGGTGCGC CGGCGCATCG CCATCTACAG CCCGCACACG
GCGCTCGACA ACGTCCGCGG CGGCATCACC GACTGGCTGT GCGACGGCCT GGGCGAGCAC
GAGCTGCGCA CGACCATCGT CGACCACGTG GACGAACTCA AGAGCTACAA GGTGGTGACC
TACGCGCCCA CGGCCGTGGC CGAGCAGATG CGCACGGCCA TGCGCCAGGC CGGCGCCGGC
GGCCGCGGCG ACGACGACCA CGCCTACCAC ACGCAGGGCT ACGCCACCAT GGGCAGCATC
GCCCGCCCCG AGAACCGGCA CGAGACCGGA AACGAACCCG GCGCGCCGTC TTTCGGACAC
ATCGACCAGA TGCGCGTCGA GATGACCTGC GCGCCCAAAT ACATCGGCGA CGTGATGCGC
GCGATCCGCC ACGCACACCC CGAAAAAGAG CCGATCATCG ACCTCTACCG GCTGGCCTCG
GAGCTGCCGG CCATGGACGA CGCGCCCGGC GCCGGGCGCG TGCACCACCT CGCGCACCCC
ATCACGGTCG ACGCGCTGAT CAAGCGCATC AAGCGGCGCC TGCAGCTCGA GCACCTCAAG
CTCGCGGTGC CGCCGCCCGG GGCCGGCAAG CGCGAAGGCG CCACCGGTGA GATGCACATC
CGCACCATCG CCGTGTGCCC GGGCGCGGGC GGCTCGCTGT TCGAGCGCTA TCCCGGCGCC
GACGCCTACT TCACCGGCGA GATGCGCCAC CACCAGGTCC TCGAGATGAG CGCGCGCGGT
CAGGTCGTGG TGCTGGGCGG CCACACCAAC ACCGAGCGGC CGTATCTGCC CGTGTACCGC
GACCGCCTGG TGGCCGCGGG CGGCGACCGC GTCGAGTGGC TGATGAGCGA GAGCGACCGC
GTGCCGCTGG CCGTGTACTG A
 
Protein sequence
MELADILAVL REVAPESLAE SWDKVGLQVG AEDARIRRAL LCIDLTEPVM AEAVARECDL 
IVAYHPPIFA PLTQLTERTW KERVIIEAVR RRIAIYSPHT ALDNVRGGIT DWLCDGLGEH
ELRTTIVDHV DELKSYKVVT YAPTAVAEQM RTAMRQAGAG GRGDDDHAYH TQGYATMGSI
ARPENRHETG NEPGAPSFGH IDQMRVEMTC APKYIGDVMR AIRHAHPEKE PIIDLYRLAS
ELPAMDDAPG AGRVHHLAHP ITVDALIKRI KRRLQLEHLK LAVPPPGAGK REGATGEMHI
RTIAVCPGAG GSLFERYPGA DAYFTGEMRH HQVLEMSARG QVVVLGGHTN TERPYLPVYR
DRLVAAGGDR VEWLMSESDR VPLAVY