Gene Hoch_5021 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_5021 
Symbol 
ID8547431 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp6925845 
End bp6927554 
Gene Length1710 bp 
Protein Length569 aa 
Translation table11 
GC content72% 
IMG OID646389697 
Producthypothetical protein 
Protein accessionYP_003269403 
Protein GI262198194 
COG category 
COG ID 
TIGRFAM ID[TIGR02608] delta-60 repeat domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.196755 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.0476496 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCGTA TCCTCTCTCT CGTCGCCTCG TCCTCGCTCG CGCTCGCGCT GGCCGGCTGC 
ACCGCCATCC TCGGCATCGA GGAGCTCAGC GGCACCACCG ACGGCGGCGT TCCGGTGGAT
GCTAGACCTG GCGACGGCGC ACTGCCCGGG GATCCCGACA GCGGTCTGGC CGGCTACTCG
CTCCGGATAC ACACCAACGC GCCCACCCTG CCGCTCGACG GCACCACCTT CCTCGACATC
GAGATCCAGC GCCTGAGCGG CCACGATCGC GAGATCCGGC TCGATATCGA CGGGCCGGGC
GGCGTGATCA GCCCGGGCCT CACGGTCAGC GGCACGAGCA CGCTGGTCGA GCTGCCCATC
GGCGCCGGCG CGCCCCTGGC CATCGGCGAT GAGGTCTCGT TTCGCGTGCG CGCCATCGAG
ACCGACGGCG CCGGCATCGC GGTCGAGCGC GAGGTCACGG GCGCCCAGGT CACCGGCCGC
CCCGGCCTGC TCGATACCTC GTTTGGCGCC GCCGCCACCG GCCTGGCGCG CGTGAGCTTT
GGCAACGACG ACAGCGGCCG CTTCTACGAC CTCGAGATCT TGCCCGACGG CAGCATCCTG
GCCGCGGGCT GGGGCGCCGG CGGCCTCGGC GCCGTCACCA GCGCGCTGGC CCGGCTCACG
GCCGACGGCC TGGCCGACCT CGGGTTCTCG GGCGACGGCC TCGTGCGCAC CAACTTCGAG
ACCGGCTCGT CGGCCGAAAG CTTTCAGACC TACGCGATCG GCCGCCAGCT CGACGGCCGC
ATCATCGCCA TCGGCCAGCA CAGCAGCACC AGCTCGTATC CGCGAGCCTT CGCCCTGGCC
CGCTACACCG CCAGCGGCGG CGAGGGCGAC CCGCTGTTCG GCAACTTCGC CTCCGGCCGC
AGCCGCATCC TCATCAACAA CACCGCCATC GACCTCGTCC GCGACGGGCT CGTCACCGTC
GACAACCGCA TCCTGGGCGC GGGCAGCTTC GGCGGCAGCC TGAGCGTATT CCGCGCCACC
TCGAGCGGAG ATCTCGACCA GATCTTCGCC GACCGGGGCG TGTTCCAGCT CGACGCCGAC
GGCAGCTCGC GCGCCGAAGC CATCAGCCGC GACGCCCAGG GCCGCCTGCT CGTGGTCGGC
ACGCGCGAAC GCGGTGCTCA GAGCGACATG ATCGTAGTCC GCCTGGACGA AAACGGCGCG
CTCGACGACG GGTTCGCCGC CGGTGGCGTG CTCATCGCAG GCAGCCCGGA GATCGACGAG
CGCGCCGTGG CCGTGGCCGT GCGCGCCGAC GGCCGCCTGG TAGTCGCCGG CGACGTCACC
CTCGCCGATG GCAGCCGCGC GCTGCAGGTG CGGCAGTTCA CGGCCGAGGG CGACTTCGAC
AGCGAGTTTG GCACGAACGG CGTGAGCACC CAGGTGCTCG ACGACCGCGG CGTCGAGGTC
ACCGACATGC TGCTCGCGCC CGACGGCCGC ATCCTGGTGC TGGGCAACGG CACCGGCAAC
GCCGACCCCG TGCTCGTGCG CCTGTCGCGC GACGGCGGGC TCGACCCCTA CTTCGACGGC
GACGGCGTGC TGTCGATGTA CGTGGGCGAC TGCGGCGCGG TCGAAACGCT CGCCCTGGTC
GGCCGCAGCC GGCTGCTGAT CGCGGGCGGC GACGAGTGCG GCACGCCCGG CCCGGGCACC
GCCGGCATCA TCCTGCGGCT GTGGATCTGA
 
Protein sequence
MSRILSLVAS SSLALALAGC TAILGIEELS GTTDGGVPVD ARPGDGALPG DPDSGLAGYS 
LRIHTNAPTL PLDGTTFLDI EIQRLSGHDR EIRLDIDGPG GVISPGLTVS GTSTLVELPI
GAGAPLAIGD EVSFRVRAIE TDGAGIAVER EVTGAQVTGR PGLLDTSFGA AATGLARVSF
GNDDSGRFYD LEILPDGSIL AAGWGAGGLG AVTSALARLT ADGLADLGFS GDGLVRTNFE
TGSSAESFQT YAIGRQLDGR IIAIGQHSST SSYPRAFALA RYTASGGEGD PLFGNFASGR
SRILINNTAI DLVRDGLVTV DNRILGAGSF GGSLSVFRAT SSGDLDQIFA DRGVFQLDAD
GSSRAEAISR DAQGRLLVVG TRERGAQSDM IVVRLDENGA LDDGFAAGGV LIAGSPEIDE
RAVAVAVRAD GRLVVAGDVT LADGSRALQV RQFTAEGDFD SEFGTNGVST QVLDDRGVEV
TDMLLAPDGR ILVLGNGTGN ADPVLVRLSR DGGLDPYFDG DGVLSMYVGD CGAVETLALV
GRSRLLIAGG DECGTPGPGT AGIILRLWI