Gene Hoch_4979 details


Gene Information

Locus tag: Hoch_4979
Symbol:
ID: 8547387
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 6864530
End bp: 6865558
Gene Length: 1029 bp
Protein Length: 342 aa
Translation table: 11
GC content: 71%
IMG OID: 646389653
Product: cysteine-rich repeat protein
Protein accession: YP_003269361
Protein GI: 262198152
COG category:
COG ID:
TIGRFAM ID: [TIGR02232] Myxococcus cysteine-rich repeat
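
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon; the function names gene_length and protein_length are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(6864530, 6865558)
print(length)                  # 1029
print(protein_length(length))  # 342
```

Under these assumptions the 1029 bp span corresponds to 343 codons, of which 342 encode amino acids and one is the stop codon.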


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.734088
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGACGA TGAGCGCTTC CATCCCGCGA ACCCGCGCCG TACTCGGCGC GGTGTTCGCC 
GTCGGCGCCA TGCTCGCGCT CGCCGGCTGT TTCAACCAGG GCACGCAGGC GACCGACTGT
CCGACCGGCG TGACCTGCGC GCCCGGCTGG GAGTGCGCGG CTGCGCAGGC TGCGTGCATC
CTCGACGGCT GCGGCAACGG CCGCGTGCAA TACGAGCGCG GCGAGGTCTG CGACGACGGC
AACATCCTCG ACGGCGACGG CTGCAGCGCC GATTGCCTGT CCAACGAATC CTGCGGCAAC
GGATATACCG ACGTGAGCGA GGACTGCGAC GAGGGCGACG ACGACCTGGT GTGCGACGGT
GACTGCACCG TGCCCGTATA CGGCGACGGC GTGGCCGACA ACACCGCCGC CTGCGATCGC
GACTGCAGCT TTCCGCGCTG CGGCGACGGT GTGTTCAACG AGTTTCACCT GGTGCAGCCC
GATGACGGCG GCGCCGCGTA CCTGGAAGCG TGCGACGACG GCAACGACGA GAACCGCGAT
GACTGTCTCG ACGTCTGCCT GGCCGCGCGC TGCGGCGACG GCTTCGTCCA CAGCTTGGGC
GCCGGCGGCG AGACCTGCGA CGTCGACGTG GACGGCGACG GCGTGGCCGA CAACGTCGCC
GCGTGCGACA GCGATTGCAC GGCGCCCGCG TGCGGCGACG GCGTGCACAA CGCCGCGGCC
GGGGAAGCCT GCGACGATGG CAACGAGGAC GATGGCGACG CCTGCGTCAG CGGCTGCGCG
GCCGCGCGCT GCGGCGACGG CTTCGTCTTC GAGGGCGAGG AGCTATGCGA CGACGGCAAC
GCCAGCAACG GCGACGCGTG CCCCACGGGC AGCGGCGGGA GCTGCGAGCC CGCGCGCTGC
GGCGACGGCT TCATCCAGGC CGGCGTCGAG CAATGCGACG TCGGCAACGG CGCCGTGGAT
ACGTGTGCGG GCGGATCGGA ATGCCAGCCA CCAAATCTTC CCGGTGCTTG TTCTTGCCAA
TTCACCTAG
 
Protein sequence
MATMSASIPR TRAVLGAVFA VGAMLALAGC FNQGTQATDC PTGVTCAPGW ECAAAQAACI 
LDGCGNGRVQ YERGEVCDDG NILDGDGCSA DCLSNESCGN GYTDVSEDCD EGDDDLVCDG
DCTVPVYGDG VADNTAACDR DCSFPRCGDG VFNEFHLVQP DDGGAAYLEA CDDGNDENRD
DCLDVCLAAR CGDGFVHSLG AGGETCDVDV DGDGVADNVA ACDSDCTAPA CGDGVHNAAA
GEACDDGNED DGDACVSGCA AARCGDGFVF EGEELCDDGN ASNGDACPTG SGGSCEPARC
GDGFIQAGVE QCDVGNGAVD TCAGGSECQP PNLPGACSCQ FT
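
The relationship between the gene sequence and the protein sequence can be reproduced with a short translation routine. This is a minimal sketch, not the tool that produced this record: the names CODON_TABLE, clean, and translate are hypothetical, and it relies on the fact that translation table 11 assigns the same amino acid to each codon as the standard code (the two differ in which start codons are permitted). Input may contain the spaces and line breaks used in the listing above.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (first, second, third base).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("ATGGCGACGA TGAGCGCTTC CATCCCGCGA"))  # MATMSASIPR
```

Passing the full gene sequence to translate and comparing the result with the listed protein sequence (after applying clean to it) is one way to confirm that the two listings agree.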