Gene Hoch_2435 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_2435 
Symbol 
ID8544821 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp3363566 
End bp3365134 
Gene Length1569 bp 
Protein Length522 aa 
Translation table11 
GC content70% 
IMG OID646387134 
ProductThrombospondin type 3 repeat protein 
Protein accessionYP_003266865 
Protein GI262195656 
COG category 
COG ID 
TIGRFAM ID[TIGR03382] Myxococcales GC_trans_RRR domain 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.492834 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCCATC GTTCATACCG AGCTTCTCGT GATTCGTCTC TGACCTGCGG CCGCGCGTCC 
CTGCGGCCGC GCGCGGTCTT CTTGGCGGCG CTCGCGCTGG CCCTGGTGGC CGGCGCCGGC
CTGGCTGGCG AGGCCCGCGC GCAGGTACCC ATCGAGTGTC CGGCGCCCGT GGATTGCGAC
GCGATCGGCG TCTGCGAGCT CGCCGACCTC AACTGCAACG GCATCCTGCG CTGTGATGAT
CCCGCCTCCG GCACCGTGGG CGAGGGCGAG TGCGTCGACG TGTGCGCGCA GGGGATCTGC
TCCACCGAGG TCCCCGATCG GCGCGCATGC GACGACTACT TCGACCCCGA GGGCGACGGC
GGCCAGTGCA CCACGGATCT GAACAACGCC GGTGTGGCCG ACAGCGACGG TGACTGCATC
GGCGACGCCT GCGACAACTG CATCGACATC AGCAACCTCG CCCAGCTCAA CCGCGATAAC
GACCTCTTCG GCAACGCCTG CGATAACTGC ATCGAAGTGC GCAACAACGA CCAGGCCAAC
GCCGACGAGG ATCTCTTCGG CGACGTCTGC GACAACTGCG TCGACGTCGC CAACGACGAC
CAGGCCAACA CCGATGTGGC CGCGGACGCG CCCGGCGACA GCTTTGGCGA CGCCTGCGAC
AACTGCGTCA ACGTCGCCAA TGAGGACCAG GCCAACGCCG ACAGCGACAA CTTCGGCGAC
GTCTGCGACA ACTGCGTCAA CGTCGCCAAC GATCAGGCCA ACGCCGACAG CGACAGCTTC
GGCGACGCCT GCGACAACTG CGCCGGTGTC GCCAACGAGG ACCAGCGCAA CTCCGACGCC
GAGATGGACC CGCCGGGCGA TGGCTTCGGC GACGTCTGCG ACAACTGCTT GATGGTCGCC
AACCCCGACC AGGCCGACAG CGACGGCGAC GGCCTGGGCG ATGCCTGCGA CCTGTGCCCC
GACGACGACA GCGACGTCGA CGACCAGGTC GACCAAGACG GCGACGGCCT CGGCGACCGC
TGCGACGTGT GCCCGAACGT GGCCAACGCC GTGGCCGATC CCGGCAATGG CATCGCCGGT
CAGCTCGAGT CGGATCGCGA GGATCCCGCG GACCCGAGCT CGGGTGACGG CTTCGGCGAC
GACTGCGACA ACTGCGCCCT GGTCCGCAAC CCGGATCAGG CCGATGCCGA CAACGACGGC
GTGGGCGACG CCTGCGACAT CTGCGTGAAC GCGGCCGATC CCGACCAGGC CGATGCCGAC
GGCGACGGCC TGGGCGACGC CTGCGACGTG TGTCCGAACA TCAGCGACGC CGACGCCCAG
ATCGACGGCG ACGGCGACGG CGTGGGCGAT GCCTGCGACA ACTGTCCGAA CACGCACAAC
CCGGACCAGC GCAAGTCCGA GCTGACGCGC GCCGATGGCA GCGAGCTCGG TTACGCTTGC
GAGCCCGGCA TCCAGGGCGC GGGCGGCTGC TCGGCCCATC CCGCGATGAA CGGCCCCGCG
GCGCCGGCCG CGCTGCTGGC GCTGCTGGCG CTGCTGGGCT TCGCGGCTAT CCGCCGCCGC
CGCAGCTGA
 
Protein sequence
MSHRSYRASR DSSLTCGRAS LRPRAVFLAA LALALVAGAG LAGEARAQVP IECPAPVDCD 
AIGVCELADL NCNGILRCDD PASGTVGEGE CVDVCAQGIC STEVPDRRAC DDYFDPEGDG
GQCTTDLNNA GVADSDGDCI GDACDNCIDI SNLAQLNRDN DLFGNACDNC IEVRNNDQAN
ADEDLFGDVC DNCVDVANDD QANTDVAADA PGDSFGDACD NCVNVANEDQ ANADSDNFGD
VCDNCVNVAN DQANADSDSF GDACDNCAGV ANEDQRNSDA EMDPPGDGFG DVCDNCLMVA
NPDQADSDGD GLGDACDLCP DDDSDVDDQV DQDGDGLGDR CDVCPNVANA VADPGNGIAG
QLESDREDPA DPSSGDGFGD DCDNCALVRN PDQADADNDG VGDACDICVN AADPDQADAD
GDGLGDACDV CPNISDADAQ IDGDGDGVGD ACDNCPNTHN PDQRKSELTR ADGSELGYAC
EPGIQGAGGC SAHPAMNGPA APAALLALLA LLGFAAIRRR RS