Gene Hoch_2169 details


Gene Information

Locus tag: Hoch_2169
Symbol:
ID: 8544555
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 3018384
End bp: 3019520
Gene Length: 1137 bp
Protein Length: 378 aa
Translation table: 11
GC content: 63%
IMG OID: 646386876
Product: cysteine-rich repeat protein
Protein accession: YP_003266607
Protein GI: 262195398
COG category:
COG ID:
TIGRFAM ID: [TIGR02232] Myxococcus cysteine-rich repeat
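
The start, end, gene length, and protein length fields in this record are related by simple arithmetic. The following is a minimal Python sketch of that check. It assumes 1-based inclusive coordinates and that the last codon of the CDS is a stop codon; neither is stated by the record itself. The function names `gene_length` and `protein_length` are hypothetical and used only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count, assuming the final codon is a stop codon (an assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3018384, 3019520)
print(length)                  # 1137
print(protein_length(length))  # 378
```

Under those assumptions the computed values match the Gene Length (1137 bp) and Protein Length (378 aa) fields listed for this gene.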


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0145186
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCGGGA CGCGATGGAG AGGAATAGGC TGGGCGTGCG TGCTGGGCGT GGCGCTGGCC 
GCTGTCGGCT GCCTGGTGGA GACGAACACG AGCGAATGCG CGAGCGGGCT GCGCTGCCCG
ACCGACGCCT ACTGCGCCGA TGACGGCAAG AGCTGCATCA CCGGCCTGTG CGGCAACGGC
CGCCTGGATG TCGGCGAGGT GTGCGACGAT GGCAACGATC GCTCCATGGA CGGTTGCCGC
GCCGACTGCC TGTCCGATGA AAGCTGCGGC AACGGCGTGC ACGATCCGCA GGTGGGCGAA
CAGTGCGATG ATGGCAATCG GGTTTGGGAT GATACCTGCT CGCCCGATTG CCTCCTGCCG
CGTTGCGGTG ATGGCGAGGT CACCAAAGGC GAAGAATGCG ACAGCGGCGG TGTGGATTCC
GCAGGGTGCA ACTACGACTG CCGGGCGCCG GTGTGCGGCG ATGGCTACGC CAACCTCGTC
GCTTCCAATA CCGGGACGCC CGATATCCCC AACGATCGCG AGGAGTGCGA CAGTTGGGGG
GAAGACTCGC CATCGTGCGA CTTCGATTGT ACCCGGCCCG TGTGCGGTGA TGGTTACCTC
AACCGAGACG CATTGAATAC CGGGACGCCG GATATCCCGG ATGATAAAGA GACGTGCGAC
ACGGGCGGTG TGAACACGGC AACCTGCGAT TATGATTGCA CCGTCGCCGA GTGCGGAGAC
GGATTTTTTA ACCCGGAATT TGTCCTGGCG TCCGGTTTTC CTGAGGAGTG CGACACTGGC
ACATCAACAG TGGCTTGCGA TGGTGACTGT ACCGCCGTGG TCTGCGGCGA TGGCTTTGCG
AACGCAGCGG CCGGTGAGAC CTGCGACGAC GGCAACAGTA TACTGACGGA TGACTGTCCG
TCGGGTCCGC GCGGCATCTG CAAAGTCGCC ACCTGTGGAG ATGGGTTTCT CCACGAGGAC
GAAGGCTGCG ACGATGGTGA CAACAGCACC ACCGATGGCT GCCCCTCTGG CCCGAATGGC
TCGTGCGAGC CGGCGTACTG TGGCGATGGA TTTCGGCGCG CTGGTGTAGA GGAGTGCGAG
CGCGACTCGC ATTGTCCGGG TCAATTGACC TGTCGCAGCG ATTGCAAATG CCGCTGA
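
The gene sequence is displayed in space-separated blocks of 10 bases, so the spacing has to be stripped before computing any per-base statistic such as GC content. The following is a minimal Python sketch of one way to do that, shown on the first two blocks of the sequence only. The function name `gc_content` is hypothetical, and the record does not say how its own GC content field was computed.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(seq.split()).upper()  # drop the block spacing and newlines
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First two 10-base blocks of the gene sequence, as displayed.
print(gc_content("ATGCGCGGGA CGCGATGGAG"))  # 0.7
```

Passing the full sequence block as a single string should give the fraction for the whole gene, which can then be compared with the GC content field.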
 
Protein sequence
MRGTRWRGIG WACVLGVALA AVGCLVETNT SECASGLRCP TDAYCADDGK SCITGLCGNG 
RLDVGEVCDD GNDRSMDGCR ADCLSDESCG NGVHDPQVGE QCDDGNRVWD DTCSPDCLLP
RCGDGEVTKG EECDSGGVDS AGCNYDCRAP VCGDGYANLV ASNTGTPDIP NDREECDSWG
EDSPSCDFDC TRPVCGDGYL NRDALNTGTP DIPDDKETCD TGGVNTATCD YDCTVAECGD
GFFNPEFVLA SGFPEECDTG TSTVACDGDC TAVVCGDGFA NAAAGETCDD GNSILTDDCP
SGPRGICKVA TCGDGFLHED EGCDDGDNST TDGCPSGPNG SCEPAYCGDG FRRAGVEECE
RDSHCPGQLT CRSDCKCR