Gene Hoch_5301 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_5301 
Symbol 
ID8547713 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp7288439 
End bp7290823 
Gene Length2385 bp 
Protein Length794 aa 
Translation table11 
GC content71% 
IMG OID646389975 
Productcysteine-rich repeat protein 
Protein accessionYP_003269679 
Protein GI262198470 
COG category 
COG ID 
TIGRFAM ID[TIGR02232] Myxococcus cysteine-rich repeat 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGCATA TTCGGGCCGG CGTCGGATTT GTCTGGCGCG CGGTCGCGGT GGCGCTGTTG 
TTCGCGGCCG GCGCGTGCAT CGACGACGGC AGCGTGCTGT GCCCGGGCGC CGACGGCATC
CGCTGCCCGG CGAGCCTCGT GTGCACGGCC GACGGCACCG GCTGCCGGAT TCCCGGCAGC
TCGTGCGGCG ACGGCGTGGT CGAGCCCGGC GAAGGCGAGG TCTGCGACGA CGGCAACGTC
GAGTCGGGCG ACGGCTGTCG CGGCGACTGC GCATCCAACG AACGTTGCGG CAACGGCACC
GTGGACGCCA TCAGCGACAA CACCGCCGGC TTCGAGGAGT GCGACGCCGG CGAGCTCAAC
TCCGACGACA TCGGCCTGTG CACCCTGGAC TGCACGCTCA ACTGCGGCGA CGGCGAGCAC
GACTACCTCG AGGAGTGCGA CGACGCGGTG TTCGACTCCA TCTGCCTCGA CTACGGCTTC
GACTGGGGCG CCACCGCGTG CACGCGCTGC GTCGCCGATC TCGACGACTG TGGCCGCATC
GACTGGGAGC GCTCGAGCGA CGTGGCCGGT ATCGGCGTGC TCAACGCGGT GTGGGGGCTG
GGACCCGGGC TGCTCTTCGC CGCGGGCGCC AACGACGACG GCACCCGCGA CGCCATCGTG
CGTTACGACC CGGCGCGCAA TCTGTGGGTG TCGGCGGGAG CGCCGACCCG GGCCGGCGCC
TTTCACGGCA TCTGGGGCCT GGGCGATGAC GAGGTGTACT TCGTCGGTGA GTTGGGCCGC
GTCTTGCTGT ACGACGGCGT CGGCTGGCAG GCGCTCGAGA CCGGTGTGGA GTACGCGCTG
CGCGCGGTCT GGGGCAGCTA CCTCAAGGGC GAGGCGCTGC TGTTCGCCGC CGGCGACAGC
GCCGCGATCC TGCGCTACGA CGGCCGCACC TGGCGCGACG AAGCGCTGCC CGCCGAGCTC
CCCGACGACG CGCGCATCGC GGCCTTGTGG GGCAGCAGCC CGCGCGACGT GTACGCGGTC
GGCGGCGCGG GCCTGATCCT GCACTACGAC GGCGAGCGCT GGGTGCGCGA GGCCGCCGGG
CTCACCGGCG CGGACCTGCG CGACATCTGG GGCAGCGAGG GCGAGGTGTT CGCGGTCGGC
AGCGAAGGCG TGGTGCTGCA CTACGATTCG GACGGCTGGA CGATCATGGA CGAGGTGCGC
GGACCCGACA CCAACCTGCC CGAGAGTCGC CCGATGCCCA CGCTCTACGC GGTGTGGGGC
AGCGCGCCCG ACCACGTGGT CGCGGTCGGC GATCAGCTCA CCATGCTGGT CTACGACGGC
AACTCGTGGT CGCGCTTGAA CTCGGCTACC GATGCCCTGG TGCGCGATCT GTGGGGCCTG
CGTGAGTTCG GCTTGCTCGG CGTCGGCGAG CGCGACCAGG TCTTCCGGCA CTACGGTTGG
TCGCGGCCAT CGCGGCTGGC GCGCCCGCTG TTTGCCGAAA TCATCGATCT CTGGGGCCGC
GCCAGTGACG ACATCCTGGC GCTGACCAGC CGCGAGCCCT TCCTCATCCA CTTCGACGGC
ACCGAGTGGC GCGCGCTCGA GGAGAACCAC CCCGAGCTCA TGATTCTGCT GCCCGACAAA
GACGACGGCA CCACGCCGCT GCGCGACATG TGGGGCGATG CCGAGGGCGT CATCCACGCG
GTCGGCGACG ACAGCCTGAT CCTGCGCTAC CAGCCCGGCG AAGGCTGGAC GCAGGTCGCC
CTGGACGCCT CGGTGCCGCG CCGTGGGCTC AACAGCGTGT GGGGCACCGA GGCGGGCGAG
CTGTACGCGG TCGGCGAGGC GGTCGAAGGC GAGCCGGCGC TGATCCTGCA CTACGACGGC
AGCGCCTGGA CGCAGATGAG CAACGGCGCG ACCGCGACCC TGCACAGCGT GTGGGCGCAC
GACAAGCGCG CCTTTGCCGT GGGCGAGAAC GGCACGATCC TCACCTACGC AGCCGAGGAC
GGCGGCATCT GGACGCACAT GAACTCGCCG ACCCGCGAAC CCCTGTACGG CGTCTGGGGC
GCGGGCCAGG GCCTGGTGCG CGTGGTCGGC GCGGAGGGCA CCTTGCTGGT TTACGCTCCC
AGCACCGGCT GGGCGCACGT CGAGGCGCCG TCGGTGGGGG CCGAGGACCT GTACGCCATC
TGGGGCAGCG ACGCCGAGCA CGTGTTCGCG GTGGGCGCTG AGGGCACGCT GCTCTTCGAT
AATGGCGACG ACGGCTGGAC CCCGGTGCGC GTGGGCACCG ATCGCACCCT GCGCAGCGTC
TGGGGCGTGC GCTCGACCGG GAATCACCTG CGCGCGGTCA TGGTCGGCGG CGACGCGGGC
GCCTTCGACC ACCTGTTCAT CACCGATAGC TCGACCTTTC CCTGA
 
Protein sequence
MRHIRAGVGF VWRAVAVALL FAAGACIDDG SVLCPGADGI RCPASLVCTA DGTGCRIPGS 
SCGDGVVEPG EGEVCDDGNV ESGDGCRGDC ASNERCGNGT VDAISDNTAG FEECDAGELN
SDDIGLCTLD CTLNCGDGEH DYLEECDDAV FDSICLDYGF DWGATACTRC VADLDDCGRI
DWERSSDVAG IGVLNAVWGL GPGLLFAAGA NDDGTRDAIV RYDPARNLWV SAGAPTRAGA
FHGIWGLGDD EVYFVGELGR VLLYDGVGWQ ALETGVEYAL RAVWGSYLKG EALLFAAGDS
AAILRYDGRT WRDEALPAEL PDDARIAALW GSSPRDVYAV GGAGLILHYD GERWVREAAG
LTGADLRDIW GSEGEVFAVG SEGVVLHYDS DGWTIMDEVR GPDTNLPESR PMPTLYAVWG
SAPDHVVAVG DQLTMLVYDG NSWSRLNSAT DALVRDLWGL REFGLLGVGE RDQVFRHYGW
SRPSRLARPL FAEIIDLWGR ASDDILALTS REPFLIHFDG TEWRALEENH PELMILLPDK
DDGTTPLRDM WGDAEGVIHA VGDDSLILRY QPGEGWTQVA LDASVPRRGL NSVWGTEAGE
LYAVGEAVEG EPALILHYDG SAWTQMSNGA TATLHSVWAH DKRAFAVGEN GTILTYAAED
GGIWTHMNSP TREPLYGVWG AGQGLVRVVG AEGTLLVYAP STGWAHVEAP SVGAEDLYAI
WGSDAEHVFA VGAEGTLLFD NGDDGWTPVR VGTDRTLRSV WGVRSTGNHL RAVMVGGDAG
AFDHLFITDS STFP