Gene Hoch_3094 details


Gene Information

Locus tag: Hoch_3094
Symbol:
ID: 8545482
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 4264322
End bp: 4265674
Gene Length: 1353 bp
Protein Length: 450 aa
Translation table: 11
GC content: 65%
IMG OID: 646387764
Product: beta and gamma crystallin
Protein accession: YP_003267492
Protein GI: 262196283
COG category:
COG ID:
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.439475
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCGCTT CGCGATATCG CACGATATTC AACCGCTATT TGTCCAACCT ATTCACCAAA 
GCAGCGATCA TGAACCGTTA TCGAACCCGT CTGCCCCAGC GCAGCGCCCT GCTCTTCGCG
CTCGCGCTCA CGAGCGCGAG CTGTGTGCTC GAAGCCCCTG AGCTCGATGC CGAACTCGCC
CTCGGCGAGA TGCAATACAG CATCGTGAGC ACCCACGACC TGGCCAAACG CTGGGCGCCC
ATCCACTACA TGGACGTCGA CGCCACCGGC ACCTACGCCG AGGGCGGACG CTCGGACTAC
ATCACGGCCA TCGACTACGA CGGCGACTGG GACGCGCAGA ACAACTGGAA CAACCTGCCG
CAGCACGCCT CGTCGCTGGC CGCCTACGGC TACTACTCGG TGGTCGAGAC CGCCACGCAT
TGGTTCCTCA GCTACGCCTT CTTCCACCCG CGCGACTGGA CCGATATCTT CTTCCTCTAC
GAGCTCGACC AGCACGAAAA CGACCTCGAG GGCGTGCTCA TCATCGTCGA AAAGGACGGC
TCGAGCTACG GCCGCCTGCT GGGCGCGGTC ACGGTGAGTC ACTCGGACTT CTTCTCCTAC
GCGAGCGCGG GCAGCAGCCT GAGCAGCGGC CTCGAGAACA TCGACGGCAC CTTGCACACC
CAGAGTCACG CCGGCGCCCA GCACCCGGTG ACCGCGCAAG AAGCCAAGGG CCACGGGCTC
AAGGCCTGGC CGCAGTACGA CATCAACGGC GACGGCATCG TCTACTACCC GTCGATGAGC
GACAACGCCG AGGTCCCGTC CGGCAATTCC GACAGCTACG TCGAATACGA GCTGATCGAC
ATCTTCGAGG TCGGCGGGCT GTGGGACCAG CGCTTCAACA CCAGCCTGTT CTACAACGCC
GGCGGCGGCT TCAAGGGCAA CGACTTTGGC GACGGCGGCG CCAACGCGCC CTGGGCGTGG
AACGACGGCG ACGACGGTGT GATCCAGGGC GGCGAGATCG CCACCGACCC GGCCAAGCTG
GTCGACAACT ATTTCGACGG CGTGGGCGAT TTCTCCCACG TCTACACCAG CAACCCCTAC
GGCAACGCCG GCGGCCCCGT GACCGTGTAC CAGCACTGCA ATCTCGCCGG CTACGCGGTC
ACGCTAGCGC CGGGCGCGTA CACGCTCGCC GACCTGCAGG CGCGCGGCAT CGCCAACGAC
GATCTGTCGT CGCTGCGCAT CGAGAACGGC CGCCGGGTCA CGCTCTACCA GCACGACAAC
TTCGGCGGCA GCTCTGTGGT GTTGCAGGGC AGCGACGGCT GCCTCGGCGA CGAGGGCTTC
AACGACGAGG TCAGCTCGCT GCGCATCGAG TGA
 
Protein sequence
MIASRYRTIF NRYLSNLFTK AAIMNRYRTR LPQRSALLFA LALTSASCVL EAPELDAELA 
LGEMQYSIVS THDLAKRWAP IHYMDVDATG TYAEGGRSDY ITAIDYDGDW DAQNNWNNLP
QHASSLAAYG YYSVVETATH WFLSYAFFHP RDWTDIFFLY ELDQHENDLE GVLIIVEKDG
SSYGRLLGAV TVSHSDFFSY ASAGSSLSSG LENIDGTLHT QSHAGAQHPV TAQEAKGHGL
KAWPQYDING DGIVYYPSMS DNAEVPSGNS DSYVEYELID IFEVGGLWDQ RFNTSLFYNA
GGGFKGNDFG DGGANAPWAW NDGDDGVIQG GEIATDPAKL VDNYFDGVGD FSHVYTSNPY
GNAGGPVTVY QHCNLAGYAV TLAPGAYTLA DLQARGIAND DLSSLRIENG RRVTLYQHDN
FGGSSVVLQG SDGCLGDEGF NDEVSSLRIE
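
The length and GC figures in this record can be cross-checked against the sequence text. The following is a minimal Python sketch, not part of the original record: the helper names (clean_sequence, gc_content, expected_protein_length) are hypothetical, and it assumes the gene sequence is pasted in as the space-grouped blocks shown above and that the CDS ends with a single stop codon.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one, assuming the CDS ends with one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Gene Length 1353 bp gives 451 codons; dropping the stop codon leaves
# 450 residues, which matches the Protein Length field.
print(expected_protein_length(1353))  # 450
```

To check the GC content field, pass the full gene sequence block through clean_sequence and then gc_content; the result is a percentage that can be compared with the rounded value in the record.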