Gene Hoch_0021 details


Gene Information

Locus tag: Hoch_0021
Symbol:
ID: 8542391
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 24316
End bp: 26331
Gene Length: 2016 bp
Protein Length: 671 aa
Translation table: 11
GC content: 75%
IMG OID: 646384809
Product: hypothetical protein
Protein accession: YP_003264556
Protein GI: 262193347
COG category:
COG ID:
TIGRFAM ID:
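
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the unspliced CDS ends with a single stop codon that is not counted in the protein length; neither convention is stated in the record itself. The function names are illustrative only.

```python
# Values copied from the gene record above.
start_bp = 24316
end_bp = 26331
gene_length_bp = 2016
protein_length_aa = 671


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons should hold if the assumptions above match the record.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Under these assumptions, 26331 - 24316 + 1 gives 2016 bp, and 2016 / 3 - 1 gives 671 aa, matching the listed values.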


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCACGA GTGAGGTGGA GGCGGCGCTC GAGAGCGCCC TTGCCGAGGG GACTCTCGGT 
CTCGCGCGAC GCGCGACGCA GGAATGCCGC GCGCTGCTGC GGCTAGCCGA GGCCGGCGCG
CGCGAGGGCG ACCCGCGCTG GGATGAGGTA TTTGCCACGC GCGCGGCCTG GGGGCCGCTG
CTCTACGGGC TGCTCGACAA AGCCGAGGGG CGCCTGCCGC CGCTCATGCA GGCGCTGCTG
ATCTTTGCCC TGGAGCCGGT CGATACCACG ATCCTGGTCG TCCTCTGGGG CCTGGCCGGC
TCGGCTTCGC TGCGCCAGCG CGCCCAGCGG ATCGAACGCG CCTGGGAACA GGACCGCGTG
GACGAACTCT TCGCCGACCG CACGGACGAT GACGATGACG ATGACGGGGA CGCGGACGCG
GACGCGGACG CGGACGCGGA CGACGACAGT GAGGTGGACG TCGGCATCGG TTCCGCTCCC
GACGATGCGG CCGACGCCGA CGCCGAGATG GTTCTCGGTC ACTCGGCCGA ATTCATCGTC
CGGCTGTGCT ATCCCGACCT GCGGCGGCAG CGCAAGGCGC TGGCCGCGCT GCGCCCCCAG
GGTTCGCTGC GGCGCTTCGT TCTCGTCCGC TCGTCCGACG TCCGCAGCGC CGACCCGCGC
AGCGAGCGCT TCGAACTCGA CCCCGACCTG GCCTCGGCGC TGCTCAACGC GTGGCGGCCG
CCGAGCGGAC TCGAGGGCGT GGTGCGCAGC GGCGTGGGCG CGGGCCGCGC CCTGCATCCC
AGCCAGACCG CCCAGGCCAG CGCGCTCGCG CAGGTGTTGC AGCCGCCGCA GCAGCGCGTG
GCCCTGCTCG GCCGCCCGGG CGCCGGCAAG CGCACGCTGA TCCGGGCCGT GGCCGCGAGC
GCCGGCATGC CGCTCATCGA GCTGACCCTG GACGCGCTCG AGCGCGCGCC CCAGAAGCGG
CGCCACCTGC TCCGCCGTCT GCAGCGCGAC GCCCTGTTAT CGCGCGGCAT CCTGTACATC
CAGCTCGACG GCCCGCTGCC GCCGCCGGTG CTGCGCGAGG TGTTCGCGGC GGTTCCCGGA
CGCCTGGTCC TGGGAGTGCC GGTCGACGCC ACCGCCGCGC CCGCGCACGG TGAGGCCCTG
CGCGCGCACT GGCCCGACAT CCACCTGCAG CCGCTGCCGT CCATCGCCGT GGCCTCGCAG
CCGGCGCTGT GGCGCGCGTG TCTGAGCGAG CGCGGCCTGC CGGTCGACGC CGCCGGTCTC
GACGCGCAGC TCGAGGCCCA CGTGTGCCGG CCGGGGCTGG TGCTGGGCGA TTTCGTCCAC
GCGTGCGAAA TCGTCCGCAG CGTGTATCCG GCCGGTGAGT TCGCCAGCGC CGGCGCCGAC
GACACCAGCG CCGGTGCGCG CCTGGCCCGC GCGCTGAGCG GCGCGCTCAG CAGCCATTTG
CACCACGAGC TGGGCGCGCT GGCCGAGGCG CTGTGGCTGC CGGGCATGGA CCTCATGGAC
GAAGATCAGG CGGCCTCGGC GCCCGCGCTG CCGCCGGAGT GGGCCGAGGT GGTCGACGCC
ATCCGCGCCG GCACCACGCG GCTCTCGCCC TGGGCAGTGC CCGAGCATCG CAACTACCCG
GCGCCGCGGG TGGCGCGCGT GGCCTCGAGC GACATGCGCG CGGTCGCCGC GGGCGCGCGG
GCGCTGGCCA GCGCCGCGCA GATGCCCCTG TACCGCGTCG ACCTCGGCTA CTTCCTGGCC
GCCGAGCCGG CCGCTGGCCG GGCCGCCTGC GCGCGCGTGT TCGCGGCCGC CGAGCGCGCC
GGCGCCATGC TGTTGCTGGT GCCCGTCGAC CGCTTGGCCC TGCAGCACGC CGACAGCCTC
CAGCTCGCCA ACGCCCTGGC CGCCCAGCTC GCGGACGCCA CCATCCCCGT GGTGCTCGCG
GGCGCGCTGG CCGCGCTGCC CGTGGCCATC GAGAGCCGCA TCGACCACGT GCTGGGCGAC
ATCGGCTCGG CCGCTCCGTC CGACGTCCCG GCCTGA
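
The record lists a GC content of 75% but does not say how it was computed. A minimal sketch, assuming it is the plain fraction of G and C bases over the gene sequence, is shown here. The helper name `gc_content` is hypothetical, and the example runs only on the first ten-base block of the sequence above rather than the full gene.

```python
def gc_content(seq):
    """Fraction of G and C bases in a sequence.

    Whitespace (the spaces and line breaks used in the listing above)
    is removed before counting. This is an assumed definition, not
    necessarily the one the database uses.
    """
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc = sum(1 for base in cleaned if base in "GC")
    return gc / len(cleaned)


# First ten-base block of the gene sequence, for illustration.
first_block = "ATGGCCACGA"
print(f"{gc_content(first_block):.0%}")
```

Passing the full gene sequence, with or without the spacing used in the listing, would give the whole-gene figure for comparison with the reported value.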
 
Protein sequence
MATSEVEAAL ESALAEGTLG LARRATQECR ALLRLAEAGA REGDPRWDEV FATRAAWGPL 
LYGLLDKAEG RLPPLMQALL IFALEPVDTT ILVVLWGLAG SASLRQRAQR IERAWEQDRV
DELFADRTDD DDDDDGDADA DADADADDDS EVDVGIGSAP DDAADADAEM VLGHSAEFIV
RLCYPDLRRQ RKALAALRPQ GSLRRFVLVR SSDVRSADPR SERFELDPDL ASALLNAWRP
PSGLEGVVRS GVGAGRALHP SQTAQASALA QVLQPPQQRV ALLGRPGAGK RTLIRAVAAS
AGMPLIELTL DALERAPQKR RHLLRRLQRD ALLSRGILYI QLDGPLPPPV LREVFAAVPG
RLVLGVPVDA TAAPAHGEAL RAHWPDIHLQ PLPSIAVASQ PALWRACLSE RGLPVDAAGL
DAQLEAHVCR PGLVLGDFVH ACEIVRSVYP AGEFASAGAD DTSAGARLAR ALSGALSSHL
HHELGALAEA LWLPGMDLMD EDQAASAPAL PPEWAEVVDA IRAGTTRLSP WAVPEHRNYP
APRVARVASS DMRAVAAGAR ALASAAQMPL YRVDLGYFLA AEPAAGRAAC ARVFAAAERA
GAMLLLVPVD RLALQHADSL QLANALAAQL ADATIPVVLA GALAALPVAI ESRIDHVLGD
IGSAAPSDVP A