Gene Hoch_1723 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_1723 
Symbol 
ID8544105 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp2341763 
End bp2343739 
Gene Length1977 bp 
Protein Length658 aa 
Translation table11 
GC content72% 
IMG OID646386430 
Producthypothetical protein 
Protein accessionYP_003266165 
Protein GI262194956 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.165254 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGAAA TCACCGCCCC ATCCAACCAC CCCGCCGCCA GCGGCACCGA CCACACCGAG 
GGCTCGCTGC CGTGCAACGG CGTCGACGGC GTCACCGGCC AGTACCTGGC CACCCCGAGC
AGCGTCGAGC AGTTCGCTGA GCTGGCAGTA TCCGGGCGAT TGGACGAGGT CGAACGCCTC
GACGTGCAGC GCAAGCTCCG CGCCCAGCGC AGCGCCGATT TCGCCCTCGC CGAAGGTCTC
GACGCCAACC AGCTCGGCGA CGCCGGCTGG GCCGTGGTCT TTCCCTTCGC CGCGGCCGGC
AGCGAGGCCG AGCGCGCCCA GTTGGCGCTG CGCGAGGCAT TGTCGCCGCT GCTCCAGCAC
CGCGCGCGCC AGGTCGGCAG CAGCCGCTAC CGCGAGTACC TGGGCGAGCT CGGCTACCGC
ACCGGCGAGA GCAAGCAGCA ATACCTCGCC CGCCTGGGCG CCGGCCCCGG CCCGGTCAAC
CCCGACAAAG TGCCCTACTA CCTGCTGCTG GTGGCCTCGC CCGAGGACAT CCCGTACCGC
ATTCAGTACC AGCTCGACGT CCAGTACGCG GTCGGCCGCA TCCATTTCGA CCAGATGGAC
GACTACGAGC GCTACGCGCG CAGCGTGGTC GAGGCCGAGA CCCGGCCGCC GCGGCGACGC
CGGGCCGCGT TCTTCGCCGC GGCCAACCAG GGCGACCGCG CCACCCAGCT CAGCCGCGCC
TACCTGGCCG AGCCCCTGAC CGAGATCTGT GCCCTGAGCG GCCGCGCCGC GGGCTGGGAG
ATCGAGCACT ACCTGGGCGC AGACGCCACC CGCGCGCAGC TCGGCGCGCT CTACGGCGAC
GCGCCGGCGC TGCTGTTCAC GGCCAGCCAC GGGGTCGGCT TCCCGGCCGG CCACGCGCTG
CAACGCGCGC ATCAGGGCGC GCTGCTGTGC CAGGACTGGG GCGGCCCAGG CAGCGGACTC
GCGCGCGAGC ACTACTTCGC CGGCGAGGAC ATCGACCCCG GCGCCGACCT GCGCGGCCTC
ATCACCATGC ACTTCGCCTG CTACGGCGGC GGCACCCCGG CGCGCGATGA TTTCGGCGCC
GGCGCGCGCA CCATCGCGCC CCGCGACTTC GTCGCCGCCC TGCCCCAGCG CACGCTCGCC
CATCCCGGCG GCGGCGCCCT GGCCTGCGTC GGGCACATCG ACCGCGCCTG GTCGAGCTCA
TTTCTGTGGA TCGACAGCCG CGCCGAGCGG CCCTCGCAGG CCCACGTCGA TGTCTTCGAG
AGCACGCTGC GCGGACTCCT CGGCGGCAAA CGCCTGGGCT ACGCGCTCGA GTACTTCAAC
ACCCGCTACG CCGAGCTGGC CGCCGACCTC AGCGCCCGCC TCGAGGACAT CGAGCGCTAC
GAGGGCGAGC GCAACGATCG CGAGCTGGCG CAGATGTGGT GCCTGCAGAA CGACGCCCGC
AACTACGCCG TCATCGGCGA CCCCGCCGTA CGACTGGGCG AAAACGCCGA CCCGGCCGAG
CATACCCGGC CCACGATCGA GCTGCGCCAG GACCCCGGCG AGGGCGGCCG CGCTTCCCGC
CAACCCGAGC CCGAGGCGCC CGATGCCGGC TTCGCGGACG CATCGAGCTA CGGCCTGTTC
AGCCGCGACA AGCGCGAGCA GCCCGGCATG CTGGCGCGCT TTGCCGACCG CGTGGCCGAA
ACCCTGGGCA ACGCCATCGC CGACGCCACC ACCCTCGAGG TCAAGACCTA CGTGAGCCGC
GACCTGCAGC GCGCGCTCGC CCAGGGCGGA GCCGCCGACG AATCCACCGA GCTGCGCGCC
TACACCCGCT GCGCGCTCGA CGGCGACACC GAGGTGTGCG TGCCCGTGGA CGGTGATGGC
AGCGTCGACC AAGCGCTGTG GCAGTTGCAC GTCCAGATGG TCGAACAAGC GCAAAAGCAC
CGCGCCGCGC TCATCGAGAC CGCGCTGTCG CTGATCGCGC CGCGGTGGGG TGAATGA
 
Protein sequence
MSEITAPSNH PAASGTDHTE GSLPCNGVDG VTGQYLATPS SVEQFAELAV SGRLDEVERL 
DVQRKLRAQR SADFALAEGL DANQLGDAGW AVVFPFAAAG SEAERAQLAL REALSPLLQH
RARQVGSSRY REYLGELGYR TGESKQQYLA RLGAGPGPVN PDKVPYYLLL VASPEDIPYR
IQYQLDVQYA VGRIHFDQMD DYERYARSVV EAETRPPRRR RAAFFAAANQ GDRATQLSRA
YLAEPLTEIC ALSGRAAGWE IEHYLGADAT RAQLGALYGD APALLFTASH GVGFPAGHAL
QRAHQGALLC QDWGGPGSGL AREHYFAGED IDPGADLRGL ITMHFACYGG GTPARDDFGA
GARTIAPRDF VAALPQRTLA HPGGGALACV GHIDRAWSSS FLWIDSRAER PSQAHVDVFE
STLRGLLGGK RLGYALEYFN TRYAELAADL SARLEDIERY EGERNDRELA QMWCLQNDAR
NYAVIGDPAV RLGENADPAE HTRPTIELRQ DPGEGGRASR QPEPEAPDAG FADASSYGLF
SRDKREQPGM LARFADRVAE TLGNAIADAT TLEVKTYVSR DLQRALAQGG AADESTELRA
YTRCALDGDT EVCVPVDGDG SVDQALWQLH VQMVEQAQKH RAALIETALS LIAPRWGE