Gene Hoch_2686 details


Gene Information

Locus tag: Hoch_2686
Symbol:
ID: 8545073
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 3705784
End bp: 3707487
Gene Length: 1704 bp
Protein Length: 567 aa
Translation table: 11
GC content: 74%
IMG OID: 646387380
Product: GCN5-related N-acetyltransferase
Protein accession: YP_003267109
Protein GI: 262195900
COG category:
COG ID:
TIGRFAM ID:
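
The coordinates and lengths listed above are mutually consistent. The following minimal Python sketch checks this, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. Neither convention is stated in the record, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one untranslated stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Gene Information fields of this record.
length = gene_length(3705784, 3707487)
print(length, protein_length(length))
```

Under these assumptions the computed values agree with the listed Gene Length (1704 bp) and Protein Length (567 aa).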


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0893187
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.171928
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTATCG ACGTCTCCCC TATCCGTGAC TTCGCCGAAC TCGAGGCGCT GGCTCAGCCC 
TGGCGGGAGC TGGCCGCCTC GGGCGGTCCC GGCGGGCTGT TTCGCGGCCC CGATTGGCTG
CTGGCGTGGT GGCGCGCCTA TCACCAGGTG CTGCACGCCG AGCTGTTCGT GCTCGCCGCC
CGCGAGGGCG ATCAGCTCGT CGGCCTGGCG CCGCTCTACA CCCGCGTGGC GCGGCGCGGG
CCGGGCTTCA AGGTGCGCGA GATCCGCCTG CTCGGCGACG CCGGCCCGCG GCCGCCGGCG
CTCGACTTCC TGTTTCGCCC GGGCTACGAG GACCGCGTCG GAGCGGCCTG GGCCAAGCAC
CTCGACGCCT GCTCGGACGA TTGGGACGTG ATCGAACTCG AGCCGCTGCG CGATCCCTCG
CGCGGTCGCG GCGTGCTGGT GAGCCGCCTG GGCAACTCGG ACTACGGCGT GCGCACCTCG
CACGCGGCCG GGGGCGCGCT GCGCATCGCC CTGGGCGTGG CCGGCAACGA GGTGCCCGAC
GAGGGCGACA GCGAGGTGGT CACCCACTAC GACGACGTCG ACGCCCTGCG CAAAGGGCTG
TCGTCGCTGC GCCGGCTGTC GCGTCTGGAG TGGGCGCACC GCGAGGAGTC GAGCCCGCTG
GCCGATCGCG AGGCCTATCA GCTCCTCGAA GAGGTCACCC TGCGCCTCGG CAGCCAGCAC
CACGCCCACC TCACCCGCCT CGACGACGCC AGCGGCGAGG CCATCGCCAT CGCCCTGGTG
GTCGATGACG GCGAGCGCGC CGTGGTCCTG GCGCTGGCCG TGGATCCGCA GCACGAGGAG
CACGCGCCCG CGCGCATCCT CACCGACGAG GCGCGCACCG CGACCGAGCG CGGGCGCGCT
GGCCTCGACG TGGTCCCGGG CGCGGTCGAG CACGGCATGC CGAGCCTGCC GACCACGCGC
CAGCGGCCCG TGAGCCTGCA GATCTACAGC AGCTCGACGG CCGCGGCCAT GGCCCGCACC
TACGGCGCCG TGCGCCAGCG CGTCGAGGCC GCGCGCGAGG CTCCGGGCAC CGCGGCCGCC
AGCGCGCGCG CTGCGTGGGC CAAGATCCGC ACCGCGGCCG CGCCCGTGGC CGGCTACTCG
CGCATGCACC TGTACCGCGG CGAGCTGTGG ACGCGCGGCA TCGCGCCGCC CGAGAACCTG
GTCCTGGGGA CCCTGTCACA GGACGAGTTC GACGCCCTGG ACGAGACCGC GCGCGGCGAG
ATGGTCAAGC TCCTGCACCT CGACGAGGAC TACTGCCGGC AGAAATGGCA GCGCGGCGAC
ATCGTCGTGC TGGCCCGGCT GCAGGGCCGC CCCGCCGGCA TCGCCTGGTG CGCGCGCGGC
GCCGTGCGCG TGCCCGAGCT CGACCGCGAG CTGCACCTCA GCGCCGACGC CGCCTACATC
CATGACGTCT TCGTGGCCGC GGCCGCGCGC GGTCGCGCGG TGGCGCCGGC GATGCTCGAG
CACCTCTCGG GCATCCTGCG CCAACGCGAC GTCTACCGCA GCTGGGCCCT GATCGGCAGC
GACAACACCG CCTCGGTGCG CGCCTTCGAA AAAGCCGCGT ACACGGCCGT GGCCGACGTG
GTCTACGCAC ACATCGCGAC CGTCGATCAC ATCATGGTTC GGCCGCCCGA TCCCGAAGCC
AAGCAGCTCC TGGGGCTCTC TTGA
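
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be joined before any analysis. The sketch below shows one way to do that and to compute a GC fraction that could be compared with the GC content field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(blocked.split()).upper()


def gc_fraction(blocked: str) -> float:
    """Fraction of G and C bases in a (possibly blocked) DNA sequence."""
    seq = clean_sequence(blocked)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence from this record (60 bases).
first_line = (
    "ATGACTATCG ACGTCTCCCC TATCCGTGAC TTCGCCGAAC TCGAGGCGCT GGCTCAGCCC"
)
print(len(clean_sequence(first_line)), round(gc_fraction(first_line), 3))
```

Passing the full sequence text (all lines) to `gc_fraction` gives the value for the whole gene.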
 
Protein sequence
MTIDVSPIRD FAELEALAQP WRELAASGGP GGLFRGPDWL LAWWRAYHQV LHAELFVLAA 
REGDQLVGLA PLYTRVARRG PGFKVREIRL LGDAGPRPPA LDFLFRPGYE DRVGAAWAKH
LDACSDDWDV IELEPLRDPS RGRGVLVSRL GNSDYGVRTS HAAGGALRIA LGVAGNEVPD
EGDSEVVTHY DDVDALRKGL SSLRRLSRLE WAHREESSPL ADREAYQLLE EVTLRLGSQH
HAHLTRLDDA SGEAIAIALV VDDGERAVVL ALAVDPQHEE HAPARILTDE ARTATERGRA
GLDVVPGAVE HGMPSLPTTR QRPVSLQIYS SSTAAAMART YGAVRQRVEA AREAPGTAAA
SARAAWAKIR TAAAPVAGYS RMHLYRGELW TRGIAPPENL VLGTLSQDEF DALDETARGE
MVKLLHLDED YCRQKWQRGD IVVLARLQGR PAGIAWCARG AVRVPELDRE LHLSADAAYI
HDVFVAAAAR GRAVAPAMLE HLSGILRQRD VYRSWALIGS DNTASVRAFE KAAYTAVADV
VYAHIATVDH IMVRPPDPEA KQLLGLS
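
The protein sequence can be re-derived from the gene sequence. Translation table 11 assigns the same amino acids to codons as the standard genetic code and differs in which codons may act as start codons. The following is a minimal sketch under that assumption: it does not handle alternative start codons (this gene starts with ATG), it stops at the first stop codon, and the name `translate` is hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a (possibly blocked) DNA sequence up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
print(translate("ATGACTATCG ACGTCTCCCC TATCCGTGAC"))  # → MTIDVSPIRD
```

The output matches the first block of the listed protein sequence. The final 15 bases of the gene (`CTGGGGCTCTCTTGA`) likewise translate to `LGLS` followed by a stop.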