Gene Hoch_3864 details


Gene Information

Locus tag: Hoch_3864
Symbol:
ID: 8546259
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 5318124
End bp: 5319557
Gene Length: 1434 bp
Protein Length: 477 aa
Translation table: 11
GC content: 71%
IMG OID: 646388535
Product: hypothetical protein
Protein accession: YP_003268256
Protein GI: 262197047
COG category:
COG ID:
TIGRFAM ID:
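
The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the protein length excludes the stop codon. Neither convention is stated in the record, and the function names are hypothetical.

```python
def cds_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming the final codon is a stop."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


# Values taken from the record above.
gene_bp = cds_length(5318124, 5319557)
print(gene_bp, protein_length(gene_bp))  # prints "1434 477"
```

Under these assumptions the computed values agree with the listed gene length (1434 bp) and protein length (477 aa).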


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.571243
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.0690788
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAATCAA ATCGACTGAG CGACGATAAA ACAGCAGCCA ACCCTTCGAA TCTTTGGGAA 
TCTGGGAGTG GCGCCGCGCA GCGCTGTGTG CGCCTGCGTC TCGCTGTGAC AAATTGCACC
ACGTCGCTCC GCTTCGGCGC TTCGCTGTTG GCGCTGTCGC TGGCGGCCGC CGGCGCGCTG
CTCGGCGGCT GCGCCGTGGT GACGCCGAGC GAGGGCGGCG AGCACATCAC CGCCGACGAC
ACCGCCACCG CCGTGGGCTC GCTGTCCACG CACAACCGCC TGGCGCTCAA CCGCCTCGCC
CTCAACCGCC TCGCCCTCAA CCGCCTCGCC CTCAACCGCC TCGCCCTCAA CCAGATCTCG
GCGACGCAGT TCGCCTTCGA CCCCGGGCAC GAGATGCTCG ACACCCAGGA GGAGCGCGAG
GTGCTGGCCT ACGTGGTCGA GTGCGCGCTC GCCGAGGGCG TGGTCGTCGT GGCCGAGACC
GAGCACGGCA GCTACGAGTT CCCGGGTCTG CTCGGGCTGG CCCCTGGCTG GGCCGAGCAG
ATGCCGAGCC CGGGCGATCT CGAGCGCGTG TCAGCGTGCC TGCTCGCGCG GGTCAACGCC
TTTGGCGAGT CGGTAATGCT GTCGCTGCGC ACGCCCGGCC TGCTCGACGC CGCCGCCTCC
GAGCGCGCCA GCTATCCGGT GTACGAGGGC GCGTTTTTCG GCCAGGTGTT TGCCGAGGAC
GAGGCCGACC AGCACTTCTA CGCCTGCCAG GGCAGCCCGG CCGACACCGC CGTGTTGCAC
GCGGCCTCGC GCGCGCTGCG GGTGTGCACC GACGCCGACA GCGAGTGCGG CATCACGGCG
CTCGGCCGCT GCCGCGATCT CTGCGACAGC TACACCGACG AGGGCGGCTG GAGCGGCTGC
TGGGCCGAGG GCGCGCGCCA CGACCAGACC ATCAACGTGT ACCTCGCGGC CAGCGACATC
GACGGCGCCA ACCAGCGCTG CGACGGCGGC CCCTGCCAGA TGCGCGCCCG CGACGGCGAC
ACCGCCATCC TCGACTGCGA GGGCCAGGAC TGCCTGACCA CCTGCGAGGA GGGCGCGACC
TGCGCGATCG CGGCCGCCAA CGTCGACGAG CTCGACGCCA GCGTGCGCAG CACCGCGCTC
GCCGAGATCG ACTGCTACGG CGCCAACACC TGCCAGCTGA GCTGCGACCA GGGCGCCTAC
TGCGCGACCG AGTGCGCGGG CACCAACAAC TGCGATGTCA CTTGCCGCGG CGGCGCCGAG
TGCGAGGTTT TCTGCGGTTC CGAGGCCAAC AACTGCGACC AGCTCGTGTG CAAGACCGAC
GCGCGCTGCC TGCTCACCTG CAGCAGCGCC AACAATTGCA GCTTCGCCGT GTGCGAGGGC
GGCGAGACCC AGTGCGGGGG CGGCGTGGTG GTGTGCAACC GGCCCTGCCC CTGA
 
Protein sequence
MQSNRLSDDK TAANPSNLWE SGSGAAQRCV RLRLAVTNCT TSLRFGASLL ALSLAAAGAL 
LGGCAVVTPS EGGEHITADD TATAVGSLST HNRLALNRLA LNRLALNRLA LNRLALNQIS
ATQFAFDPGH EMLDTQEERE VLAYVVECAL AEGVVVVAET EHGSYEFPGL LGLAPGWAEQ
MPSPGDLERV SACLLARVNA FGESVMLSLR TPGLLDAAAS ERASYPVYEG AFFGQVFAED
EADQHFYACQ GSPADTAVLH AASRALRVCT DADSECGITA LGRCRDLCDS YTDEGGWSGC
WAEGARHDQT INVYLAASDI DGANQRCDGG PCQMRARDGD TAILDCEGQD CLTTCEEGAT
CAIAAANVDE LDASVRSTAL AEIDCYGANT CQLSCDQGAY CATECAGTNN CDVTCRGGAE
CEVFCGSEAN NCDQLVCKTD ARCLLTCSSA NNCSFAVCEG GETQCGGGVV VCNRPCP
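
The sequences above are printed in space-separated blocks of ten residues, so the whitespace has to be stripped before any length or composition check. The following is a minimal sketch of that step. The helper names are hypothetical, and the GC calculation counts only G and C over the full sequence length, which may differ from how the record's GC content figure was derived.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence as shown in the record.
first_line = "ATGCAATCAA ATCGACTGAG CGACGATAAA ACAGCAGCCA ACCCTTCGAA TCTTTGGGAA"
seq = clean_sequence(first_line)
print(len(seq), seq[:3])  # prints "60 ATG"
print(round(gc_fraction(seq), 3))
```

Passing the complete gene sequence through `clean_sequence` and then `gc_fraction` gives a value that can be compared with the GC content field in the Gene Information section.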