Gene Hoch_5619 details

Gene Information

Locus tag: Hoch_5619
Symbol:
ID: 8548033
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 7714940
End bp: 7716052
Gene Length: 1113 bp
Protein Length: 370 aa
Translation table: 11
GC content: 71%
IMG OID: 646390290
Product: hypothetical protein
Protein accession: YP_003269992
Protein GI: 262198783
COG category:
COG ID:
TIGRFAM ID:
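
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(7714940, 7716052)
print(length, "bp")
print(protein_length(length), "aa")
```

Under these assumptions the computed values agree with the reported 1113 bp and 370 aa.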


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.162729
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGGTTC GCGGATCTCT CCGAACGATG TCCATCGAGG ATGTGTTGAG CTGGCTCAGC 
CGCCGTCTGA GCCGAGGCAC GCTCACGGTC GAGCACGCGG CCGAGGCGCG CAGCTTCGTG
TGCGACTCGG GCTATGTGGT CGAGGCCAGC TCCAACCTCG CGCGCGAGGA GCTGGGCCAG
ATGTTGCTGG ACGCCGGCTG GATCGACCAG GAGGCGCTGG CCGAGGCGCG TTCGGTGCAG
GCCGACACCG GGGTGTCGCT GGCGCGCATC CTGCGCATGG TCGGCAAGCT CGACGAGGAG
CGGTTGCGCG CGGTGCTCGA GGAGCGCGCC CTGCGCGCGG TGCTCGAGAT CTTCACCTGG
GAGGACGGGA GCTTCGCCTT CGAGCGGATG GATGAGGCGC CCTCGCCCGA CGTCGCGGTG
GCGCTGCAGC TGCCGCTGTG CATCGAGAAG GGGCGCGAGC GCAGCAACCG CTGGCGCGAA
CTGCAGGAGC GCATCCCCGA CGAGGATATC GGCATCGAGC TACTCGACAG CGGCGCGCTG
GCCAAGTCCG GCGACAGCGA GGCCGATCGC CGCGCGCATC TGCGCGTGGC CCAGGTCGTG
CGCGAGCGGG CCGGGGAGGG CGAGATCCTG AGCGTGGCCC AGCTCGCGGC GCAGCTCGGC
TGGACGCGCT TCCGCACCAT GGACCGGGTG GTCGTGCTGT GCGACCGCGG GGCGCTGGGG
CTGCAGGAGC AGGCGTCCTC GGCCGGGGTC GATTTTCTGC TCGAGGAGTC GCTGCGACTG
TCCGAGGAGG GCGATCGCCT GGGCGCCTTC GAGCTGGCGC GGCGCGCTCA TCAGAGCGAG
CCCGAGCTCG CCAAGGAGCG CTACGAGACG GCCGAGCGGG CGCTCTTCGC CGAGCTCTCG
CGCGAGCTCC TGGCCAGCTT TCGCGTGCCC AAGCTGCTCA TCGAGCGCGG CTCGCTCGAG
TCCATGGACC TGTCGGACGC CGAGCGCACG CTGGCCCAGC GCGTCGACGG CCGCTGGGAT
CTGCTGTCGC TGATGCGCGC GTCTTCAGTG CGCGAAGCCG AGGCGCTGAT CACGCTCAAG
CGGCTCGCCG ACCGCGGTAT CATTTCTCTC TGA
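
The GC content reported above can be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted as plain text in the blocked layout shown here (groups of ten bases separated by spaces and line breaks). The function name `gc_content` is illustrative, and the example input is only the first 20 bases of the gene, not the full sequence.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First two 10-base blocks of the gene sequence, used only as a short demo.
fragment = "ATGTCGGTTC GCGGATCTCT"
print(f"{gc_content(fragment):.1f}% GC in the first 20 bases")
```

Passing the full 1113-base sequence instead of the fragment gives a value to compare with the GC content listed in the gene information.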
 
Protein sequence
MSVRGSLRTM SIEDVLSWLS RRLSRGTLTV EHAAEARSFV CDSGYVVEAS SNLAREELGQ 
MLLDAGWIDQ EALAEARSVQ ADTGVSLARI LRMVGKLDEE RLRAVLEERA LRAVLEIFTW
EDGSFAFERM DEAPSPDVAV ALQLPLCIEK GRERSNRWRE LQERIPDEDI GIELLDSGAL
AKSGDSEADR RAHLRVAQVV RERAGEGEIL SVAQLAAQLG WTRFRTMDRV VVLCDRGALG
LQEQASSAGV DFLLEESLRL SEEGDRLGAF ELARRAHQSE PELAKERYET AERALFAELS
RELLASFRVP KLLIERGSLE SMDLSDAERT LAQRVDGRWD LLSLMRASSV REAEALITLK
RLADRGIISL