Gene Hoch_2102 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_2102 
Symbol 
ID8544488 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp2916224 
End bp2917630 
Gene Length1407 bp 
Protein Length468 aa 
Translation table11 
GC content70% 
IMG OID646386809 
Productprotein of unknown function DUF21 
Protein accessionYP_003266540 
Protein GI262195331 
COG category[R] General function prediction only 
COG ID[COG1253] Hemolysins and related proteins containing CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.573488 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.797611 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCTGT TCTCGATCGT CGCCATCACC GTGCTCATCG TGCTCAACGC GCTGTACGTG 
GCCGCCGAGT TCGCCATCGT CGCGGCCCGG CGCTCGATCA TGGATGAGAA GGCGCGCGCG
GGTAATCGGG TGGCCGTGCA GGTCGCGCGG GTGCTCGACG ACGACCAGCT CAAAGACCGC
TACATCGCGG CCGCTCAGCT CGGCATCACC CTGGCCAGCC TGGGGCTCGG CATGTACGGC
GAGAAGGTCT TGGCCCACAG CATCGAAACT TGGCTCGGCA GCAGCGATGT GCCGGCGTGG
CTGGCCTCGC ACGCGATCGC CGGTCTGTGC GCGGTGCTCA TCCTCACCTA TTTTCACATC
GTCCTGGGCG AGATGGTGCC CAAGGCCATC GCCCTGGCCA CGCCCGAGCG CGCGGCGCTG
TGGCTCTCGC CCGTCATCCG CACCACCCAG TTCGTGGCCT ATCCGCTGAT CGCCGGGCTC
AACGGCCTGG GTAACCTGCT GCTGCGCGGC CTGCGTCTGT CCGACGACAA CAGCAAGCAC
TACTACACGC CGGAAGAGCT GCAGTACGTC GTGCGCGAGA GCGAGGAGCA GGGCCTGCTC
GGCCACGAGG CCGCCGAGGT GGTCGACGAG CTGCTCGAGT TCGGCACGCT CACGGCCCGC
GAGGTCATGG TCCCGCGGGT CAAGATCCGC GGCGTCGAGC TCGGCAGCGT GCTCTCGGCC
GCGCGCGAGA CCCTGCGCGC CGCGCCGCAC ACGCGCTATC CCGTGTTTGC GGGTGATCTC
GACCATATCG TCGGCACCGT GCACATCAAA GACGTGCTGC GCCGCACCCG GCAGTCGCGC
TCGACCATCA CCCAGGCCGA CATCCACCCG GTGCCCTTCA TCCCCGAGAC CTCGACCCTC
GACAAGGTGC TGGCCGCCAT GCGCCAGTGG CGCACGCAGA TGGCCATCGT CATGGATGAG
CACGGCGGCA CCGCCGGTCT GCTCACCATC GAGGACCTGT TCGAGGAGAT CATCGGCGAC
ATCGACGAGA CCACGGCGGC CGCGCGTCTG CCCGACATCT TCCGCGAGGA GGACGGCAGC
CTGCGCGTCG TCGGCACCGT CCGCGTGGAC GAAGTCGGCG AGTCCCTCGA GCCCGAGCGC
GAGCTCGAGC ACGAGGAGGT CGACACCGTG AGCGGTCTGG TGCTGGCGCT GCTCGACCGG
CCCCCCGAGG TCGGCGACGC CGTGAGCTTC GAAGGCCTGC GCTTCGAGGT GTGCGCGGTC
GAGGGCCACG GCGTGGCCGA GTGCCGGGTC ATCCCCCTCG AGCCTGACCC GACCTCCGAA
GGCGAGGACG CGGGCGAGGG CGAGCCGCAC GCGAGCGGCG CGGAGCAGGG CGCTAACGCC
GAGCGAGGCG ACGAGCGCGA CGCATAG
 
Protein sequence
MSLFSIVAIT VLIVLNALYV AAEFAIVAAR RSIMDEKARA GNRVAVQVAR VLDDDQLKDR 
YIAAAQLGIT LASLGLGMYG EKVLAHSIET WLGSSDVPAW LASHAIAGLC AVLILTYFHI
VLGEMVPKAI ALATPERAAL WLSPVIRTTQ FVAYPLIAGL NGLGNLLLRG LRLSDDNSKH
YYTPEELQYV VRESEEQGLL GHEAAEVVDE LLEFGTLTAR EVMVPRVKIR GVELGSVLSA
ARETLRAAPH TRYPVFAGDL DHIVGTVHIK DVLRRTRQSR STITQADIHP VPFIPETSTL
DKVLAAMRQW RTQMAIVMDE HGGTAGLLTI EDLFEEIIGD IDETTAAARL PDIFREEDGS
LRVVGTVRVD EVGESLEPER ELEHEEVDTV SGLVLALLDR PPEVGDAVSF EGLRFEVCAV
EGHGVAECRV IPLEPDPTSE GEDAGEGEPH ASGAEQGANA ERGDERDA