Gene Hoch_3307 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_3307 
Symbol 
ID8545695 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp4561148 
End bp4562167 
Gene Length1020 bp 
Protein Length339 aa 
Translation table11 
GC content69% 
IMG OID646387974 
ProductMammalian cell entry related domain protein 
Protein accessionYP_003267702 
Protein GI262196493 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1463] ABC-type transport system involved in resistance to organic solvents, periplasmic component 
TIGRFAM ID[TIGR00996] virulence factor Mce family protein 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.185146 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.246163 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCACCC GCAAGATCGA GCTACGAGTC GGCATCCTCA TCGCGGTCGC CGTGATCACC 
TTGGTGGGCT TCATCATCGT GCTCGGCAAC TTCGCGTTTG GCGGTGGCTA CACCCTGTAC
GTCGACCTCG AGTTCAGCGG CAACATCCAG ATGGGCGCGC CCGTGCGCAT CTCGGGCATC
AAGATGGGCC GCGTGAGCAG CGTGGACTTC TGGGGCGGCA AGATCGACGA GTCCCAGGGC
CGGCGCGTGC AGGTGCGCCT GGAGGTGTGG CTCAAGGACG ACGCCCGCGA GGCCGTGCGC
CAGGACGCCG AGTTCTACAT CAACACCCAG GGCGTGCTGG GCGAGCAGTA CCTCGAGATC
GTGCCCGGCC GCGACTACCA GTCACCGCCG CTGGCGCCCG GGACCATCGT GCTCGGCAAC
AGCCCGCCGC GCATGGATCT GCTGCTCTCG CGCCTGTACA CCGTGCTCGA GTCGGTGTCC
GAGGTGCTCG ACGAGGAGCG CGAGGTGCTG CGCGAGCTGT TGCACAACAG CGCCAGCGCG
GTCGCCGAGG TCAACCGCCA GCTCGTGGAC AACCGCGACA CCATCTCGCA GCTCCTGGTG
GGCGCCAACG AGCTCACGCG CGAGGCCACC GCCACCCTCG GCCGCGTCAA CGAGGGCCTC
GGCGACCCGC GCATCATCGG CAAGACCGTG CGCGACGCCG ACGCCTTGCT GGTCACGGCC
AATCGCTCGC TCGACACCCT CACGCCGTCG ATCGACCGCT TCCTGGGCGA CGCCACCCGC
GCCACCGGCC TGCTCACCGA GGAGCGCGTC GACCGCGCCA TCGCCGTGGT CGACAAAGCC
GCGAGCGCGG CCACCAAGGC CGGCGGGCTC ATCGACAACG TCGACGGCCT GGTCACCGAC
CTGCGCAGCG GCAAAGGCAC GGCCGGCGCC CTGCTCGTCC GCGAGGACAT CTACGCCGAC
CTGCGCGAGC TTCTGCGCGA CCTCAACCGC AACCCCTGGA AGCTGTTCTG GAAGGAGTAG
 
Protein sequence
MGTRKIELRV GILIAVAVIT LVGFIIVLGN FAFGGGYTLY VDLEFSGNIQ MGAPVRISGI 
KMGRVSSVDF WGGKIDESQG RRVQVRLEVW LKDDAREAVR QDAEFYINTQ GVLGEQYLEI
VPGRDYQSPP LAPGTIVLGN SPPRMDLLLS RLYTVLESVS EVLDEEREVL RELLHNSASA
VAEVNRQLVD NRDTISQLLV GANELTREAT ATLGRVNEGL GDPRIIGKTV RDADALLVTA
NRSLDTLTPS IDRFLGDATR ATGLLTEERV DRAIAVVDKA ASAATKAGGL IDNVDGLVTD
LRSGKGTAGA LLVREDIYAD LRELLRDLNR NPWKLFWKE