Gene Hoch_1888 details


Gene Information

Locus tag: Hoch_1888
Symbol:
ID: 8544270
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 2597632
End bp: 2599221
Gene Length: 1590 bp
Protein Length: 529 aa
Translation table: 11
GC content: 68%
IMG OID: 646386593
Product: Mammalian cell entry related domain protein
Protein accession: YP_003266328
Protein GI: 262195119
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1463] ABC-type transport system involved in resistance to organic solvents, periplasmic component
TIGRFAM ID: [TIGR00996] virulence factor Mce family protein
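
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length counts the stop codon; the function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
START_BP = 2597632
END_BP = 2599221
GENE_LENGTH_BP = 1590
PROTEIN_LENGTH_AA = 529


def span_length(start, end):
    """Length of a 1-based, inclusive coordinate span (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by a CDS whose length includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both comparisons hold: the span covers 1590 bp, which is 530 codons, or 529 residues plus a stop codon.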


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAAGTCCT TATCCAACGG GATCAAGGTC GGCATCCTCT TCGTCGTGAT GGTGCTCGGC 
ACCTACGGCG TGTGGTCGGC CGTGACCGCG CCCTCGAGCG GCGAGAAGAG CTTCGAGCTC
GGCGCCATGT TCCGCGACGC CTCGGGCCTG CCCAAGGGCT CGCGCGTGGT CGTCGCCGGT
CTGCCGGTGG GCGAGATCAT CGACCTCGAC ATCGAGGGCC GCTACGCGCG CATCGGCTTC
CGCGTGCGCG AGGATCTGCC GGTCTGGTCC AACGCCATCG TGTTCAAGAA GTCATCGTCG
CTGCTCGGCG ACTACTACCT CGAGCTCGAC CCGGGCACGC CCGAGGCCCT GGACGCCACG
GGCAACATCG TCACCAACAC CCGCCTGGGC GCGGATGACA CCGTGGCCAC CGTGGTCGAG
GCGACCTCGC CCGACGAGCT GCTGCGCCGC ATCGACGAGA GCATGCCCAA CGTGGACAAC
GTGCTGCTGT CGGTGCGCGA CCTGAGCGAG GATCTGCGCC GCGTGGTCAA CGGCCCGCTG
CTGTCGGTGA GCGAGCGCAT CGACGGCCTG GTGCAGACCG AGTCCGAGAC CGTGGCCCGC
ATCCTCGAGC GCCTCGACCG CAGCATGGTC AATATCCAGG CCATCACCAA TGACGTCCGC
GACATCACGG GCGGCCGCAA CTCGCAGATC GACCGCATCC TCGAGGAGCT GGAGGCGGCC
TCGAGCGAGG CCCGCAACCT GGTGGTGAGC GCGCGTACCG AGGTCGAGCA GACCGGCTCG
AAGGTGCGCG AGAAGCTCGA CATGGTCGAC GACATCATGG CCAACACCAG CTCGATCACG
GGCAAGATCG ACGAGGACGA GGGCACGCTC GGCCGCCTGG TCAACGACCC GACGATCGCC
GACAACGTCG AGGACATCAC CGAGGACGCC AAGGGCTTCC TCGACGGCCT GCTCAACCTG
CAGACCTACG TGGGCCTGCG CTCCGAGTAC ACGGTCGGCT CGGGCTCGCT GCGCAGCTAC
GTGTCGCTCG AGCTGGCGCC GCGTCCCGAC AAGTACTACC TCATCGAGCT GGCCAAGGGA
CCGCGCGGCG GTCTGCCCGA GGTCAGCCTG ATCTACGACC CGTCGATGAA CGACAGCCAG
TACCTGCGCC GGGTGACCAT CGAGGACGAG ATCCGCTTCA CCTTCCAGCT CGCCAAGCGG
CTGAGCTGGG CGACGCTGCG CTACGGCCTC AAGGAATCCA CGGGCGGCGT CGGCCTCGAT
TTCAACGGCG AATGGTTTGG TCGCGAGCTG ACGCTGCAGA CCGACGTGTT CGACGCCAGC
TTCGACCGCC TGCCGCGGCT CAAGGTCTCG GCCGCGTACG AGTTCCTGCC GTACATCTAC
GTGCTCGGCG GCATCGACGA CGCCATGAAC GCGCCCGGCT ACCTGCCGAT CACGCCGGGG
CCGGACGAGG GCCTCGAGCG GCCGATGCTG TTCGACGAGC TGCGCTATGG TCGCGACTTC
TTCGTGGGCG CGATGCTTCG CTTCAACGAT CGCGACCTCG CGGCGCTGTT CACGGTGGCC
GGTTCGGCGG CCGGCGCGGC GCTCGAGTAA
 
Protein sequence
MKSLSNGIKV GILFVVMVLG TYGVWSAVTA PSSGEKSFEL GAMFRDASGL PKGSRVVVAG 
LPVGEIIDLD IEGRYARIGF RVREDLPVWS NAIVFKKSSS LLGDYYLELD PGTPEALDAT
GNIVTNTRLG ADDTVATVVE ATSPDELLRR IDESMPNVDN VLLSVRDLSE DLRRVVNGPL
LSVSERIDGL VQTESETVAR ILERLDRSMV NIQAITNDVR DITGGRNSQI DRILEELEAA
SSEARNLVVS ARTEVEQTGS KVREKLDMVD DIMANTSSIT GKIDEDEGTL GRLVNDPTIA
DNVEDITEDA KGFLDGLLNL QTYVGLRSEY TVGSGSLRSY VSLELAPRPD KYYLIELAKG
PRGGLPEVSL IYDPSMNDSQ YLRRVTIEDE IRFTFQLAKR LSWATLRYGL KESTGGVGLD
FNGEWFGREL TLQTDVFDAS FDRLPRLKVS AAYEFLPYIY VLGGIDDAMN APGYLPITPG
PDEGLERPML FDELRYGRDF FVGAMLRFND RDLAALFTVA GSAAGAALE
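
The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines of Python. This is an illustrative example and not the method used to generate the record. It assumes the sequence is given on the coding strand, and it uses the translation table 11 codon assignments, which match the standard code for amino acids but also accept GTG as a start codon read as methionine. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (same assignments in tables 1 and 11).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)
# Alternative start codons listed for table 11; as initiators they encode Met.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq):
    """Remove the spaces and line breaks used in the 10-letter block layout."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate a coding-strand CDS, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residue = "M"  # initiator codon is read as methionine
        else:
            residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First line of the gene sequence above.
    first_line = ("GTGAAGTCCT TATCCAACGG GATCAAGGTC "
                  "GGCATCCTCT TCGTCGTGAT GGTGCTCGGC")
    print(translate(first_line))  # MKSLSNGIKVGILFVVMVLG
```

The first 60 bases translate to the first 20 residues of the protein sequence listed above. Passing the full gene sequence to `gc_percent` and `translate` would allow the GC content and protein length fields to be checked in the same way.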