Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hoch_1888 |
Symbol | |
ID | 8544270 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haliangium ochraceum DSM 14365 |
Kingdom | Bacteria |
Replicon accession | NC_013440 |
Strand | + |
Start bp | 2597632 |
End bp | 2599221 |
Gene Length | 1590 bp |
Protein Length | 529 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 646386593 |
Product | Mammalian cell entry related domain protein |
Protein accession | YP_003266328 |
Protein GI | 262195119 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1463] ABC-type transport system involved in resistance to organic solvents, periplasmic component |
TIGRFAM ID | [TIGR00996] virulence factor Mce family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAAGTCCT TATCCAACGG GATCAAGGTC GGCATCCTCT TCGTCGTGAT GGTGCTCGGC ACCTACGGCG TGTGGTCGGC CGTGACCGCG CCCTCGAGCG GCGAGAAGAG CTTCGAGCTC GGCGCCATGT TCCGCGACGC CTCGGGCCTG CCCAAGGGCT CGCGCGTGGT CGTCGCCGGT CTGCCGGTGG GCGAGATCAT CGACCTCGAC ATCGAGGGCC GCTACGCGCG CATCGGCTTC CGCGTGCGCG AGGATCTGCC GGTCTGGTCC AACGCCATCG TGTTCAAGAA GTCATCGTCG CTGCTCGGCG ACTACTACCT CGAGCTCGAC CCGGGCACGC CCGAGGCCCT GGACGCCACG GGCAACATCG TCACCAACAC CCGCCTGGGC GCGGATGACA CCGTGGCCAC CGTGGTCGAG GCGACCTCGC CCGACGAGCT GCTGCGCCGC ATCGACGAGA GCATGCCCAA CGTGGACAAC GTGCTGCTGT CGGTGCGCGA CCTGAGCGAG GATCTGCGCC GCGTGGTCAA CGGCCCGCTG CTGTCGGTGA GCGAGCGCAT CGACGGCCTG GTGCAGACCG AGTCCGAGAC CGTGGCCCGC ATCCTCGAGC GCCTCGACCG CAGCATGGTC AATATCCAGG CCATCACCAA TGACGTCCGC GACATCACGG GCGGCCGCAA CTCGCAGATC GACCGCATCC TCGAGGAGCT GGAGGCGGCC TCGAGCGAGG CCCGCAACCT GGTGGTGAGC GCGCGTACCG AGGTCGAGCA GACCGGCTCG AAGGTGCGCG AGAAGCTCGA CATGGTCGAC GACATCATGG CCAACACCAG CTCGATCACG GGCAAGATCG ACGAGGACGA GGGCACGCTC GGCCGCCTGG TCAACGACCC GACGATCGCC GACAACGTCG AGGACATCAC CGAGGACGCC AAGGGCTTCC TCGACGGCCT GCTCAACCTG CAGACCTACG TGGGCCTGCG CTCCGAGTAC ACGGTCGGCT CGGGCTCGCT GCGCAGCTAC GTGTCGCTCG AGCTGGCGCC GCGTCCCGAC AAGTACTACC TCATCGAGCT GGCCAAGGGA CCGCGCGGCG GTCTGCCCGA GGTCAGCCTG ATCTACGACC CGTCGATGAA CGACAGCCAG TACCTGCGCC GGGTGACCAT CGAGGACGAG ATCCGCTTCA CCTTCCAGCT CGCCAAGCGG CTGAGCTGGG CGACGCTGCG CTACGGCCTC AAGGAATCCA CGGGCGGCGT CGGCCTCGAT TTCAACGGCG AATGGTTTGG TCGCGAGCTG ACGCTGCAGA CCGACGTGTT CGACGCCAGC TTCGACCGCC TGCCGCGGCT CAAGGTCTCG GCCGCGTACG AGTTCCTGCC GTACATCTAC GTGCTCGGCG GCATCGACGA CGCCATGAAC GCGCCCGGCT ACCTGCCGAT CACGCCGGGG CCGGACGAGG GCCTCGAGCG GCCGATGCTG TTCGACGAGC TGCGCTATGG TCGCGACTTC TTCGTGGGCG CGATGCTTCG CTTCAACGAT CGCGACCTCG CGGCGCTGTT CACGGTGGCC GGTTCGGCGG CCGGCGCGGC GCTCGAGTAA
|
Protein sequence | MKSLSNGIKV GILFVVMVLG TYGVWSAVTA PSSGEKSFEL GAMFRDASGL PKGSRVVVAG LPVGEIIDLD IEGRYARIGF RVREDLPVWS NAIVFKKSSS LLGDYYLELD PGTPEALDAT GNIVTNTRLG ADDTVATVVE ATSPDELLRR IDESMPNVDN VLLSVRDLSE DLRRVVNGPL LSVSERIDGL VQTESETVAR ILERLDRSMV NIQAITNDVR DITGGRNSQI DRILEELEAA SSEARNLVVS ARTEVEQTGS KVREKLDMVD DIMANTSSIT GKIDEDEGTL GRLVNDPTIA DNVEDITEDA KGFLDGLLNL QTYVGLRSEY TVGSGSLRSY VSLELAPRPD KYYLIELAKG PRGGLPEVSL IYDPSMNDSQ YLRRVTIEDE IRFTFQLAKR LSWATLRYGL KESTGGVGLD FNGEWFGREL TLQTDVFDAS FDRLPRLKVS AAYEFLPYIY VLGGIDDAMN APGYLPITPG PDEGLERPML FDELRYGRDF FVGAMLRFND RDLAALFTVA GSAAGAALE
|
| |