Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hoch_1781 |
Symbol | |
ID | 8544163 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haliangium ochraceum DSM 14365 |
Kingdom | Bacteria |
Replicon accession | NC_013440 |
Strand | + |
Start bp | 2461223 |
End bp | 2462341 |
Gene Length | 1119 bp |
Protein Length | 372 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 646386488 |
Product | EGF-like domain protein |
Protein accession | YP_003266223 |
Protein GI | 262195014 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.891729 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.581172 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGCGTT GGTTCTGGGT GGGAGTCTCG TTTCTGCTGG GTGGCTGCTT CGCGCCGAAT TATCCCGAGG GGGTCGCCTG CAGCCAGGCG CAGACGTGTC CGCCCGGACA GCGCTGCGAC CCGGCTAGCT CGACCTGCCG CGTCAGCGCG CCCGAGCCCG CAGACGACGC CGGCGCGGCA CCGGACAGCG CCGGCCCAAC CCCGGCCGAC GCGGGCGCGG CGCGGGATGG CGGCGGCGGG GCCGTAGACG CCGCGGCGTC CGACGCCGGT CCGGGGCCGT GCGCGGATGC GCCCTGCGGC GACAACGCGA GCTGCAGCGT CAGCGACGAC GCGCCCGAGG GCTACGCGTG CACCTGCGCG GCCGGCTTCG TGCCGCGCGC GGCCGAGCCC GGCTGCCGGC TGCCGCGCTC GTGCCGCGAG CTGCTCGGCG CCCAGCCGGA CAGCGGCGAC GGCGTGTACT CGCTCGCACC CGAGGGCGAG GCCGCGGGCG CGGCGCTCGC GAGCTACTGC GACATGAGCA CCGATGGCGG TGGTTGGACA CTCGTCCAGC GAACCGTGTG GGCCTTTTCC GACTCGGGCG CGCTGCGCAC CGATTACCAG AGCTTCCGCG AGCAGACCAT CGGCGCGCCG GCGCCGGGCC AGGCGTTTCG CGCGCCCGCG CGGCTGTGGC CGCTGCTGCA GGAGACCCGC GAGCACCTGA TGCGCGAAGT GCCGCGCCGG GCCAGCGACG GCGGCGACTG CGGCGCGCTG TACTACACCG CGAGCGAGGG CGTGTGGAGC GTGCCCGCGG GCGGCGGCGC GACGCTCACC GGCGCCGTGG ATCCCAATAT CCTGTTCAAC GGCACCACGG CGCTGTCGGC GCTCGACGAT GGGCCCTCGA CTGACTGCGT GGCTGATTAC CAGGCGGTGC CGTGGCTCTA CGAGCGTTGT TGCTCCACCT GTCCAACCTT CGAGAACAAC TACTGGCGCG ACGAGCCGCA CCCGATGTCG AGCCGCATGA ACATCGCGGA TCTGTTCGGA GAGACCATGT TCACCCAGTG CGCGCCCGCG GAGCCGCGGC GCAGCGACAA CAACTCCACC TTCTACGGCA TCAACGTCAT GGAGTATTAC CTGCGCTGA
|
Protein sequence | MSRWFWVGVS FLLGGCFAPN YPEGVACSQA QTCPPGQRCD PASSTCRVSA PEPADDAGAA PDSAGPTPAD AGAARDGGGG AVDAAASDAG PGPCADAPCG DNASCSVSDD APEGYACTCA AGFVPRAAEP GCRLPRSCRE LLGAQPDSGD GVYSLAPEGE AAGAALASYC DMSTDGGGWT LVQRTVWAFS DSGALRTDYQ SFREQTIGAP APGQAFRAPA RLWPLLQETR EHLMREVPRR ASDGGDCGAL YYTASEGVWS VPAGGGATLT GAVDPNILFN GTTALSALDD GPSTDCVADY QAVPWLYERC CSTCPTFENN YWRDEPHPMS SRMNIADLFG ETMFTQCAPA EPRRSDNNST FYGINVMEYY LR
|
| |