Gene Information |
Locus tag | Hoch_5021 |
Symbol | |
ID | 8547431 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haliangium ochraceum DSM 14365 |
Kingdom | Bacteria |
Replicon accession | NC_013440 |
Strand | + |
Start bp | 6925845 |
End bp | 6927554 |
Gene Length | 1710 bp |
Protein Length | 569 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 646389697 |
Product | hypothetical protein |
Protein accession | YP_003269403 |
Protein GI | 262198194 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR02608] delta-60 repeat domain |
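The coordinate, length, and GC content fields in this record are related by simple arithmetic. The following is a minimal sketch of a consistency check, not part of the source database. It assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon. The function names (`span_length`, `expected_protein_length`, `gc_fraction`) are hypothetical helpers introduced for illustration.

```python
# Values copied from the Gene Information table for locus Hoch_5021.
START_BP = 6925845
END_BP = 6927554
GENE_LENGTH_BP = 1710
PROTEIN_LENGTH_AA = 569


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive start and end coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # the stop codon encodes no residue


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases; whitespace between 10-base blocks is ignored."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in cleaned) / len(cleaned)


if __name__ == "__main__":
    # Both checks should agree with the values listed in the record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Passing the full gene sequence from the Sequence section to `gc_fraction` gives a value that can be compared with the listed GC content; the spaces between the 10-base blocks do not need to be removed first.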
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.196755 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.0476496 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCGTA TCCTCTCTCT CGTCGCCTCG TCCTCGCTCG CGCTCGCGCT GGCCGGCTGC ACCGCCATCC TCGGCATCGA GGAGCTCAGC GGCACCACCG ACGGCGGCGT TCCGGTGGAT GCTAGACCTG GCGACGGCGC ACTGCCCGGG GATCCCGACA GCGGTCTGGC CGGCTACTCG CTCCGGATAC ACACCAACGC GCCCACCCTG CCGCTCGACG GCACCACCTT CCTCGACATC GAGATCCAGC GCCTGAGCGG CCACGATCGC GAGATCCGGC TCGATATCGA CGGGCCGGGC GGCGTGATCA GCCCGGGCCT CACGGTCAGC GGCACGAGCA CGCTGGTCGA GCTGCCCATC GGCGCCGGCG CGCCCCTGGC CATCGGCGAT GAGGTCTCGT TTCGCGTGCG CGCCATCGAG ACCGACGGCG CCGGCATCGC GGTCGAGCGC GAGGTCACGG GCGCCCAGGT CACCGGCCGC CCCGGCCTGC TCGATACCTC GTTTGGCGCC GCCGCCACCG GCCTGGCGCG CGTGAGCTTT GGCAACGACG ACAGCGGCCG CTTCTACGAC CTCGAGATCT TGCCCGACGG CAGCATCCTG GCCGCGGGCT GGGGCGCCGG CGGCCTCGGC GCCGTCACCA GCGCGCTGGC CCGGCTCACG GCCGACGGCC TGGCCGACCT CGGGTTCTCG GGCGACGGCC TCGTGCGCAC CAACTTCGAG ACCGGCTCGT CGGCCGAAAG CTTTCAGACC TACGCGATCG GCCGCCAGCT CGACGGCCGC ATCATCGCCA TCGGCCAGCA CAGCAGCACC AGCTCGTATC CGCGAGCCTT CGCCCTGGCC CGCTACACCG CCAGCGGCGG CGAGGGCGAC CCGCTGTTCG GCAACTTCGC CTCCGGCCGC AGCCGCATCC TCATCAACAA CACCGCCATC GACCTCGTCC GCGACGGGCT CGTCACCGTC GACAACCGCA TCCTGGGCGC GGGCAGCTTC GGCGGCAGCC TGAGCGTATT CCGCGCCACC TCGAGCGGAG ATCTCGACCA GATCTTCGCC GACCGGGGCG TGTTCCAGCT CGACGCCGAC GGCAGCTCGC GCGCCGAAGC CATCAGCCGC GACGCCCAGG GCCGCCTGCT CGTGGTCGGC ACGCGCGAAC GCGGTGCTCA GAGCGACATG ATCGTAGTCC GCCTGGACGA AAACGGCGCG CTCGACGACG GGTTCGCCGC CGGTGGCGTG CTCATCGCAG GCAGCCCGGA GATCGACGAG CGCGCCGTGG CCGTGGCCGT GCGCGCCGAC GGCCGCCTGG TAGTCGCCGG CGACGTCACC CTCGCCGATG GCAGCCGCGC GCTGCAGGTG CGGCAGTTCA CGGCCGAGGG CGACTTCGAC AGCGAGTTTG GCACGAACGG CGTGAGCACC CAGGTGCTCG ACGACCGCGG CGTCGAGGTC ACCGACATGC TGCTCGCGCC CGACGGCCGC ATCCTGGTGC TGGGCAACGG CACCGGCAAC GCCGACCCCG TGCTCGTGCG CCTGTCGCGC GACGGCGGGC TCGACCCCTA CTTCGACGGC GACGGCGTGC TGTCGATGTA CGTGGGCGAC TGCGGCGCGG TCGAAACGCT CGCCCTGGTC GGCCGCAGCC GGCTGCTGAT CGCGGGCGGC GACGAGTGCG GCACGCCCGG CCCGGGCACC GCCGGCATCA TCCTGCGGCT GTGGATCTGA
|
Protein sequence | MSRILSLVAS SSLALALAGC TAILGIEELS GTTDGGVPVD ARPGDGALPG DPDSGLAGYS LRIHTNAPTL PLDGTTFLDI EIQRLSGHDR EIRLDIDGPG GVISPGLTVS GTSTLVELPI GAGAPLAIGD EVSFRVRAIE TDGAGIAVER EVTGAQVTGR PGLLDTSFGA AATGLARVSF GNDDSGRFYD LEILPDGSIL AAGWGAGGLG AVTSALARLT ADGLADLGFS GDGLVRTNFE TGSSAESFQT YAIGRQLDGR IIAIGQHSST SSYPRAFALA RYTASGGEGD PLFGNFASGR SRILINNTAI DLVRDGLVTV DNRILGAGSF GGSLSVFRAT SSGDLDQIFA DRGVFQLDAD GSSRAEAISR DAQGRLLVVG TRERGAQSDM IVVRLDENGA LDDGFAAGGV LIAGSPEIDE RAVAVAVRAD GRLVVAGDVT LADGSRALQV RQFTAEGDFD SEFGTNGVST QVLDDRGVEV TDMLLAPDGR ILVLGNGTGN ADPVLVRLSR DGGLDPYFDG DGVLSMYVGD CGAVETLALV GRSRLLIAGG DECGTPGPGT AGIILRLWI