Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hoch_1270 |
Symbol | |
ID | 8543652 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haliangium ochraceum DSM 14365 |
Kingdom | Bacteria |
Replicon accession | NC_013440 |
Strand | - |
Start bp | 1672348 |
End bp | 1674150 |
Gene Length | 1803 bp |
Protein Length | 600 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 646385988 |
Product | Collagen triple helix repeat protein |
Protein accession | YP_003265723 |
Protein GI | 262194514 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.932295 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.000000745074 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCGACAG CAACGTGGAA GTTAGCCTGC CAGATGGCCT GCATCGCGAC GCTCGTGTTC ACCGGCTGCA CGGGCGATCG CGGCCCGCAG GGCGCCGACG GTCCGCCCGG AACCCAGGGC CCGGACGGCA GCGACGGCGC AAACGGCAAT GACGGCAACG ACGGCGCGAA CGGCGACGAC GGCGCGGACG GCGCGGACGG CGCGGACGGC GTCACCGCGA TCGCGCTGCG CCTGCTCGGC CGCTACGAGA GCGGCATCTT CGACCAGGGG GCCTCGGAGA TCGTCAGCTA CGACCCGGCC ACGACGCAGC TCTTCCAGGT CAACGCCAAC TCCGGCGCGC TCGACGTCCT GTCCCTGGTC GATCCCGCGG CGCCGACGCT GCTCAGCAGC ATCGACGTGG CCGCCGCCAT CGCCGACAAC ACCGATATCA CCACCGTCCT CGGCGCGGTC AACAGCGTCG ATGTCCGCGG CGGCGTGGTC GCGGCCGTTA TCGCCGCCGA CAGCGGCGAC GAGCGCGGCG CCATCGCGTT CTTCCGCGCC GCCGATCACG CGTTCCTGGC CGGCTACGAG CTGGGATTTG GCCCCGACTC GCTCGCGTTC AGCCCCGACG GCGACACCGT GATCGTGGCC AACGAAGGCG AGCCCCTCGA CGACTACACC GTCGATCCGC CGGGCTCGGT GAGCGTCATC GATCTCAGCG TCGGCGTCGC CGCGGCGACC ATTCGCGATC TCGATTTTAC CGCGTTTGAC GCCGGCGGAA CGCGCGCAGG CGAACTCGAC CCCGCGGTCC GCATCTTCGG CATCAAGCAG CCGGGCGACG TCCCATCCAC GGTGTCCGAA GACATCGAAC CCGAGTACGT CGCCTTCGCG CCCGACGGCG CCACCGCCTT CGTCAGCCTG CAGGAGAACA ACGCCATCGC CGTCATCGAG GTGGCCGCGC CGCGCATCGC CCGCATCTTC CCGGCCGGCG CCACCGACCA CGGCCGCATC GGCAACGAGC TCGATCCCAG CGATCGCGAC CGCGGCATCG AGATCCGCAA CTGGCCGGTG TCGGGCCTGC GTTTGCCCGA CTCCATCGCC ACCTACGATT ACCAGGGCCG GACCCTGCTG GTCACCGCCA ACGAGGGCGA CACCCGCGAC TACGGCGGCT TCTCCGAGGA GGAGCGGATC CGCGACCTGG TGCTCGATCC CGAGGCATTC CCCGACGCCG CCGCGCTGCA GAGCGACGCC CAGATCGGCC GCCTGCTCAC CACCTCGAGC GCTGGCGACG ACGACGGCGA CGGTGATTTC GACCGCCTGT TCGCCATCGG CTCGCGCTCG TTCTCGATCT ACACCGCGGA CGGCGCGCCG ATCTTCGACA GCGGCAACCA GTTCGAGCTG ATCACGGCCT TCCGCCTCGA AGACCACTTC AACGCCAGCA ACGACGACAA CGAGGGCGAC AGCCGCAGCG ACGCCAAAGG CTGCGAGCCC GAGGCGCTCA CCGTCGGCCG GGTGCGCAAC GCCATGTTCG CGTTCATCGG CCTCGAGCGC ACCGGCGGCA TCATGGTCTA CAACATCTCC AACCCGCACA GCCCCCGCTT CGTGCAGTAC GTCAACGACC GCAATTTCGC CGAAGAACCC AGCCTGGGCG ACACCGACGG CGACGGCGTC GAGGAGAGCA ACCCGGCCGC CGGCGACCTC GGCCCCGAGA GCATCCGCTT CATCCCGGCC GCCGACAGCC CCAACGGCAG CGCTCTGCTG ATCGTGGGCA ACGAGGTCAG CGGCACCACG ACCGTCTACG CCATCGACAT CGTCCCCGAA TGA
|
Protein sequence | MATATWKLAC QMACIATLVF TGCTGDRGPQ GADGPPGTQG PDGSDGANGN DGNDGANGDD GADGADGADG VTAIALRLLG RYESGIFDQG ASEIVSYDPA TTQLFQVNAN SGALDVLSLV DPAAPTLLSS IDVAAAIADN TDITTVLGAV NSVDVRGGVV AAVIAADSGD ERGAIAFFRA ADHAFLAGYE LGFGPDSLAF SPDGDTVIVA NEGEPLDDYT VDPPGSVSVI DLSVGVAAAT IRDLDFTAFD AGGTRAGELD PAVRIFGIKQ PGDVPSTVSE DIEPEYVAFA PDGATAFVSL QENNAIAVIE VAAPRIARIF PAGATDHGRI GNELDPSDRD RGIEIRNWPV SGLRLPDSIA TYDYQGRTLL VTANEGDTRD YGGFSEEERI RDLVLDPEAF PDAAALQSDA QIGRLLTTSS AGDDDGDGDF DRLFAIGSRS FSIYTADGAP IFDSGNQFEL ITAFRLEDHF NASNDDNEGD SRSDAKGCEP EALTVGRVRN AMFAFIGLER TGGIMVYNIS NPHSPRFVQY VNDRNFAEEP SLGDTDGDGV EESNPAAGDL GPESIRFIPA ADSPNGSALL IVGNEVSGTT TVYAIDIVPE
|
| |