Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gobs_3579 |
Symbol | |
ID | 8755264 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geodermatophilus obscurus DSM 43160 |
Kingdom | Bacteria |
Replicon accession | NC_013757 |
Strand | + |
Start bp | 3762482 |
End bp | 3763717 |
Gene Length | 1236 bp |
Protein Length | 411 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | |
Product | phage major capsid protein, HK97 |
Protein accession | YP_003410538 |
Protein GI | 284991984 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.978109 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCACGTCG CCATCACCAC TCCCGAGCAC GCCCGCCAGG CTTTGGACCG GGCCGACCAG CTCGCCTCCG CCGCCCGCAA GTCCGGGCGA CCCCTCACCG ATGCCGAGCA CGCCGAAGTG CGTGACCTCC TCGCCGGTGT CCAGCAGCAC AGGGACGAAA CCGAGATGCG GGACCGCATC GAGGGCATGC GGCGACCCGG TGCAAAGGCC GTGTACGGCT CCCGGAGCAC CGGCGGCACC GCCGGTGAGG CGTTCACCAG CAGCACCGCT TTCCAGTCGC TGCAGATGGC GTTCAAGTCC GGCGCGCTCA CCGGCCGGTG GACCTCCGGC CCCGTCGAGG TGCCGGACTA CTTCACCGGC CGCAAGGCCA CCCTCCTCAC CGGCGACTTC CCGTTCGAGC CGGACGTGCG CCCCGGCATC CAGCCGATCC TCCAGCGCCC GATCGTGGTC ACCGACCTGT TCGCGCCGGG CACCACCGAC TCCGCCCTGG TCCGGTACGT CGAGGAGACC CTGTTCACCA ACGGCGCTAC CACCGTCGGC GAGGGCGGGC TCAAGCCCGA GAGTGCGCTC GCGTTCGACT CCGTGGACGA GCCCGTGAGA AAGATTGCGC ACCACCTGCC GGTGAGCGAC GAAATGCTCG AAGATTCGAG CCAGATGAGG TCATATATCG ATTCTCGTCT GCGGCTCGGT GTGCAACTGG TCGAGGAGAC CCAGCTCCTC TCCGGTGACG GCACCGGCAC CAACCTCCGC GGCCTACTCA ACCGCACCGG CCTGCAGAAC CTCACCCTGA CCGCCCCGAC GACGGGGACG AACCCGTCCA TCGCCGAGAC GCTGTACCAG GCGATCACGA ACGTCCGCGT CAACGCCCTC GTCGAACCCG ACGGCGTCGT CATGCACCCG TCCGACTACG CCGCCCTGCG GCTGGCCAAG GACTCCGGTG GGGAGTTCAA CGCCGGAGGC CCGTTCGGCG CGCTCGCCGG GACCACCGTG TGGGGCCTGC CCGTGGCCCT GTCCATGGCG CTGCCGGTCA ACACCGCGAT CGTGGGCGCG TTCCGCTCCC AGGCTCAGGT GTTCCGCCGC AGCGGGCTCG TCGTGGAGGC AAGCAACTCG CACGCTGACT ACTGGACGCA CAACCTGACT TCCATCCGCG CTGAGCTGAG AGTTGCGCTG GCGGTGTTCA GGCCGTCGGC GTTCGTGCGG TTGGCCGGAC TCCAGCACGC AGGCGTCAGC GCCTGA
|
Protein sequence | MHVAITTPEH ARQALDRADQ LASAARKSGR PLTDAEHAEV RDLLAGVQQH RDETEMRDRI EGMRRPGAKA VYGSRSTGGT AGEAFTSSTA FQSLQMAFKS GALTGRWTSG PVEVPDYFTG RKATLLTGDF PFEPDVRPGI QPILQRPIVV TDLFAPGTTD SALVRYVEET LFTNGATTVG EGGLKPESAL AFDSVDEPVR KIAHHLPVSD EMLEDSSQMR SYIDSRLRLG VQLVEETQLL SGDGTGTNLR GLLNRTGLQN LTLTAPTTGT NPSIAETLYQ AITNVRVNAL VEPDGVVMHP SDYAALRLAK DSGGEFNAGG PFGALAGTTV WGLPVALSMA LPVNTAIVGA FRSQAQVFRR SGLVVEASNS HADYWTHNLT SIRAELRVAL AVFRPSAFVR LAGLQHAGVS A
|
| |