Gene Information |
Locus tag | Hoch_4848 |
Symbol | |
ID | 8547255 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haliangium ochraceum DSM 14365 |
Kingdom | Bacteria |
Replicon accession | NC_013440 |
Strand | + |
Start bp | 6633254 |
End bp | 6634462 |
Gene Length | 1209 bp |
Protein Length | 402 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 646389521 |
Product | CRISPR-associated protein GSU0053 |
Protein accession | YP_003269230 |
Protein GI | 262198021 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR02570] CRISPR-associated protein, GSU0053 family, N-terminal domain |
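The listed coordinates and lengths can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends with a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose coordinates are 1-based and inclusive (assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS: one codon per residue, minus the stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the record above.
length = gene_length_bp(6633254, 6634462)
print(length, protein_length_aa(length))  # 1209 402
```

Both results agree with the Gene Length and Protein Length fields of the record.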
| |
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGAACA CGCTCCCCCT CGAAACCCTC GCCCAGGCCG TCGCCGGCCA CGCCTCGGCG CTGCGCTGCG TCACCGAGTA CCAGCCCGCC GGCGGCCCCG GCGACAAGAT CTTTCCCCCG ACCTACGAGG GCGGCAAGTA TGCAACCGAG GAGCGCGTCA TCGACGGCGA GCGCGTCCCC TGTGTACTCG TCGACTCGGT GCAGTCGCAG GCCAACCGCA TGGAGCTGGC GCTGCTCGAG GCCTGGGAGC GCGAGCAGCT CCCGCTGCCC GTCATCACCG TGGACTTCCA CGACAAAGAC GTGCTCAAGC CGCTGCGCGT GACCAGCCTC GAGGCCCCGC ACCGCATCGC CGACGCCATC CTGCGCGACA GCACCCTCGG CGGCAAGCCC TTCCGCAAGT GCGAGCCCGG CAACTCGCTC GACCTGGTCG ATAACGGCTA CGCCACGCCG CTGTTCGAGC TGTGCCCGAC CGCCCTGATC TTCGGCATGT GGGACTCCAC CGGCCCGCGC GGCGGCCTCG GCGCCAAGTT CGCGCGCGCC ATGGTGTCCG AAATCATCGG CCTGCACGCC GTCGCCGGCA AGCGCACCAG CAGCCGCATC GATCCCCTGC AGATCCAGCG CAACGCCGGC GTGCTCTACG AGACCAAGGG CGAGGGCGGC ATCCACTGGA CCCTCGATGA GAAGCAGGCC AAGAGCAAAA AGGCCAAGCT CGGCAAAGAT GGACGCCCGT CCGAAGCCAA CCACGGCAAC GTGACCCCGA GCATCGCCGA CGGCGGCTTC ACCATCTCCA AAGCCGTGCA GACCACGGTG CTGTCGCTGC CCGCGCTGCG CCGCCTGCGC TTCCCGGTCG ATGGCAAGGC CGGCCTCGCC CGCGACGACG CCGCACGCAC CGCCCTGGCC GCCCTGGCCC TGTGCGCGGC CACGCTGTCG CGCGCCCAGG GCTGCGACCT GCGCTCGCGC TGCATCCTGC ACGCCCAGGA CGCCATCACC TGGGAGCTGC TCGGCGAACC CGGCAGCGAG CCGCAGCGCT TCGCCCTGCC CGCCGCCGAC GCCATCGCCC TGTACCGCGA GGCCCTCGAC AAAGCCCGCG CCGCCGGCCT GCCCTTCCGC GAACAAGCGC TGGAACTCGA GCCCTCGGCC GAGCTGGTCA CCCTGCTGCG CAAGAGCCAG GAACTTGAAG TCCAGAGCGC GCCCGAAGCA GGAGCCTGA
|
Protein sequence | MSNTLPLETL AQAVAGHASA LRCVTEYQPA GGPGDKIFPP TYEGGKYATE ERVIDGERVP CVLVDSVQSQ ANRMELALLE AWEREQLPLP VITVDFHDKD VLKPLRVTSL EAPHRIADAI LRDSTLGGKP FRKCEPGNSL DLVDNGYATP LFELCPTALI FGMWDSTGPR GGLGAKFARA MVSEIIGLHA VAGKRTSSRI DPLQIQRNAG VLYETKGEGG IHWTLDEKQA KSKKAKLGKD GRPSEANHGN VTPSIADGGF TISKAVQTTV LSLPALRRLR FPVDGKAGLA RDDAARTALA ALALCAATLS RAQGCDLRSR CILHAQDAIT WELLGEPGSE PQRFALPAAD AIALYREALD KARAAGLPFR EQALELEPSA ELVTLLRKSQ ELEVQSAPEA GA
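The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a minimal sketch, not the pipeline that produced the record: it builds the codon table from the standard NCBI ordering (translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in permitted start codons, which are not modeled here), and the names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
# Table 11 shares these assignments with the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, ignoring whitespace and stopping at the first stop codon.

    Raises KeyError for codons containing characters other than A, C, G, T.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGTCGAACA CGCTCCCCCT CGAAACCCTC"))  # MSNTLPLETL
```

Passing the full gene sequence from the record to `translate` should reproduce the listed protein sequence once the spaces between 10-residue blocks are removed.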