Gene Hoch_4848 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_4848 
Symbol 
ID8547255 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp6633254 
End bp6634462 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content71% 
IMG OID646389521 
ProductCRISPR-associated protein GSU0053 
Protein accessionYP_003269230 
Protein GI262198021 
COG category 
COG ID 
TIGRFAM ID[TIGR02570] CRISPR-associated protein, GSU0053 family, N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGAACA CGCTCCCCCT CGAAACCCTC GCCCAGGCCG TCGCCGGCCA CGCCTCGGCG 
CTGCGCTGCG TCACCGAGTA CCAGCCCGCC GGCGGCCCCG GCGACAAGAT CTTTCCCCCG
ACCTACGAGG GCGGCAAGTA TGCAACCGAG GAGCGCGTCA TCGACGGCGA GCGCGTCCCC
TGTGTACTCG TCGACTCGGT GCAGTCGCAG GCCAACCGCA TGGAGCTGGC GCTGCTCGAG
GCCTGGGAGC GCGAGCAGCT CCCGCTGCCC GTCATCACCG TGGACTTCCA CGACAAAGAC
GTGCTCAAGC CGCTGCGCGT GACCAGCCTC GAGGCCCCGC ACCGCATCGC CGACGCCATC
CTGCGCGACA GCACCCTCGG CGGCAAGCCC TTCCGCAAGT GCGAGCCCGG CAACTCGCTC
GACCTGGTCG ATAACGGCTA CGCCACGCCG CTGTTCGAGC TGTGCCCGAC CGCCCTGATC
TTCGGCATGT GGGACTCCAC CGGCCCGCGC GGCGGCCTCG GCGCCAAGTT CGCGCGCGCC
ATGGTGTCCG AAATCATCGG CCTGCACGCC GTCGCCGGCA AGCGCACCAG CAGCCGCATC
GATCCCCTGC AGATCCAGCG CAACGCCGGC GTGCTCTACG AGACCAAGGG CGAGGGCGGC
ATCCACTGGA CCCTCGATGA GAAGCAGGCC AAGAGCAAAA AGGCCAAGCT CGGCAAAGAT
GGACGCCCGT CCGAAGCCAA CCACGGCAAC GTGACCCCGA GCATCGCCGA CGGCGGCTTC
ACCATCTCCA AAGCCGTGCA GACCACGGTG CTGTCGCTGC CCGCGCTGCG CCGCCTGCGC
TTCCCGGTCG ATGGCAAGGC CGGCCTCGCC CGCGACGACG CCGCACGCAC CGCCCTGGCC
GCCCTGGCCC TGTGCGCGGC CACGCTGTCG CGCGCCCAGG GCTGCGACCT GCGCTCGCGC
TGCATCCTGC ACGCCCAGGA CGCCATCACC TGGGAGCTGC TCGGCGAACC CGGCAGCGAG
CCGCAGCGCT TCGCCCTGCC CGCCGCCGAC GCCATCGCCC TGTACCGCGA GGCCCTCGAC
AAAGCCCGCG CCGCCGGCCT GCCCTTCCGC GAACAAGCGC TGGAACTCGA GCCCTCGGCC
GAGCTGGTCA CCCTGCTGCG CAAGAGCCAG GAACTTGAAG TCCAGAGCGC GCCCGAAGCA
GGAGCCTGA
 
Protein sequence
MSNTLPLETL AQAVAGHASA LRCVTEYQPA GGPGDKIFPP TYEGGKYATE ERVIDGERVP 
CVLVDSVQSQ ANRMELALLE AWEREQLPLP VITVDFHDKD VLKPLRVTSL EAPHRIADAI
LRDSTLGGKP FRKCEPGNSL DLVDNGYATP LFELCPTALI FGMWDSTGPR GGLGAKFARA
MVSEIIGLHA VAGKRTSSRI DPLQIQRNAG VLYETKGEGG IHWTLDEKQA KSKKAKLGKD
GRPSEANHGN VTPSIADGGF TISKAVQTTV LSLPALRRLR FPVDGKAGLA RDDAARTALA
ALALCAATLS RAQGCDLRSR CILHAQDAIT WELLGEPGSE PQRFALPAAD AIALYREALD
KARAAGLPFR EQALELEPSA ELVTLLRKSQ ELEVQSAPEA GA