Gene Hoch_3849 details

Gene Information

Locus tag: Hoch_3849
Symbol:
ID: 8546242
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 5299513
End bp: 5300628
Gene Length: 1116 bp
Protein Length: 371 aa
Translation table: 11
GC content: 74%
IMG OID: 646388518
Product: tRNA pseudouridine synthase B
Protein accession: YP_003268241
Protein GI: 262197032
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0130] Pseudouridine synthase
TIGRFAM ID: [TIGR00431] tRNA pseudouridine 55 synthase
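
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(5299513, 5300628)
print(length)                           # 1116
print(expected_protein_length(length))  # 371
```

Under these assumptions both results agree with the Gene Length and Protein Length fields of the record.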


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0712752
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.22473
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCACTG AGCGCAGCAG CCCCGAGCCC GAGGAGCGGC GCCGGCCCGC GCGGGACGCC 
GCGAGCGGGC AGGGCGGCGC GAACAAGAGC GCGCTGCACG GCGTGTTGGT GGTGGACAAG
CCGGCCGGGA TCACCTCGGC GGCCGTGGTC GCCCGGGTCA AGCGTCACCT GGGCGTGCGC
CGCGTCGGCC ACACCGGCAC GCTCGATCCC ATGGCCACCG GGGTCTTGCC GCTGTGTCTG
GGCGAGGCCA CCAAGATCGC CGGCTACCTG CTGGCCGAGG ACAAGGGCTA CGAGGCCGAG
CTGCTGCTCG GGGTCGAGAC CGACACCCTC GACGCCGAGG GCCAGGTCAC CGCGCGCGCG
CCCGAGGCCG CCGCGGCCGT GGACGAGGCC GCGCTGCGCC GCGTGCTGGC CACATTCGTC
GGCCCCGGCG AGCAGGTGCC GCCCATGTTT TCGGCGCTCA AACGCGGCGG TAAGCGGCTG
CACGAGCTGG CCCGCGCCGG CCAGGAGGTC GAGCGGCCGC CGCGTCCGGT GGTGATCCAC
GAGCTGCTGC TGCACGCCTT TGCGACGCCG CGCGCGCGCT TCTCCGTGCA CTGCTCCAAG
GGCACCTACG TGCGCAGCCT GGCCGACGAC ATCGGCCGCG CGCTCGGCTG CGGCGCCCAC
CTGAGCGCCC TGCGGCGCAC GCGCTCGGGC GCCTTTGCCA TCGCCCAGGC GATCCCGCTA
GCCGCGATCG AGGACGATCC CGAGCGCGCG CGCGCGGCGC TGGTGTCGCC GGCCGTGGCC
GTGGCGCATC TGCCCGCGGT GGCGATCCCG AGCGAGGGCG TCCACGACAT CGCCTGCGGC
AAGCCCATGA GCTGGCAGCG GCTGAGCCTG CTGGCGCCCG AGGCCGCGGC CATCGCGTGC
GACGCCCCGG TGCGCCTGCT GCTGCCCAGC GGCGAGCTGG TGGCGCTGGC CGAGCGCGTG
GCGACAAAAA ATGCCCCCGA CATGTCGAAC AGGGGCGCCG ATCAGTTGCA CTACTTACGC
GGGTTTTCTT ATGACTTGAC GAACCGGGCG GCTTCCTCCA ATCTCTCCGG CTCCAGTGGC
CGGGAGCGTG CCCGGCAAAG GCGCTCCGAG CGCTGA
 
Protein sequence
MSTERSSPEP EERRRPARDA ASGQGGANKS ALHGVLVVDK PAGITSAAVV ARVKRHLGVR 
RVGHTGTLDP MATGVLPLCL GEATKIAGYL LAEDKGYEAE LLLGVETDTL DAEGQVTARA
PEAAAAVDEA ALRRVLATFV GPGEQVPPMF SALKRGGKRL HELARAGQEV ERPPRPVVIH
ELLLHAFATP RARFSVHCSK GTYVRSLADD IGRALGCGAH LSALRRTRSG AFAIAQAIPL
AAIEDDPERA RAALVSPAVA VAHLPAVAIP SEGVHDIACG KPMSWQRLSL LAPEAAAIAC
DAPVRLLLPS GELVALAERV ATKNAPDMSN RGADQLHYLR GFSYDLTNRA ASSNLSGSSG
RERARQRRSE R
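
The sequence blocks above are printed in groups of ten characters, so they need to be cleaned before any computation. The sketch below is one possible way to do that, then compute GC content and translate the nucleotide sequence. It assumes that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in which start codons are permitted); the helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (first, second, third base each cycling T, C, A, G).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence from the record above.
first_line = clean(
    "ATGAGCACTG AGCGCAGCAG CCCCGAGCCC GAGGAGCGGC GCCGGCCCGC GCGGGACGCC"
)
print(len(first_line))        # 60
print(translate(first_line))  # MSTERSSPEPEERRRPARDA
```

The translated residues match the first twenty residues of the protein sequence in the record. Passing the full cleaned gene sequence to `gc_percent` gives a value that can be compared with the GC content field.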