Gene Information |
Locus tag | Hneap_1757 |
Symbol | |
ID | 8534915 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothiobacillus neapolitanus c2 |
Kingdom | Bacteria |
Replicon accession | NC_013422 |
Strand | + |
Start bp | 1891312 |
End bp | 1892352 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 646384139 |
Product | pseudouridine synthase, RluA family |
Protein accession | YP_003263627 |
Protein GI | 261856344 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0564] Pseudouridylate synthases, 23S RNA-specific |
TIGRFAM ID | [TIGR00005] pseudouridine synthase, RluA family |
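The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the CDS ends in a stop codon; the record does not state either convention explicitly, but both are consistent with its reported lengths. The function names are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(1891312, 1892352)
print(length_bp)                  # 1041
print(protein_length(length_bp))  # 346
```

Both values match the Gene Length and Protein Length fields of this record.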
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0151064 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATACAC ACCTTAAGGC GCCCTCCGGT CAGCCTATCG AAAAGAAACA ATCCGTGCGC CGCGTCACTG TCGATGCGCA CTATGCAGGT CAACGCATCG ACAATTTTCT GCTGCGCGAA TTGGGCGCGA CGCATGGCGA GGTTCCGCGC TCCCTGATTT ATCGCATTCT GCGCACCGGA GAAGTCCGCG TGAACAGTCA ACGCGCCAAG CCGACCACTC GGCTTGCGAC GGGCGATGAG GTCCGCATTC CACCACTCAA GCTGCAAAGC CCCTCACAAG AATCGGCGGG CGTGATCTCG GCGAACTGGC TCGCACGCGC TGCGGACATG ATCGTGTACG AAGACGAAGC CCTGCTGGCC GTCAACAAAC CTGCTGGCCT TGCCGTCCAC GGCGGCTCGA ACATCCCTTT TGGTCTGATT GAACTCATGC GCCAACACAC GGGATTGGGC GAAAAACTCG AACTGGCCCA TCGCATTGAT CGTGACACCA GCGGCCTGTT GATTCTGGCC AAGACCCGCG CCACACTGCA TTCGCTGCAA ACCCAGTTCC GGCCGGAAGG CCATGCCGAA AAGCAGTATC TGGCCATCGT GCATGGTCAT TGGCCAGACA AACTCAAACG CGTCGATGCG CCATTGGAAA AATGGCAGGG CGAAGGTGAG TCGCATCGGG TGCAGGTCAA CCCACAAGGC AAGGAAGCCG TGACCCATTT CGCCGTGCTG GCCGCCAACA AAAACGCCAC CCTGCTGCGC GCGCAACTTG AAACCGGTCG CACGCATCAG ATTCGCGTAC ACACCGCGCA CGAGAACCAC CCGATCGTCG GCGATGAAAA ATACGGTCAG CGCGAATGGG ACAAACGGCT CTTTCCATCC ACTGGCGCCT CGGCCCGACG ACGCCCGCCG CTACTGCTGC ACGCGTATCG CCTGATGCTG ACGCATCCGC AAACAGAACA ACCACTGCAG TTGACGGCCC CGATCCCAGA TAAATGGCGT TCGCTGGCAC AACAACTCAA TCTAACGCTG CCGGAACACG ACCCGAAATG A
Protein sequence | MNTHLKAPSG QPIEKKQSVR RVTVDAHYAG QRIDNFLLRE LGATHGEVPR SLIYRILRTG EVRVNSQRAK PTTRLATGDE VRIPPLKLQS PSQESAGVIS ANWLARAADM IVYEDEALLA VNKPAGLAVH GGSNIPFGLI ELMRQHTGLG EKLELAHRID RDTSGLLILA KTRATLHSLQ TQFRPEGHAE KQYLAIVHGH WPDKLKRVDA PLEKWQGEGE SHRVQVNPQG KEAVTHFAVL AANKNATLLR AQLETGRTHQ IRVHTAHENH PIVGDEKYGQ REWDKRLFPS TGASARRRPP LLLHAYRLML THPQTEQPLQ LTAPIPDKWR SLAQQLNLTL PEHDPK
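The relationship between the two sequences above can be illustrated with a minimal translation sketch. It assumes the standard codon assignments, which translation table 11 shares with the standard code (the two tables differ only in permitted start codons), and it strips the 10-base spacing used in this record. `translate` is a hypothetical helper written for illustration; it handles only unambiguous A/C/G/T input and stops at the first stop codon.

```python
from itertools import product

# Standard codon assignments in NCBI order (first base varies slowest, bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence in this record.
print(translate("ATGAATACAC ACCTTAAGGC GCCCTCCGGT"))  # MNTHLKAPSG

# Final 21 bases of the gene sequence, which are in frame and end with TGA.
print(translate("CCGGAACACG ACCCGAAATG A"))           # PEHDPK
```

The two fragments reproduce the first and last residues of the protein sequence listed in this record.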