Gene Hneap_1757 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHneap_1757 
Symbol 
ID8534915 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothiobacillus neapolitanus c2 
KingdomBacteria 
Replicon accessionNC_013422 
Strand
Start bp1891312 
End bp1892352 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content59% 
IMG OID646384139 
Productpseudouridine synthase, RluA family 
Protein accessionYP_003263627 
Protein GI261856344 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0564] Pseudouridylate synthases, 23S RNA-specific 
TIGRFAM ID[TIGR00005] pseudouridine synthase, RluA family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0151064 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATACAC ACCTTAAGGC GCCCTCCGGT CAGCCTATCG AAAAGAAACA ATCCGTGCGC 
CGCGTCACTG TCGATGCGCA CTATGCAGGT CAACGCATCG ACAATTTTCT GCTGCGCGAA
TTGGGCGCGA CGCATGGCGA GGTTCCGCGC TCCCTGATTT ATCGCATTCT GCGCACCGGA
GAAGTCCGCG TGAACAGTCA ACGCGCCAAG CCGACCACTC GGCTTGCGAC GGGCGATGAG
GTCCGCATTC CACCACTCAA GCTGCAAAGC CCCTCACAAG AATCGGCGGG CGTGATCTCG
GCGAACTGGC TCGCACGCGC TGCGGACATG ATCGTGTACG AAGACGAAGC CCTGCTGGCC
GTCAACAAAC CTGCTGGCCT TGCCGTCCAC GGCGGCTCGA ACATCCCTTT TGGTCTGATT
GAACTCATGC GCCAACACAC GGGATTGGGC GAAAAACTCG AACTGGCCCA TCGCATTGAT
CGTGACACCA GCGGCCTGTT GATTCTGGCC AAGACCCGCG CCACACTGCA TTCGCTGCAA
ACCCAGTTCC GGCCGGAAGG CCATGCCGAA AAGCAGTATC TGGCCATCGT GCATGGTCAT
TGGCCAGACA AACTCAAACG CGTCGATGCG CCATTGGAAA AATGGCAGGG CGAAGGTGAG
TCGCATCGGG TGCAGGTCAA CCCACAAGGC AAGGAAGCCG TGACCCATTT CGCCGTGCTG
GCCGCCAACA AAAACGCCAC CCTGCTGCGC GCGCAACTTG AAACCGGTCG CACGCATCAG
ATTCGCGTAC ACACCGCGCA CGAGAACCAC CCGATCGTCG GCGATGAAAA ATACGGTCAG
CGCGAATGGG ACAAACGGCT CTTTCCATCC ACTGGCGCCT CGGCCCGACG ACGCCCGCCG
CTACTGCTGC ACGCGTATCG CCTGATGCTG ACGCATCCGC AAACAGAACA ACCACTGCAG
TTGACGGCCC CGATCCCAGA TAAATGGCGT TCGCTGGCAC AACAACTCAA TCTAACGCTG
CCGGAACACG ACCCGAAATG A
 
Protein sequence
MNTHLKAPSG QPIEKKQSVR RVTVDAHYAG QRIDNFLLRE LGATHGEVPR SLIYRILRTG 
EVRVNSQRAK PTTRLATGDE VRIPPLKLQS PSQESAGVIS ANWLARAADM IVYEDEALLA
VNKPAGLAVH GGSNIPFGLI ELMRQHTGLG EKLELAHRID RDTSGLLILA KTRATLHSLQ
TQFRPEGHAE KQYLAIVHGH WPDKLKRVDA PLEKWQGEGE SHRVQVNPQG KEAVTHFAVL
AANKNATLLR AQLETGRTHQ IRVHTAHENH PIVGDEKYGQ REWDKRLFPS TGASARRRPP
LLLHAYRLML THPQTEQPLQ LTAPIPDKWR SLAQQLNLTL PEHDPK