Gene Hneap_1148 details

Gene Information

Locus tag: Hneap_1148
Symbol:
ID: 8534300
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halothiobacillus neapolitanus c2
Kingdom: Bacteria
Replicon accession: NC_013422
Strand:
Start bp: 1247515
End bp: 1248606
Gene Length: 1092 bp
Protein Length: 363 aa
Translation table: 11
GC content: 52%
IMG OID: 646383537
Product: tRNA pseudouridine synthase D TruD
Protein accession: YP_003263031
Protein GI: 261855748
COG category: [S] Function unknown
COG ID: [COG0585] Uncharacterized conserved protein
TIGRFAM ID: [TIGR00094] tRNA pseudouridine synthase, TruD family
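
The coordinates and lengths listed in this record are mutually consistent, which a short sketch can check. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Number of amino acids encoded by a CDS whose last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(1247515, 1248606)
print(length_bp)                  # 1092
print(protein_length(length_bp))  # 363
```

Both values agree with the Gene Length and Protein Length fields of the record.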


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTGCCA ACCCCGGGCC ACTGGATCAG TTGTTGTATC ACGGCGCTCC GCCCATTCTG 
CAAGGGCAGT TGAAACAATC TCCATCAGAC TTCCGCGTGG ATGAAATCCT CGGGTTTGAA
CCGGACGGAG AAGGTGCCCA CGGTCTGTTT CTGGTCGAGA AAACGGGAAT CACCACGGGG
CAGATGCTGG GGCTACTGTC CAAATTATCC GGTGTGGCAG AAAGAGACAT CGGCTTTTGC
GGCATGAAGG ACAAACTCGC GGTCACATCC CAGTGGGTCA GTCTGCCTTT GATGCCATCG
CATTCATTAG AGAACCCTCC GGATTGGATC GATGCATTAC CCGATCACGT AAAAGTGCTT
CGCTGGAACC TGCATCGCAA GAAGCTGCGT CGGGGTAGTC ATCGGGGTAA CCGCTTCACT
GTTACGATTC GCGATGTCAT AGGGCATGAC CCAGAACTTC GGCAACGAAT TGAAAGGTTA
GAGTCGCAGG GTTTTCCCAA TTACTTTGCC GAGCAGCGGT TCGGGCATGC GGGAAGCAAC
TATGCCTTGC TCGAAAAGCT GGGACGATTA TCGAACGCCC GTTCAATTAG TCGCGCTGAT
CGAAACTGGG GCATATCGAC GCTCAGAGCT GAAATCTTCA ACCGGGTCTT GTCCGATCGT
CTGTCCCAAA ACACTGAAGC CACCGCTAAA CCTGGCGATC TGGCCCGTCT TGCGGGCACA
AATAGTTGGT TTTTAGTTGT CGAGGAAGAG TTGAACAACA CACAGCAAAG AATTGATACC
AAAGATATTT GGCTTACGGG GCCGCTCTGG GGTGAAGGAC CGAGTCCCGC CTTTGGAGAT
ATTAAGACTG AGGAAACCCG AATCGTAGAA GAAGTTTTAA CGAGCTACGG CTCGGAGAAT
TGGTCGAATC ACCTGCGCGA CTGGCGGGTT GAACATGATC GACGCGCTCT AATGGCACCG
ATAACCAATT TGCAGTGTGA AGAAAAGACA GAGGAGGGCA GCCGTATCCT CAATCTGTCA
TTTGCATTGG AATCAGGAAG TTATGCGACA GCTTTGCTTC GGGAAATTAT TGATCTGACA
CCGGCAGATT GA
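
The gene sequence is printed in groups of ten bases, so it has to be joined into one string before any analysis. The sketch below shows one way to do that and to compute a GC percentage that could be compared with the GC content field of the record. The helper names are hypothetical, and only the first line of the sequence is used as example input.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the spaces and line breaks used to group bases in tens."""
    return "".join(grouped.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, used only as a short example input.
first_line = "ATGAGTGCCA ACCCCGGGCC ACTGGATCAG TTGTTGTATC ACGGCGCTCC GCCCATTCTG"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Passing the full 1092 bp sequence through the same two functions gives the value for the whole gene.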
 
Protein sequence
MSANPGPLDQ LLYHGAPPIL QGQLKQSPSD FRVDEILGFE PDGEGAHGLF LVEKTGITTG 
QMLGLLSKLS GVAERDIGFC GMKDKLAVTS QWVSLPLMPS HSLENPPDWI DALPDHVKVL
RWNLHRKKLR RGSHRGNRFT VTIRDVIGHD PELRQRIERL ESQGFPNYFA EQRFGHAGSN
YALLEKLGRL SNARSISRAD RNWGISTLRA EIFNRVLSDR LSQNTEATAK PGDLARLAGT
NSWFLVVEEE LNNTQQRIDT KDIWLTGPLW GEGPSPAFGD IKTEETRIVE EVLTSYGSEN
WSNHLRDWRV EHDRRALMAP ITNLQCEEKT EEGSRILNLS FALESGSYAT ALLREIIDLT
PAD
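
The protein sequence can be reproduced from the gene sequence by codon translation. The following is a minimal sketch, not the database's own method. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons. That difference does not matter here because the gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative, and ambiguous bases such as N are not handled.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop grouping spaces and newlines
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence in this record.
print(translate("ATGAGTGCCA ACCCCGGGCC ACTGGATCAG"))  # MSANPGPLDQ
print(translate("CCGGCAGATT GA"))                     # PAD
```

The two outputs match the first ten and last three residues of the protein sequence above.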