Gene HS_1650 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHS_1650 
SymbolcysQ 
ID4241177 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaemophilus somnus 129PT 
KingdomBacteria 
Replicon accessionNC_008309 
Strand
Start bp1881915 
End bp1882889 
Gene Length975 bp 
Protein Length324 aa 
Translation table11 
GC content37% 
IMG OID638105236 
Product3'-phosphoadenosine 5'-phosphosulfate (PAPS) 3'-phosphatase 
Protein accessionYP_719855 
Protein GI113461786 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1218] 3'-Phosphoadenosine 5'-phosphosulfate (PAPS) 3'-phosphatase 
TIGRFAM ID[TIGR01331] 3'(2'),5'-bisphosphate nucleotidase, bacterial 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTTTTATA AAAGCGTGCA GCCCATAGCC ATTTATCGAG CCGAATACTA TCATCTTTTT 
CCTTGGGTAA ATGCTGTTTT GCCATATTTG TCTTTTCGTT CAGTTAAAAT CAGGGGTATG
ATAGGGGGAT TAAAGACATT TGTAAAAGGG AAGATTATGA ATGAATTAAA AAGTCAGGTA
CTACTGGAAA AGGTTTTACA AATCGCTCAT CAAGCTGGAG ACTATCTTAA TCTTTTTTAC
AATGGAGAAA TTGATTTTCA GATCAACATA AAATCGGATA ATACGCCTAT CACAAACGCA
GATTTATTTG TAAACCAATT TCTTATCGAA AAACTGACCG CACTTACGCC ACATATCCCT
GTTTTATCAG AAGAAAGCTG TCAAATTTCT TTTTCAGATC GACGGCGGTG GCGTACTTAT
TGGTTGATTG ATCCGCTGGA TGGTACACAA CAATTTATTA ATCGAACAGA TCAATTCGCT
GTGCTAATTG CACTGATTCA TCAAAATCGG AGCATGTTAG GTATTATTCA TGCTCCCGTG
TTAAAACAAA CTTATTATGC ACTGCAAGGT CATGGTACTT ATAAACAAAC GGAACATTCT
CTGCAAACTT TATCAGCTAG AAAATTTGGC TTAAACCATA CAGTAAAAAT TGCAGTAGGT
TCAAAAAATG CGGAGCAAAA AGTGCGGTCA ATTTTGAGCT CAAATTATCA ATACGAATTT
ATTACTTATG GTTCTAGCGG TTTAAAAACC GCACTGGTTG CAGAAGGAAG TGCAGATTGC
TATATTCGAC TGGGACAAAC CGGTGAATGG GACACGGCTG CGGCGGAAGC CATTTTATCC
GAAATAGGCG GAGGAATCCG TGATACCCAA TTCAACGCCC TAACTTATAA CAAACGACCG
AGTTTAATAA ACCCTGATTT TATTATGGTA TCGGACATAT CTGCCGATTG GAAAAAAATC
TTTCAATTTA ATTAA
 
Protein sequence
MFYKSVQPIA IYRAEYYHLF PWVNAVLPYL SFRSVKIRGM IGGLKTFVKG KIMNELKSQV 
LLEKVLQIAH QAGDYLNLFY NGEIDFQINI KSDNTPITNA DLFVNQFLIE KLTALTPHIP
VLSEESCQIS FSDRRRWRTY WLIDPLDGTQ QFINRTDQFA VLIALIHQNR SMLGIIHAPV
LKQTYYALQG HGTYKQTEHS LQTLSARKFG LNHTVKIAVG SKNAEQKVRS ILSSNYQYEF
ITYGSSGLKT ALVAEGSADC YIRLGQTGEW DTAAAEAILS EIGGGIRDTQ FNALTYNKRP
SLINPDFIMV SDISADWKKI FQFN