Gene HS_1024 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHS_1024 
SymbolpcnB 
ID4240522 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaemophilus somnus 129PT 
KingdomBacteria 
Replicon accessionNC_008309 
Strand
Start bp1132205 
End bp1133578 
Gene Length1374 bp 
Protein Length457 aa 
Translation table11 
GC content38% 
IMG OID638104585 
Productpoly(A) polymerase 
Protein accessionYP_719236 
Protein GI113461167 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0617] tRNA nucleotidyltransferase/poly(A) polymerase 
TIGRFAM ID[TIGR01942] poly(A) polymerase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGAACTG AGTCGCCTTT TATTGCCACA AAACAGCGAT CGATACACAA AAATCAACAC 
AATAAATATG TTGTCAAGGC ATCTCAATAC GGAATTATGC CACGTATGAT CAGTCGTAAT
GCACTGTCAG TTGTAGAAAA ATTACATCGT AATGGTTATG AAGCTTATAT TGTTGGCGGT
TGTTTAAGAG ATTTATTATT AGGCAAAAAG CCAAAAGATT TTGATGTTGC AACTAATGCT
AAACCTGAGC AAGTTCAAGC TATCTTTCAA CGACAATGTC GTTTAGTGGG GCGTCGTTTC
CGTTTAGCTC ATATTATGTT TGGACGTGAT ATTATTGAAG TGGCAACATT CCGTGCAGCA
CACTCGGACA ATCATAATGA ACGCCAGGCA AAACAAAATA ATGCCGGTAT GCTATTGCGT
GACAATGTTT ATGGAACTAT TGAACAAGAT GCCGAACGCC GTGATTTTAC CGTCAATGCC
TTTTATTATA ATCCGCAAAA TAATACATTA CGTGATTATT TCAACGGCAT TGAAGACCTC
AAAGCCGGAA AGTTACGTTT AATTGGCGAT CCTGTTAAAC GCTACCAAGA AGATCCTGTA
CGCATGCTTC GTTCTATTCG CTTTATGGCA AAACTGGATA TGTTTTTGGA TAAGCCTAGC
CAAGAACCTA TCAAAAAAAT GGCTCATTTA TTAAAAAATA TTCCTCCGGC AAGACTTTTT
GACGAAAGTG TAAAACTTTT ACAAGCGGGT TATGGTATAA AAACTTATCA ATTGTTACGT
GAATACGGAT TATTTGATCA ATTATTTCCC ACGCTTACAC CTTATTTCAC CGATAAAGCT
GATAGCTTTG CAGAAAAAAT GATTATTACC GCTCTTACCT CTACAGATGA AAGAGTTGCC
GATAACTTAC CAATTAATCC TGCCTTCTTG TTTGCCGCTT TCTTCTGGTA TCCGCTAAGA
GAGAAAGTTG AAACACTCAA AAATGAAGGT AGTTTCAATA ATCATGATGC TTATGCTCTA
GCAAGTAATG ATATTTTAGA TGCTTTTTGT ACCGCACTTG CCGCACCACG TCGTCATACA
ACTGTCATCC GAGATATTTG GTTTCTACAG CTTCAACTGC TTAAACGCAC CGGTTCTGCT
CCGATGCGAG TTATGGAACA TGCCAAATTT AGGGCGGCAT TTGATCTCCT CGTTATGCGT
GCACAAATAG AGGGGGGTGA AGCAATTGAA TTGGCAACTT GGTGGCATGA ATATCAACTT
AGTAACCAAG ATCAACGTGA AAGTTTAATA AAAGAACAAC AACGCCTTAA TCCGAAACAG
AAGAAGAAAT TTTATCGTTC AAAAAAACAT CGGAAATCTA CGCCATCACC ATGA
 
Protein sequence
MRTESPFIAT KQRSIHKNQH NKYVVKASQY GIMPRMISRN ALSVVEKLHR NGYEAYIVGG 
CLRDLLLGKK PKDFDVATNA KPEQVQAIFQ RQCRLVGRRF RLAHIMFGRD IIEVATFRAA
HSDNHNERQA KQNNAGMLLR DNVYGTIEQD AERRDFTVNA FYYNPQNNTL RDYFNGIEDL
KAGKLRLIGD PVKRYQEDPV RMLRSIRFMA KLDMFLDKPS QEPIKKMAHL LKNIPPARLF
DESVKLLQAG YGIKTYQLLR EYGLFDQLFP TLTPYFTDKA DSFAEKMIIT ALTSTDERVA
DNLPINPAFL FAAFFWYPLR EKVETLKNEG SFNNHDAYAL ASNDILDAFC TALAAPRRHT
TVIRDIWFLQ LQLLKRTGSA PMRVMEHAKF RAAFDLLVMR AQIEGGEAIE LATWWHEYQL
SNQDQRESLI KEQQRLNPKQ KKKFYRSKKH RKSTPSP