Gene HS_0341 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHS_0341 
Symbolppx 
ID4239815 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaemophilus somnus 129PT 
KingdomBacteria 
Replicon accessionNC_008309 
Strand
Start bp342482 
End bp344032 
Gene Length1551 bp 
Protein Length516 aa 
Translation table11 
GC content37% 
IMG OID638103882 
Productexopolyphosphatase 
Protein accessionYP_718549 
Protein GI113460487 
COG category[F] Nucleotide transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG0248] Exopolyphosphatase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.859859 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATAACG AAAATTTATT ACAAAAAGTA ACCGCACTTG TACAACACCG CGGTGAAGCG 
AAAGAAATTG CCGCTATTGA TTTAGGCTCC AATAGCTTTC ATATGGTGAT TGCCCGAATT
ATTAACGGCT CTATTCAAGT TTTATCTCGT TTAAAACAAA AAGTACAATT GGCGGAAGGG
TTAGATGAAA ATAACATGTT AAGCCAAGCC GCTATCACTA GAGGCGTAAA CTGTCTTGCT
CTTTTTGCCG AACGCTTACA AGGATTTGAT CCTCAAAATA TCAATGTAGT CGGTACTTAT
ACATTGCGAA GTGCGGTCAA TAAGATAGAA TTTTTACAAC AGGCGGCAGA GGTTTTCCCC
TATCCGATTA ATATCATTAG TGGGGAAACT GAGGCAAAAA CGATTTATTC CGGTGTTTCT
CATACCCAAC CTGAGAGCGG TCGGAAATTC GTGATTGATA TTGGCGGTGG TTCAACGGAA
ATGATTATTG GCGATGATTT TATCCCTTTA ATTGCCAATA GTCGCAATAT GGGATGTGTC
AGTTTTGCCA AGAAATTTTT TCCATGTGGC AAAATATCAA GGGATAACTT TCAACGAGCT
AAAAAAACGG CAAAACAATG TATTGAAGAT CTTGCCAAAC CTTATCTTGA CTTAAATTGG
GATTGTGTTT TAGGTTCTTC CGGTACCATC AAAACCGTTC ATCAAGTTAT TAGTACCAAC
TATAATCAAC ACGCAATTAT CACATTATCG CACTTAAACA AGTTGATAAC ACAAGTATTG
AAAGCTCACC ATTTCAATAA ATTACATATC AATGGACTCA ATGAAGATCG TGTAGATGTA
TTTGTTCCTG GACTTGCTAT TTTGACCGCA CTTTTTGAAA CCTTTGCTAT CAAAGAAATG
CGTTATTCAG ACGGAGCATT GCGTGAGGGG ATTATTTATA GTTTGGAAAA GGATTTCCAA
GTCAAAAATA TTCGCCAACG TACCGCACTT GGTATTATGC AACAATTTAA TGTGGATCTC
GCACAAGCGG AGCGAACCTA TCAAAGCACC TTATTGCTCA GTGAACAATA TCAAAGTTGG
CAAGCTGTTG AATTAAAAAC TGAAATGCAA GATATTCTGT TATGGGCGGC AAGGTTACAT
GAAGTGGGGG TCGTGATTAA CCATAAAAAT TTACAAAAAC ATTCCGCTTA TATTCTACAA
AATATGGAAT TGCCCGGCTT TGATAAAGAA CAGCAACGCT TACTTACTAC AATAATCCAT
CACCAATTTA ATCATTTCAA AATGCCTGAC ATTGGAAAAT TTGCACGCTA TCCGAGGGCT
GATGTAATCG CACTTGTTCG TTTATTACGT TTGGCTATTT TACTCAATAA ATCTCGCCAA
GCGACATCGA AAACAAATAA CATTACCCTA AAAATCGACC GCACTCTAAA AAAATGGTCA
TTATATTTTG ATGCAGAATA TCTTGAGCAT AACCCTTTAG TCAAAAATGA ATTGATGGAA
GAACAGAAAC GCTTGTCAGA ATTTAACTTA GCACTGGACT TTTATTCTTA A
 
Protein sequence
MNNENLLQKV TALVQHRGEA KEIAAIDLGS NSFHMVIARI INGSIQVLSR LKQKVQLAEG 
LDENNMLSQA AITRGVNCLA LFAERLQGFD PQNINVVGTY TLRSAVNKIE FLQQAAEVFP
YPINIISGET EAKTIYSGVS HTQPESGRKF VIDIGGGSTE MIIGDDFIPL IANSRNMGCV
SFAKKFFPCG KISRDNFQRA KKTAKQCIED LAKPYLDLNW DCVLGSSGTI KTVHQVISTN
YNQHAIITLS HLNKLITQVL KAHHFNKLHI NGLNEDRVDV FVPGLAILTA LFETFAIKEM
RYSDGALREG IIYSLEKDFQ VKNIRQRTAL GIMQQFNVDL AQAERTYQST LLLSEQYQSW
QAVELKTEMQ DILLWAARLH EVGVVINHKN LQKHSAYILQ NMELPGFDKE QQRLLTTIIH
HQFNHFKMPD IGKFARYPRA DVIALVRLLR LAILLNKSRQ ATSKTNNITL KIDRTLKKWS
LYFDAEYLEH NPLVKNELME EQKRLSEFNL ALDFYS