Gene HS_1004 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHS_1004 
Symbol 
ID4240497 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaemophilus somnus 129PT 
KingdomBacteria 
Replicon accessionNC_008309 
Strand
Start bp1105109 
End bp1106497 
Gene Length1389 bp 
Protein Length462 aa 
Translation table11 
GC content36% 
IMG OID638104560 
Productputative transglycosylase 
Protein accessionYP_719215 
Protein GI113461147 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG4623] Predicted soluble lytic transglycosylase fused to an ABC-type amino acid-binding protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00893953 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTTTTCC CTTGGCAACA AATAGTGAGA TCAAAACAGA ATTACCATAC TACAATTCAA 
GAGCGACAAA AATTGATTGT AGGAACAATC AATAATCCGG TGTCTTATTT TATCGGTACA
AACGGAGAAA CAGGGCTAGA ATATGAATTA AGCAAGGCTT TTGCAAATTA CTTAAATGTT
GATTTGGAAA TGTTTCCTCT AAATAGTGCG GACGCATTAT TTCAGGCTTT AGCACAAGGT
AAAATAGACA TTGCTGCTGC AAGTTTATTT TATCAACAAG ATAGAAGTGA AAAATTCAAA
CTAGGTCCGG CATATCACGC TGCATCTTGG CAATTGACTT ATCGCAAAGG TGAACGTCGT
CCTATCACAT TAGAAAATTT ATCCGGCAAA TTGGTTATTC CGGCTAATTC GGCACTGAAT
AATATTCTGC TGGCAAAAAA AGAAAAATAC CCTTCTTTAA CATGGGAAAC CAGTGAACTA
AGCCAAGAGG AACTCTTATT TCAAGTTGCT GAAGGAAAAA TAGATTACAC AATCGCTACC
TCTACTGAAG TATCAGTTAA TCAGCAAATT AAACCTCAAA TTGCAATTGC CTTTAATGTG
ACTGATGAGT TTACAGTACA TTGGTACTTA TCTGATAAGG GGTCTTCAGA ATTACAAGCC
GCACTATTAG ACTTCATGAA CTCTGCCATT GAAAACGGCT TAATTGCTCG TATTGAAGAA
AAATATTTCA ATCACCTCAA CCAATTTGAC TATGTTGATA CTCGCTCCTA TTTGAATGCA
ATTGAAACAG TTTTGCCTAA ATATGCTCCT TTGTTTGAAA AATATAAAGG TGATTTAGAT
TGGCGTTTAT TGGCAGCCAT ATCTTATCAA GAATCCCATT GGAATCCGGA AGCAACCTCA
CCAACCGGAG TACGCGGTAT GATGATGTTG ACAAAAGCAA CTGCAGATAG AATGAATATT
ACTAATCGTC TCGATCCTGA ACAAAGCATT AAAGCCGGTT CCGAATATTT ACATCTTCTG
CTCAAACAAA TGCCGGATAC TATTTTAAAA GAAGATCGTA TTTGGTTTGC ACTTGCCGCT
TATAACATGG GATTGGGACA TTTATTAGAC GTTAGACGCT TAACTAAACA GCTGGGAGGA
AATCCGGATA ATTGGTTAGA GGTGAAAAAA AATTTACCCT TATTAGCACA AAAACGTTAT
TTTACCCATC TTAAATATGG CTACGCTCGA GGCTACGAAG CGTTTCAATA TGTGGAAAAT
ATTAGAAGAT ATATGAACAG CATAATGAAT TATTATCGGC TTCAACAAAA CCAACAAGAT
CGACAAGATC GGTATGAAAA TGAAAATAAT GATGTCATTT CAACACAAAC ACAACAGGAA
CAACGATGA
 
Protein sequence
MVFPWQQIVR SKQNYHTTIQ ERQKLIVGTI NNPVSYFIGT NGETGLEYEL SKAFANYLNV 
DLEMFPLNSA DALFQALAQG KIDIAAASLF YQQDRSEKFK LGPAYHAASW QLTYRKGERR
PITLENLSGK LVIPANSALN NILLAKKEKY PSLTWETSEL SQEELLFQVA EGKIDYTIAT
STEVSVNQQI KPQIAIAFNV TDEFTVHWYL SDKGSSELQA ALLDFMNSAI ENGLIARIEE
KYFNHLNQFD YVDTRSYLNA IETVLPKYAP LFEKYKGDLD WRLLAAISYQ ESHWNPEATS
PTGVRGMMML TKATADRMNI TNRLDPEQSI KAGSEYLHLL LKQMPDTILK EDRIWFALAA
YNMGLGHLLD VRRLTKQLGG NPDNWLEVKK NLPLLAQKRY FTHLKYGYAR GYEAFQYVEN
IRRYMNSIMN YYRLQQNQQD RQDRYENENN DVISTQTQQE QR