Gene HS_0973 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHS_0973 
Symbol 
ID4240466 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaemophilus somnus 129PT 
KingdomBacteria 
Replicon accessionNC_008309 
Strand
Start bp1072426 
End bp1073622 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content37% 
IMG OID638104529 
Producttetratricopeptide repeat protein 
Protein accessionYP_719184 
Protein GI113461116 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2956] Predicted N-acetylglucosaminyl transferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000517245 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTCGAAT TACTTTTCCT GCTTTTACCG ATTGCAGCAG CTTACGGCTG GTATATGGGG 
CATCGTAGTG CAAGAAAAGA TCAAGATACG ATTAGCAATA AATTTTCTCG TGATTATGTA
ACAGGTATTA ATTTATTATT ATCTAATCAA CACGAAAAAG CGGCAGATCT CTTCCTTGAT
ATTCTACAAA AGCAGGAACA AGAAAATAAC ATTGAAACAG GTTCACAATT TGAGGCTGAA
TTGACCTTAG GAAATTTATA TCGTTCTCGA GGTGAAGTTG ATAGAGCTTT GCGTATTCAT
CAAGCATTGG ATAGCAGTTC AAATTATACC TTCGAACAAA AACTTCTTGC TAAACAACAA
TTAGCAAAAG ATTTTTTGAC GATAGGCTTT TATGATCGAG CTGAAAATCT TTATATTTTA
TTGATTGACG AACCGAACTA CGCTGAAAAT GCTTTACAAC AACTAGCCGT CATTTATCAA
AAAACAAAAG AATGGAAAAA AGCGATCAAT GTTGCCGAGA AACTTGCCAA AATTTCGCCT
ACGGAAGATA ACATTGCATT AGCACATTAC TACTGCGAAT ATTCCCTGAC TTTAGGAAGT
GATGAGCAAC AACAAGCTCA AGCAATCCAT ATTTTGCAAC AAGCGTTGAA TGTTTCAAAA
ACGAGTGTTA GAGCCTCTAT CCTCATTGCC GAACGTTATA TTGTGAATTT AGAATATCAA
CGTGCGGTGC AACATTTAGA AAATGTGCTA ATACAAAATG CGGATTACAT GAGCGAAATT
TTACCGGCAT TAAAATATTG CTACCAAGAA TTAAATCGGT TAGACAACTT TGAACTTTTT
CTTATTCGGG CAAGCCAAAC CAGCAATAAT AGTGCGGTTG ATTTAGCTTT ATCGGATCTT
ATCGCAGAAA AAGACGGCAT TGTTGCGGCA CAAACTAAAT TACACCAACA ACTGGAACAA
CACCCAAGTA CATTTATTTT ACATCGTTTT ATTCAATACC AAATTGATGC TGCTGAAAAC
GGCAAAGCAA AAGAAAGTTT GATTTTATTG CACAAAATCG TGGGAGATAG AATTGCACGA
GGCTTTGATT ATCGTTGCAG TCATTGCGGT TACCAAACAC ACAAACTATC ATGGAATTGC
CCATCTTGTC GGAAATGGGA AAAAATCAAA CCGATTGTCG GAACTGAACA CCACTAA
 
Protein sequence
MLELLFLLLP IAAAYGWYMG HRSARKDQDT ISNKFSRDYV TGINLLLSNQ HEKAADLFLD 
ILQKQEQENN IETGSQFEAE LTLGNLYRSR GEVDRALRIH QALDSSSNYT FEQKLLAKQQ
LAKDFLTIGF YDRAENLYIL LIDEPNYAEN ALQQLAVIYQ KTKEWKKAIN VAEKLAKISP
TEDNIALAHY YCEYSLTLGS DEQQQAQAIH ILQQALNVSK TSVRASILIA ERYIVNLEYQ
RAVQHLENVL IQNADYMSEI LPALKYCYQE LNRLDNFELF LIRASQTSNN SAVDLALSDL
IAEKDGIVAA QTKLHQQLEQ HPSTFILHRF IQYQIDAAEN GKAKESLILL HKIVGDRIAR
GFDYRCSHCG YQTHKLSWNC PSCRKWEKIK PIVGTEHH