Gene Information |
Locus tag | HS_1521 |
Symbol | |
ID | 4241044 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haemophilus somnus 129PT |
Kingdom | Bacteria |
Replicon accession | NC_008309 |
Strand | + |
Start bp | 1721371 |
End bp | 1722489 |
Gene Length | 1119 bp |
Protein Length | 372 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 638105105 |
Product | CdaR family transcriptional regulator |
Protein accession | YP_719730 |
Protein GI | 113461661 |
COG category | [K] Transcription [T] Signal transduction mechanisms |
COG ID | [COG3835] Sugar diacid utilization regulator |
TIGRFAM ID | |
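The coordinate and length fields above are mutually consistent, which a short check can illustrate. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the reported gene length includes the terminal stop codon; the function names `gene_length` and `protein_length` are hypothetical and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record for HS_1521.
length = gene_length(1721371, 1722489)
print(length)                  # 1119
print(protein_length(length))  # 372
```

Both results match the Gene Length (1119 bp) and Protein Length (372 aa) rows.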
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.000148841 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACATGA AATTAAGCGT TTCTTTAGCT CAAAATATTG TGCAACGAAC AAACCGTGTT CTACATAAGC CCATCAATGT AATGAATGAA ACCGGCATTA TTATTGCCTC AAGTAATCCT CACCGTTTGC AACAAAGACA TATTGGTGCC ATATACTCTA TTCGTCACAA CCAAACTATA GAAATAAATC AGGAGTTAGC TGAAAAATGG TTATTTGAAG TGCAACCGGG CATTAATCTT CCCATTAGTT ATTTGGGAGA AATTTTAGGT GTTGTAGGTA TTTCCGGAGA TCCCGACACT GTTCGATCCT ATGGTGAAAT GGTTAAAATG ACTGCCGAAT TGATGATTGA ACAACACTTT TTGTTAGAAA AAGAACGTTG GGATAAACGT CATAAAGAAG AGTTTATTCT AGGTTTATTA AAAGGTAAAC TCAACGAGGC AGAAATTGAG AAACAAAGTG CTTTTTTCGG ACTAACGCTA CATACTAAAA GTGCGGTCAT TATTATCCAA ATTTTGCACC CCACAGCAGA GAAACTTCAA AAGCTTGCTG GATATTTAGA ACAATCGATT AACCACTTCG CAATATTATC GCATGATAAA ATCGCACTTA TACAACCTGC TGAAGAACTT CAAGCCTTAC TTGGTACAAA AGGATTACAG AAATTATTTC CTCCTTATCT ATTACCTTCG GGATTAAAAG GGCTGAAAGT GGCTATCGGT TCCGAAGTTG AAAATAATCG CAACATTTCA CTTTCATTTC AGACCGCACT TAGTACCCTT CGTTATGGGG AAACTTATTT TCCTAAGAAA TCCATCTATT TCTTTAATCA ATATAAATTG CCTGCGTTAT TAGATAACTT ACGTAGTACT TGGCAAAGTA AGGAGTTATT AAAACCTATT GACACCTTAT ATCAACAAGA TGAAAATCAT CTGTTACAAA AAACATTGCA ACAGTATTTT TTTTCAAATT GTGATCTTGC TCTCACTTCA CAACAATTAT TTATTCATAT CAATACATTA AGGTATCGTC TAAGCAAGAT TGAGAAAATA ACCGGCTTGT CTTTCAATAA GATAGATGAA AAATTTACAC TCTACCTAAG CACATTGCTA AACCAATAA
|
Protein sequence | MNMKLSVSLA QNIVQRTNRV LHKPINVMNE TGIIIASSNP HRLQQRHIGA IYSIRHNQTI EINQELAEKW LFEVQPGINL PISYLGEILG VVGISGDPDT VRSYGEMVKM TAELMIEQHF LLEKERWDKR HKEEFILGLL KGKLNEAEIE KQSAFFGLTL HTKSAVIIIQ ILHPTAEKLQ KLAGYLEQSI NHFAILSHDK IALIQPAEEL QALLGTKGLQ KLFPPYLLPS GLKGLKVAIG SEVENNRNIS LSFQTALSTL RYGETYFPKK SIYFFNQYKL PALLDNLRST WQSKELLKPI DTLYQQDENH LLQKTLQQYF FSNCDLALTS QQLFIHINTL RYRLSKIEKI TGLSFNKIDE KFTLYLSTLL NQ