Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | HS_0158 |
Symbol | dcuB |
ID | 4239668 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haemophilus somnus 129PT |
Kingdom | Bacteria |
Replicon accession | NC_008309 |
Strand | + |
Start bp | 141796 |
End bp | 143106 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 638103689 |
Product | anaerobic C4-dicarboxylate transporter |
Protein accession | YP_718364 |
Protein GI | 113460304 |
COG category | [R] General function prediction only |
COG ID | [COG2704] Anaerobic C4-dicarboxylate transporter |
TIGRFAM ID | [TIGR00770] anaerobic c4-dicarboxylate membrane transporter family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTTTATA TTCAATTCGC TATCGTTTTA GTGTGTATTC TAATTGGTGC ACGCATTGGT GGTATCGGTT TGGGCGTTAT GGGTGGTTTA GGGCTTTGTA TTCTCGCCTT TGGCTTCGGG TTACAACCTG CTGGATTACC GATTGATGTG ATGTTTATGA TTATGGCTGT AGTTTCTGCC GCTGCTGCAA TGCAGGCAGC TGGTGGTTTG GATTATATGA TCAAAATCGC AACCAATATT TTACGCAGAA ATCCTAAATA TATCACTTTT GTCGCACCTG CCGTAACTTG GTTATTTACT TTCTTAGCCG GTACTGGCCA TGTAGCTTAT TCTGTATTAC CAGTTATCGC AGAAGTAAGT CGTCACAGCG GTATTCGTCC TGAGCGTCCA CTTTCAATGG CAGTAATTGC ATCGCAATTT GCTATTGTAG CAAGTCCTAT CGCTGCCGCT GTAGTGGCTG TTGTGAACTA TTTAGAACCA CAAGGAATCA ACTTGGGTAG CGTCTTAATG GTTACTATTC CGTCAACTAT TTTAGGCATA GCACTTGCTT GTGTTTTTGT CAATAAAATG GGGAAAGAAT TAAAAGACGA TCCGAATTAT CAACGTTTAT TACAAAATCC TGAGTATGTC AAAGAAAACA GCGTAAGTAA TATTGCTGAA ATTGAAATAA AACCGACTGC GAAATTATCA GTGGGCTTAT TCTTATTCGG TGCATTACTT GTTGTTGTGA TGGGAGCGTA TCCGGAGTTA CGTCCATCAT TCAACGATAA ACCTATGGGT ATGGCTCACA CTATTGAGAT CGTTATGTTA ACCGTTGGTG CATTGATTAT CTTAACCTGT AAACCGAATG GTAATGCTAT TACACAAGGT TCCGTATTCC ATGCTGGTAT GCGTGCGGTG ATTGCAATTT TTGGTATTGC TTGGTTGGGA GATACGTTGA TGCAAGGTCA TATTGCAGGC ATTAAAGAAA GTGTGAAGTT ATTAGTTGAA ACCGCACCTT GGGCATTTGC ATTTGCACTG TTCTTACTTT CTGTTTTGGT AAACAGTCAA GGGGCGACTG TAGCAACTTT ATTTCCGCTA GGTATTGCAT TAGGTATTGA ACCTGCGATT TTAATCGGGG TATTTGTAGC AGTAAATGGT TATTTCTTTA TTCCAAACTA TGGTCCAATT ATTGCTTCAA TTGACTTTGA TACAACAGGC ACAACAAGAA TCGGTAAGTT TATCTTTAAT CACAGCTTTA TGCTTCCAGG TTTATTAAGT ATGGCATTTA GCTTAGGTTT CGGGTTATTG TTTGCAAGTA TTTTACTTTA G
|
Protein sequence | MFYIQFAIVL VCILIGARIG GIGLGVMGGL GLCILAFGFG LQPAGLPIDV MFMIMAVVSA AAAMQAAGGL DYMIKIATNI LRRNPKYITF VAPAVTWLFT FLAGTGHVAY SVLPVIAEVS RHSGIRPERP LSMAVIASQF AIVASPIAAA VVAVVNYLEP QGINLGSVLM VTIPSTILGI ALACVFVNKM GKELKDDPNY QRLLQNPEYV KENSVSNIAE IEIKPTAKLS VGLFLFGALL VVVMGAYPEL RPSFNDKPMG MAHTIEIVML TVGALIILTC KPNGNAITQG SVFHAGMRAV IAIFGIAWLG DTLMQGHIAG IKESVKLLVE TAPWAFAFAL FLLSVLVNSQ GATVATLFPL GIALGIEPAI LIGVFVAVNG YFFIPNYGPI IASIDFDTTG TTRIGKFIFN HSFMLPGLLS MAFSLGFGLL FASILL
|
| |