Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | HS_0702 |
Symbol | dctQ |
ID | 4240190 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haemophilus somnus 129PT |
Kingdom | Bacteria |
Replicon accession | NC_008309 |
Strand | + |
Start bp | 744433 |
End bp | 746298 |
Gene Length | 1866 bp |
Protein Length | 621 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 638104254 |
Product | TRAP dicarboxylate transporter, permease component |
Protein accession | YP_718914 |
Protein GI | 113460847 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1593] TRAP-type C4-dicarboxylate transport system, large permease component [COG3090] TRAP-type C4-dicarboxylate transport system, small permease component |
TIGRFAM ID | [TIGR00786] TRAP transporter, DctM subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000972925 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAATAA TGAACAAGTT AGAAGAATGG GTTGGCGGGA CATTATTCCT CGCAATTTTC CTGATTTTAA TTGCACAAAT TGTTGCACGT CAAGTTATTC ATATGCCACT TATATGGTCG GAAGAATTAG CACGTTTGCT GTTTGTTTAT ACGGCATTAT TGGGGATCAG TATGGCGGTA CGTGCACAAC AACACGTCTA TATTGACTTT ATTACCAATT TAATGCCGCT GACAATCAAA CGTATGGCGA TGACCTTTGT TCAATTATTA ATTTTTGTCT CAATTATTCT GTTTATCTAC TTAGGTTATG GCGTTTGGGC AGACGCAACT TTTCCTATGG AAGCATTAAA AGCTACCTTT GGAACGGAAA TTACGCAAAA ATGGCTCTAC GCAGGATTGC CGATTATTGC ATCTTTGATG TTATTCCGTT TTTTAGAAGC TCAAGCAGAA AACTTTAGAA ACAAAGCCAC TTACTTGCCT GTTTCTTTTT TCCTTGTAAG TGCGGTCATT ATTTTTGCGA TTTTATATTT TCAGCCGGAA TGGTTTAAAT CTCTGCGTAT TTCTAACTAT GTAAAATTCG GTAAAAATGC CGTTTATATC GCTTTAGCGG TTTGGTTAGT GATTATGTTC TTAGGCACGC CTGTAGGATG GTCATTATTT ATTGCCGCTT TACTTTATTT TTCAATGACA CGTTGGAATA TCACTTATTC CGCATCGGGT AAATTAGTTG ATAGCTTAGA CAGCTTCCCG TTATTGTCCG TACCGTTTTT TATCTTAACC GGTATTTTGA TGAACACTGG CGGTATCACT GAGCGTATCT TCCACTTCGC AAAAACGTTA TTAGGACACT ATACCGGTGG TATGGGACAT GTAAATATCG GTGCTAGCTT AATTTTTTCG GGTATGTCAG GATCTGCTTT GGCAGATGCT GGCGGTTTAG GTCAATTGGA AATCAAAGCG ATGCGTGATG CAGGCTATGA TGACGATATT TGCGGTGGTA TTACTGCTGC CTCCTGTATC ATCGGTCCGT TAGTTCCGCC AAGTATTGCA ATGATAATCT ATGGTGTTAT TGCTAATGAA TCCATCGCAA AGCTATTTGT AGCAGGTTTT GTACCTGGCG TGTTGGTTAC CATCGCATTG ATGGTCATGA ACTACTATGT TTCTAAAAAA CGTGGTTACC CTAAAACACC AAAGGCGAGC AAAGCAGAAG TTTGTGCCGC ATTTAAAGAG ACATTTTGGG CAATTTTGAC ACCGTTTTTG ATTATTGGTG GTATTTTCTC TGGTTTATTT ACACCGACAG AAGCGGCGGT CGTTGCTGCT GCATATTCCG TAATTGTAGG AAAGTTTGTT TATAATGATT TGAATCTAAA GAATTTCCTC AAAAGCTGTG TAGAAGCAGT ATCTATTACC GGTGTAACCG CATTAATGGT AATGACAGTC ACTTTCTTCG GTGACATGAT TGCCCGTGAA CAGGTTGCGA TGAAAATTGC GGAAATTTTT GTTGCTGTTG CAGATTCGCC TATCACAGTG TTAATTATGA TCAATTTGCT TCTCTTATTT TTAGGAATGT TCATTGACGC CTTGGCACTA CAATTTTTAG TGTTGCCAAT GTTGATTCCA ATTGCGATGC AATTTGGTAT TGATCTTGTC TTTTTTGGGG TCATGACTAC ATTAAATATG ATGATTGGTA TTTTAACCCC TCCGATGGGA ATGGCGTTAT TTGTCGTTGC TCGGGTGGGT AATATGCCTG TAAGTGTGGT AGCAAAAGGC GTTTTACCTT TCTTGATACC AATTTTTATG ACCTTAGTAT TGATTACGAT TTTCCCACAA ATCATTACAT TTATACCTAA CTTGCTAATG CTTTAG
|
Protein sequence | MKIMNKLEEW VGGTLFLAIF LILIAQIVAR QVIHMPLIWS EELARLLFVY TALLGISMAV RAQQHVYIDF ITNLMPLTIK RMAMTFVQLL IFVSIILFIY LGYGVWADAT FPMEALKATF GTEITQKWLY AGLPIIASLM LFRFLEAQAE NFRNKATYLP VSFFLVSAVI IFAILYFQPE WFKSLRISNY VKFGKNAVYI ALAVWLVIMF LGTPVGWSLF IAALLYFSMT RWNITYSASG KLVDSLDSFP LLSVPFFILT GILMNTGGIT ERIFHFAKTL LGHYTGGMGH VNIGASLIFS GMSGSALADA GGLGQLEIKA MRDAGYDDDI CGGITAASCI IGPLVPPSIA MIIYGVIANE SIAKLFVAGF VPGVLVTIAL MVMNYYVSKK RGYPKTPKAS KAEVCAAFKE TFWAILTPFL IIGGIFSGLF TPTEAAVVAA AYSVIVGKFV YNDLNLKNFL KSCVEAVSIT GVTALMVMTV TFFGDMIARE QVAMKIAEIF VAVADSPITV LIMINLLLLF LGMFIDALAL QFLVLPMLIP IAMQFGIDLV FFGVMTTLNM MIGILTPPMG MALFVVARVG NMPVSVVAKG VLPFLIPIFM TLVLITIFPQ IITFIPNLLM L
|
| |