Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | HS_0762 |
Symbol | |
ID | 4240253 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haemophilus somnus 129PT |
Kingdom | Bacteria |
Replicon accession | NC_008309 |
Strand | + |
Start bp | 816285 |
End bp | 817271 |
Gene Length | 987 bp |
Protein Length | 328 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 638104316 |
Product | C4-dicarboxylate transport system, periplasmic component |
Protein accession | YP_718972 |
Protein GI | 113460905 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1638] TRAP-type C4-dicarboxylate transport system, periplasmic component |
TIGRFAM ID | [TIGR00787] tripartite ATP-independent periplasmic transporter solute receptor, DctP family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACACTAT TAAATTTTAA AACGCTTTCT GTGTTAATCG CTAGCATAGC TCTATTCTCA GGTAATACCA ATGCTGCAAC GACATTACGT TTTGGTTATG AAGCTCCTCG TGGTGATACA CAACATGAAG CAGCGAAGAA ATTTAATGAC CTTTTAAAGG AAAAAACAAA AGGAGAAATT AAGTTAACAC TTTTTCCAGA CAGTACGTTA GGTAACGCAC AAACTATGAT CAGTGCTGTA CGTGGTGGAA CGATTGACTT GGAAATGTCA GGTTCTCCAA ATTTTTCTGG ATTAGTACCA AAATTAAATG TGATTGATAT TCCATTCATT TTCCAAAACC GCGAGCATGT ATATGCCGTA TTGGATAGTG AAATTGGTCA AAGTTTATTA AAAGAATTAG AAGCCAAAGG ATTAAAAGGT TTAGCTTTTT GGGAAGTAGG ATTTCGATCT TTTACTAACT CAAAACATCC TGTAAAATCC CCTGATGATA TTAAAGGGTT AAAAGTGCGG ACTAACCAAA ATCCAATGTA TATTCAAGCA TTTTCAATCT TAGGGGCAAA TCCAGTTCCA ATGCCATTAT CCGAACTTTA TACCGCACTT GAAACACGTG CGGTAGATGC TCAAGAACAT CCTGTTGGTA TAGTTTGGTC TGCTAAATTA TATGAGGTGC AAAAATACTT AAGCTTAACT AACCATGGTT ATACGCCATT GATTGTTGTT ATGAACAAAG CCAAGTTCGA TGCTCTGTCT CCGGCATTGC AAACAGCATT AGTCGAAGCT GCACAAGAAG CAGGTGCTTA TCAACGCCAG CTTAACCTTA GAAATGAAAA AAGCATTATT GAAAAAATGC AAAAAGCGGG CGTTGAAATT ATTGAGCGTG TTGATACCGG ACCGTTTAGG GCAGCCATTG AAAACGAAGT ACGCAAGGCA TTCATTGAGA AAAATGGTGA TGATTTAGTT AAAAAAATTG ATGCATTAGC AAAATAA
|
Protein sequence | MTLLNFKTLS VLIASIALFS GNTNAATTLR FGYEAPRGDT QHEAAKKFND LLKEKTKGEI KLTLFPDSTL GNAQTMISAV RGGTIDLEMS GSPNFSGLVP KLNVIDIPFI FQNREHVYAV LDSEIGQSLL KELEAKGLKG LAFWEVGFRS FTNSKHPVKS PDDIKGLKVR TNQNPMYIQA FSILGANPVP MPLSELYTAL ETRAVDAQEH PVGIVWSAKL YEVQKYLSLT NHGYTPLIVV MNKAKFDALS PALQTALVEA AQEAGAYQRQ LNLRNEKSII EKMQKAGVEI IERVDTGPFR AAIENEVRKA FIEKNGDDLV KKIDALAK
|
| |