Gene Information |
Locus tag | HS_0004 |
Symbol | pepD |
ID | 4239511 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haemophilus somnus 129PT |
Kingdom | Bacteria |
Replicon accession | NC_008309 |
Strand | + |
Start bp | 3990 |
End bp | 5450 |
Gene Length | 1461 bp |
Protein Length | 486 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 638103534 |
Product | M20C family Xaa-His dipeptidase |
Protein accession | YP_718209 |
Protein GI | 113460153 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2195] Di- and tripeptidases |
TIGRFAM ID | [TIGR01893] aminoacyl-histidine dipeptidase |
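
The coordinates and lengths listed above can be cross-checked with a short sketch. It assumes that Start bp and End bp are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function and variable names are illustrative and are not part of the database record.

```python
# Values copied from the Gene Information table above.
start_bp = 3990
end_bp = 5450


def gene_length(start, end):
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(length_bp):
    """Residue count, assuming an unspliced CDS ending in one stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # subtract the stop codon


length_bp = gene_length(start_bp, end_bp)
print(length_bp, protein_length(length_bp))  # prints: 1461 486
```

Under these assumptions the result agrees with the Gene Length (1461 bp) and Protein Length (486 aa) fields.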
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCAGATC TTCAATCTCT GCAACCTAAA CTACTTTGGC AATGGTTTGA TCAAATTTGT GCTATTCCAC ATCCTTCTTA CAAAGAAGAG CAGTTAGCAC AATTCATTAT TAATTGGGCA AAAACAAAAG GTTTTTTTGC GGAACGTGAT GAAGTCGGTA ATGTATTAAT TCGTAAACCG GCAACAGTAG GAATGGAAAA TCGTAAACCT GTAGTACTAC AAGCACACTT AGATATGGTT CCACAAGCTA ATGAAGGGAC AAATCATAAT TTTGATCAGG ATCCTATTTT GCCATATATC GATGGTGATT GGGTTAAGGC TAAAGGTACA ACGTTAGGTG CTGATAACGG TATTGGTATG GCATCTGCAC TTGCTGTCTT AGAAAGTAAT GATATTGCAC ATCCGGAGTT GGAAGTATTG TTAACCATGA CTGAAGAAAG GGGGATGGAG GGGGCGATTG GGCTACGTCC AAATTGGCTT CGTTCTGAGA TCTTAATTAA TACCGATACT GAAGAAAATG GAGAAATTTA TATAGGTTGT GCCGGCGGTG AAAATGCAGA TCTTGAATTA CCTATTGAGT ATCAAGTCAA TAATTTCGAA CATTGTTATC AAGTTGTGCT AAAAGGGTTA CGAGGCGGGC ATTCTGGTGT GGATATTCAT ACCGGACGAG CTAATGCAAT CAAAGTGTTG CTTCGTTTTT TAGCAGAACT TCAACAAAAC CAACCGCACT TTGACTTCAC TTTAGCAAAT ATCCGTGGCG GTTCTATTCG CAATGCTATT CCAAGAGAAA GTGTTGCCAC TTTGGTCTTT AATGGTGATA TTACCGTATT ACAAAGTGCG GTACAAAAAT TTGCAGATGT AATCAAAGCC GAATTGGCAC TAACTGAGCC AAACTTGATA TTTACACTTG AAAAAGTTGA AAAACCTCAA CAAGTATTTT CCAGTCAATG CACGAAAAAT ATTATCCATT GTTTGAATGT TTTGCCAAAT GGCGTAGTAC GTAACAGTGA TGTCATTGAA AATGTGGTAG AAACATCATT AAGCATTGGC GTGTTAAAAA CTGAAGATAA TTTTGTTAGA AGTACCATGT TAGTGCGGTC ATTAATTGAA AGTGGCAAAT CCTATGTTGC TTCTTTATTA AAATCTTTAG CCTCATTAGC ACAAGGTAAT ATCAATTTAT CAGGCGATTA TCCGGGTTGG GAACCACAAA GTCATAGTGA TATTTTGGAC TTAACTAAGA CAATTTATGC ACAAGTTTTA GGTACAGATC CTGAAATCAA GGTAATTCAT GCGGGACTTG AATGTGGGTT ATTGAAAAAA ATCTATCCAA CGATCGATAT GGTATCTATC GGACCGACAA TTAGAAATGC ACATTCTCCG GATGAAAAAG TACATATTCC GGCTGTGGAA ACTTATTGGA AAGTATTAAC CGGTATACTT GCTCATATTC CATCACGTTA A
|
Protein sequence | MSDLQSLQPK LLWQWFDQIC AIPHPSYKEE QLAQFIINWA KTKGFFAERD EVGNVLIRKP ATVGMENRKP VVLQAHLDMV PQANEGTNHN FDQDPILPYI DGDWVKAKGT TLGADNGIGM ASALAVLESN DIAHPELEVL LTMTEERGME GAIGLRPNWL RSEILINTDT EENGEIYIGC AGGENADLEL PIEYQVNNFE HCYQVVLKGL RGGHSGVDIH TGRANAIKVL LRFLAELQQN QPHFDFTLAN IRGGSIRNAI PRESVATLVF NGDITVLQSA VQKFADVIKA ELALTEPNLI FTLEKVEKPQ QVFSSQCTKN IIHCLNVLPN GVVRNSDVIE NVVETSLSIG VLKTEDNFVR STMLVRSLIE SGKSYVASLL KSLASLAQGN INLSGDYPGW EPQSHSDILD LTKTIYAQVL GTDPEIKVIH AGLECGLLKK IYPTIDMVSI GPTIRNAHSP DEKVHIPAVE TYWKVLTGIL AHIPSR
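
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. Translation table 11 assigns the same amino acid to each codon as the standard code (it differs in the permitted start codons), so a standard codon table is used here. The `translate` helper is hypothetical. It strips the spacing used in the record and stops at the first stop codon. Only the first 30 and last 21 bases of the gene sequence are shown, to keep the example short.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a + strand CDS, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGTCAGATC TTCAATCTCT GCAACCTAAA"))  # prints: MSDLQSLQPK
# Last 21 bases of the gene sequence above, including the TAA stop codon.
print(translate("GCTCATATTC CATCACGTTA A"))  # prints: AHIPSR
```

Both fragments match the corresponding ends of the protein sequence listed above.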
|
| |