Gene Information |
Locus tag | Teth514_2070 |
Symbol | |
ID | 5876561 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoanaerobacter sp. X514 |
Kingdom | Bacteria |
Replicon accession | NC_010320 |
Strand | - |
Start bp | 2081098 |
End bp | 2082492 |
Gene Length | 1395 bp |
Protein Length | 464 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 641542422 |
Product | dipeptidase PepV |
Protein accession | YP_001663678 |
Protein GI | 167040693 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0624] Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases |
TIGRFAM ID | [TIGR01887] dipeptidase, putative |
|
|
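The coordinate and length fields in the table above are related by simple arithmetic, which the following sketch checks. It assumes the Start bp and End bp values are 1-based and inclusive (this is consistent with the numbers in the record, but is not stated in it) and that the CDS ends in a single stop codon. The function names are illustrative, not part of any database API.

```python
# Values copied from the Gene Information table above.
start_bp = 2081098
end_bp = 2082492


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # subtract the stop codon


length = gene_length(start_bp, end_bp)
print(length, protein_length(length))  # 1395 464
```

The strand ("-") does not change the length calculation; it only determines which strand of the replicon is read.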
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0000110945 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATTTAA ATTCCTACAT AGACAATATG AGAGATGACA TTATAAAATC AGTTCAAGAG TTGGTCAGGA TAAAAAGCGT ACAAGATGAA CCGAAACCAG GAATGCCTTA TGGAGAAGGT ATTGCTAAGG CATTGGACAA AGCTTTAGAG ATTGCACAAA GTTTAGGCTT AAAAACTAAA AATGTGGATG GCTATGTAGG GTATGCAGAA TACGGCGAAG GAGAAGAAAT GATAGGAGTT TTGGGACATT TAGATGTGGT TCCAGAAGGT GATGGGTGGA CTTATCCACC TTATGGTGCT GAAATCCATG ATGGAAAGAT ATATGGAAGA GGTACTGTAG ATGATAAGGG ACCGATAATC GCTGCTTTAT ATGGCTTAAA AGCTATAAAA GATGCAGGAT TAGAACTTTC CAGAAGAGTG AGGATTTTAT TTGGTACAAA TGAAGAGACT GGTTCTCATG AAATTCCTTA TTATTTAAAA CACGATGAAG CGCCTACAAT GGGCTTTACT CCTGATGCTC AATATCCCAT CATATATGCA GAAAAGGGAA TAACAATGTT TAATGTAGTA AAAGACTTTA ATAAAAAGCC CAGCAATATA GTTATAAAAT ACATCAAAGG TGGAGAAAGG CCTAATGTAG TGCCAGGCTT TTGTGAAGCT GGATTAAAGG TAAAAGAGGC AAATAAGAAG AAAGAAATTC AAGATAAGCT AGAAGCTTTC GTAAAAGAGA CGGGTTACAA TTTGAAGGCT GAAGAAAAAG ACGAAATGCT TGTAATCAAA TCAGTAGGGG TTTCGGCTCA TGGCAGCCTT CCACACTTAG GGAAAAATGC TATAATGCAG CTATTTCTCT TCCTTGACAG AATCGATTTA GAAGATAGCG ATGTCAAAGA TTTTATACAT TTCTTCGCTA CAAATATTGG AATAGAAACA AACGGAAAAA CTTTTGGAAT ATACTTAAAA GATGAAACAG GAGAATTGAC TTTTAACGTT GGTACAATTC AATTAGATGA AAGCAAAGGA GTATTAGGTT TAAATATCAG GTATCCTGTA AAATATAAAT ATGAGGATTG GATGAATATT TTTGAGAATA AAATCAAAAC TAATGGAATG AGAATAGAAG ACATGCTCCA TCAGCCACCT TTGTATTTCC CACCAGCCCA TCCTTTGATA AAAACTTTGA GCAAGGTTTA TGAAGAACAG ACAGGACAGA AGGCAGAGCT TTTAGCAATA GGTGGAGGAA CTTACGCGAA AGAGATGCCT AACACAGTGG CTTTTGGGCC TGTTTTCCCA GGCAAGCCAG AGTTAGCACA TCAAGCGGAT GAATATATAG AAATTGAAGA TTTGATATTA AATGCAAAAA TTTATGCTCA CGCTATATAT GAATTAGCAA AATAA
|
Protein sequence | MDLNSYIDNM RDDIIKSVQE LVRIKSVQDE PKPGMPYGEG IAKALDKALE IAQSLGLKTK NVDGYVGYAE YGEGEEMIGV LGHLDVVPEG DGWTYPPYGA EIHDGKIYGR GTVDDKGPII AALYGLKAIK DAGLELSRRV RILFGTNEET GSHEIPYYLK HDEAPTMGFT PDAQYPIIYA EKGITMFNVV KDFNKKPSNI VIKYIKGGER PNVVPGFCEA GLKVKEANKK KEIQDKLEAF VKETGYNLKA EEKDEMLVIK SVGVSAHGSL PHLGKNAIMQ LFLFLDRIDL EDSDVKDFIH FFATNIGIET NGKTFGIYLK DETGELTFNV GTIQLDESKG VLGLNIRYPV KYKYEDWMNI FENKIKTNGM RIEDMLHQPP LYFPPAHPLI KTLSKVYEEQ TGQKAELLAI GGGTYAKEMP NTVAFGPVFP GKPELAHQAD EYIEIEDLIL NAKIYAHAIY ELAK
|
| |
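The gene and protein sequences above can be cross-checked with a small translation routine. The sketch below is a minimal example, not the database's own method: it builds a codon table in the usual NCBI ordering (bases T, C, A, G), strips the spaces that separate the 10-base groups, and translates in frame. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in which start codons are permitted, so the same table is used here. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base groups and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1; '*' marks a stop codon.

    Raises KeyError for codons containing anything other than A, C, G, T.
    """
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence, as printed in the record.
gene_start = "ATGGATTTAA ATTCCTACAT AGACAATATG"
print(translate(gene_start))  # MDLNSYIDNM
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and dropping the trailing `*` gives a string that can be compared against the full protein sequence, and `gc_percent` can be compared against the GC content field in the same way.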