Gene Information |
Locus tag | Sare_4146 |
Symbol | |
ID | 5708303 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 4710434 |
End bp | 4712578 |
Gene Length | 2145 bp |
Protein Length | 714 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641273574 |
Product | dipeptidyl-peptidase IV |
Protein accession | YP_001538927 |
Protein GI | 159039674 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases |
TIGRFAM ID | |
|
|
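The coordinate, length, and protein-length fields above are related by simple arithmetic. The following minimal sketch checks that relationship. It assumes that the start and end positions are 1-based and inclusive and that the final codon is a stop codon that does not count toward the protein length. The variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 4710434
end_bp = 4712578

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# Split the gene into codons; a clean CDS leaves no leftover bases.
codons, leftover_bases = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = codons - 1

print(gene_length, leftover_bases, protein_length)
```

Under these assumptions the computed values agree with the listed Gene Length (2145 bp) and Protein Length (714 aa).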
Plasmid Coverage Information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0135173 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGACTTTC CGGAGCTGGC CGCGCGTACC CGTCGGTTCC GCCACGGGGC ACCGCGCGCG GTGTCGGTGG CCGACGACGG CTCCCGGGTG GTCTTCCTCC GCTCCGCAGG GCCGACGGAC CCCACCGACG CGCTCTGGCT GCTCGACGTG GACACCGGGG AGGAACGGCT CGTCGCCGAC CCGGCGGTGC TCCTTCAGGA GGACGCCGAC CAGCTCAGCC CGGGAGAGCG CACGCTGCGG GAACGGCTGC GGCTGAGCGT CTCCGGCATC GGTTCGTACG CCCTCGACTC GGCCGGCCGG GTGGCCGTCT TCGTGCTCGG TGGCCGACTG TTCCGGGCCG ACCTGATCCA CGGGGACGTG GTCGAGGTCG CCGCGGCCGG CCCGGTGCTC GATCCGCGCC CCGACCCGAC CGGACAGCGG CTGGCGTACG TGACCGACGC CGCGCCCGGA ATCCGCCGTG GCGAGCTACG GGTGGTCGAG TACGACGGCA CCGACACCAT GCTCGCCGGC GAGGACGCGG GGGTGATCTG GGGGCTGCCG GAACATGTCG CGGCGGAGGA GTTCGACCGG TTCCGGGGCT ACTGGTGGGC CCCGGACGGG CGCTCGGTGC TCGCCGCCCG GGTGGACGAG TCCCGGCTGG ACCGGTGGCA CCTACACGAC CCGGCCGAAC CGGCGACCGC GCCGACCACC GTCGCCTACC CCCGGGCGGG CGGGCCCAAC GCCGAGGTCA GCCTGCACCT GCTCGACCTC GACGGCGGCT GGGTCGACGT GCACTGGGAC CGGGAGACGT ACCCGTACCT GACCGCCGTG CACTGGACCG ACGGCGGGCC ACTGATCACG GTGCTGCGCC GGTCCCAGCA GCACGGGCTG GTGCTCGCGG TGGACCCGCG TACCGGCGAG ACACAGGTGC ATGCCGAGCT GGCCGACCCG CGCTGGGTGG AACCGGTCCC CGGCACTCCC GCCCACCTGC CCGACGGCCG GGTGCTGGTG GGCGGGGAAC TGGCCCACGA CGGGTACGAC GCGCGCTGCC TCTTCGCCGA CGGCACGCTG CTGACACCCC CGTCGCTCTA CGTGCGCCGG GTGGTGGGCC GGCTACCGGC CCACCCCGGC GCCGGGCCGG CCGACCTGCT GGTGGAGGCG ACCGAGGGCG AGCCAAGCGA GCAGCACCTG TTCCGGGTCC GCACCACGGT CGGCGGCGGC ATGGACTCCC GCCGGATCAC CACGGACGCC GGCTGGCACG TCGCGGTCGT GGGCGGGGAC GTGTTGGTCG TGGGTAGCGC CTCGCTGGAC CACCCGGGCC TGCGCTGGAC GGTGTGGCGA GGCGACCGGG AGGTGGCGAG GCTGCGGTCG TTCGCGGCGA CCCCACCGTA TGCTCCGCTG CCGTTGCTGG AGCGGGTGAC CGACCGGCGG CTGCCGGCCG CGGTGCTCTA CCCGGAGCAG CACGTCTCGG GCCGCCGGCT GCCGGTGCTG CTGGACGTGT ACGGCGGTCC CGGCCACCAG GAGGTGTTGG CGGCGCGGTC GGTGTGGTTG GAGCGGCAGT GGTGGGCCGA CGCCGGGTTC GCGGTGGTGG TGATCGACAA CCGCGGTACG CCGGGGGTCG CGCCGTCGTT CGAGAAGGCG ATCCACCGAC GGATGGCGGA CATGGTCCTC ACCGACCAGG TGGAGGGGCT CACCGCGCTC GCCGACAAGC ATCCCGACCT GGACCTCGGT CGGGTGGCCG TGCGGGGCTG GTCGTTCGGT GGCTGGCTGG CGGCGCTGGC GGTGCTGCGC CGCCCGGAGC TGTTCCGGTG CGGGATCGCC 
GGGGCGCCGG TGACCGACTG GAGTTTGTAC GACACCGCCT ACGCCGAGCG CTACCTGGGT CTGCCCGAGG ACGGGGCGGA CGTGTACGCC CACCACTCGC TGGTGGAGTT GGCCGCAGCG GCGACCTCGA CCACGGAGCA GGCCCCGCCC CTGCTGCTGG TGCACGGCAT GGCCGATGAC AACGTGGTGG CGGCGCACAC GCTGCGGCTG TCGGCCGCGC TGTTGACCAA TGGGCACCCG CATTCGGTGC TGCCGCTGAC CGGCGCGACG CACATGGCGG CCGGCGGCGC CGGCGAGCAC CTGCTGAAGC TGGAGCTGGC CTTTCTCCGT ACCCACCTGG ACTGA
|
Protein sequence | MDFPELAART RRFRHGAPRA VSVADDGSRV VFLRSAGPTD PTDALWLLDV DTGEERLVAD PAVLLQEDAD QLSPGERTLR ERLRLSVSGI GSYALDSAGR VAVFVLGGRL FRADLIHGDV VEVAAAGPVL DPRPDPTGQR LAYVTDAAPG IRRGELRVVE YDGTDTMLAG EDAGVIWGLP EHVAAEEFDR FRGYWWAPDG RSVLAARVDE SRLDRWHLHD PAEPATAPTT VAYPRAGGPN AEVSLHLLDL DGGWVDVHWD RETYPYLTAV HWTDGGPLIT VLRRSQQHGL VLAVDPRTGE TQVHAELADP RWVEPVPGTP AHLPDGRVLV GGELAHDGYD ARCLFADGTL LTPPSLYVRR VVGRLPAHPG AGPADLLVEA TEGEPSEQHL FRVRTTVGGG MDSRRITTDA GWHVAVVGGD VLVVGSASLD HPGLRWTVWR GDREVARLRS FAATPPYAPL PLLERVTDRR LPAAVLYPEQ HVSGRRLPVL LDVYGGPGHQ EVLAARSVWL ERQWWADAGF AVVVIDNRGT PGVAPSFEKA IHRRMADMVL TDQVEGLTAL ADKHPDLDLG RVAVRGWSFG GWLAALAVLR RPELFRCGIA GAPVTDWSLY DTAYAERYLG LPEDGADVYA HHSLVELAAA ATSTTEQAPP LLLVHGMADD NVVAAHTLRL SAALLTNGHP HSVLPLTGAT HMAAGGAGEH LLKLELAFLR THLD
|
| |
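The protein sequence follows from the gene sequence through translation table 11, as listed in the record. The sketch below shows one way this could be reproduced with the Python standard library alone. It rests on assumptions not stated in the record: that table 11 shares its codon-to-residue assignments with the standard code, that the set of alternative start codons shown is the one permitted by table 11, and that a start codon in the first position is read as methionine. The names `translate_cds`, `CODON_TABLE`, and `START_CODONS` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Residues for all 64 codons, enumerated in TCAG order (TTT, TTC, TTA, ...).
# Assumption: table 11 uses the same assignments as the standard genetic code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): residue
    for codon, residue in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumption: initiation codons accepted under translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, ignoring the spaces used in the record."""
    seq = "".join(dna.split()).upper()
    protein = []
    # Walk over complete codons only; trailing partial codons are skipped.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        residue = CODON_TABLE[codon]
        # A start codon in the first position is read as methionine.
        if i == 0 and codon in START_CODONS:
            residue = "M"
        # Stop at the first stop codon.
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence, which begins with GTG.
print(translate_cds("GTGGACTTTC CGGAGCTGGC CGCGCGTACC"))
```

Applied to the first 30 bases of the gene sequence, this gives the first ten residues of the listed protein sequence, including the GTG start codon read as M.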