Gene Sare_4146 details


Gene Information

Locus tag: Sare_4146
Symbol:
ID: 5708303
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 4710434
End bp: 4712578
Gene Length: 2145 bp
Protein Length: 714 aa
Translation table: 11
GC content: 74%
IMG OID: 641273574
Product: dipeptidyl-peptidase IV
Protein accession: YP_001538927
Protein GI: 159039674
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases
TIGRFAM ID:
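
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 4710434
end_bp = 4712578

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted in the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")      # compare with the Gene Length field
print(protein_length, "aa")   # compare with the Protein Length field
```

Under these assumptions the computed values agree with the Gene Length (2145 bp) and Protein Length (714 aa) fields.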


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0135173
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGACTTTC CGGAGCTGGC CGCGCGTACC CGTCGGTTCC GCCACGGGGC ACCGCGCGCG 
GTGTCGGTGG CCGACGACGG CTCCCGGGTG GTCTTCCTCC GCTCCGCAGG GCCGACGGAC
CCCACCGACG CGCTCTGGCT GCTCGACGTG GACACCGGGG AGGAACGGCT CGTCGCCGAC
CCGGCGGTGC TCCTTCAGGA GGACGCCGAC CAGCTCAGCC CGGGAGAGCG CACGCTGCGG
GAACGGCTGC GGCTGAGCGT CTCCGGCATC GGTTCGTACG CCCTCGACTC GGCCGGCCGG
GTGGCCGTCT TCGTGCTCGG TGGCCGACTG TTCCGGGCCG ACCTGATCCA CGGGGACGTG
GTCGAGGTCG CCGCGGCCGG CCCGGTGCTC GATCCGCGCC CCGACCCGAC CGGACAGCGG
CTGGCGTACG TGACCGACGC CGCGCCCGGA ATCCGCCGTG GCGAGCTACG GGTGGTCGAG
TACGACGGCA CCGACACCAT GCTCGCCGGC GAGGACGCGG GGGTGATCTG GGGGCTGCCG
GAACATGTCG CGGCGGAGGA GTTCGACCGG TTCCGGGGCT ACTGGTGGGC CCCGGACGGG
CGCTCGGTGC TCGCCGCCCG GGTGGACGAG TCCCGGCTGG ACCGGTGGCA CCTACACGAC
CCGGCCGAAC CGGCGACCGC GCCGACCACC GTCGCCTACC CCCGGGCGGG CGGGCCCAAC
GCCGAGGTCA GCCTGCACCT GCTCGACCTC GACGGCGGCT GGGTCGACGT GCACTGGGAC
CGGGAGACGT ACCCGTACCT GACCGCCGTG CACTGGACCG ACGGCGGGCC ACTGATCACG
GTGCTGCGCC GGTCCCAGCA GCACGGGCTG GTGCTCGCGG TGGACCCGCG TACCGGCGAG
ACACAGGTGC ATGCCGAGCT GGCCGACCCG CGCTGGGTGG AACCGGTCCC CGGCACTCCC
GCCCACCTGC CCGACGGCCG GGTGCTGGTG GGCGGGGAAC TGGCCCACGA CGGGTACGAC
GCGCGCTGCC TCTTCGCCGA CGGCACGCTG CTGACACCCC CGTCGCTCTA CGTGCGCCGG
GTGGTGGGCC GGCTACCGGC CCACCCCGGC GCCGGGCCGG CCGACCTGCT GGTGGAGGCG
ACCGAGGGCG AGCCAAGCGA GCAGCACCTG TTCCGGGTCC GCACCACGGT CGGCGGCGGC
ATGGACTCCC GCCGGATCAC CACGGACGCC GGCTGGCACG TCGCGGTCGT GGGCGGGGAC
GTGTTGGTCG TGGGTAGCGC CTCGCTGGAC CACCCGGGCC TGCGCTGGAC GGTGTGGCGA
GGCGACCGGG AGGTGGCGAG GCTGCGGTCG TTCGCGGCGA CCCCACCGTA TGCTCCGCTG
CCGTTGCTGG AGCGGGTGAC CGACCGGCGG CTGCCGGCCG CGGTGCTCTA CCCGGAGCAG
CACGTCTCGG GCCGCCGGCT GCCGGTGCTG CTGGACGTGT ACGGCGGTCC CGGCCACCAG
GAGGTGTTGG CGGCGCGGTC GGTGTGGTTG GAGCGGCAGT GGTGGGCCGA CGCCGGGTTC
GCGGTGGTGG TGATCGACAA CCGCGGTACG CCGGGGGTCG CGCCGTCGTT CGAGAAGGCG
ATCCACCGAC GGATGGCGGA CATGGTCCTC ACCGACCAGG TGGAGGGGCT CACCGCGCTC
GCCGACAAGC ATCCCGACCT GGACCTCGGT CGGGTGGCCG TGCGGGGCTG GTCGTTCGGT
GGCTGGCTGG CGGCGCTGGC GGTGCTGCGC CGCCCGGAGC TGTTCCGGTG CGGGATCGCC
GGGGCGCCGG TGACCGACTG GAGTTTGTAC GACACCGCCT ACGCCGAGCG CTACCTGGGT
CTGCCCGAGG ACGGGGCGGA CGTGTACGCC CACCACTCGC TGGTGGAGTT GGCCGCAGCG
GCGACCTCGA CCACGGAGCA GGCCCCGCCC CTGCTGCTGG TGCACGGCAT GGCCGATGAC
AACGTGGTGG CGGCGCACAC GCTGCGGCTG TCGGCCGCGC TGTTGACCAA TGGGCACCCG
CATTCGGTGC TGCCGCTGAC CGGCGCGACG CACATGGCGG CCGGCGGCGC CGGCGAGCAC
CTGCTGAAGC TGGAGCTGGC CTTTCTCCGT ACCCACCTGG ACTGA
 
Protein sequence
MDFPELAART RRFRHGAPRA VSVADDGSRV VFLRSAGPTD PTDALWLLDV DTGEERLVAD 
PAVLLQEDAD QLSPGERTLR ERLRLSVSGI GSYALDSAGR VAVFVLGGRL FRADLIHGDV
VEVAAAGPVL DPRPDPTGQR LAYVTDAAPG IRRGELRVVE YDGTDTMLAG EDAGVIWGLP
EHVAAEEFDR FRGYWWAPDG RSVLAARVDE SRLDRWHLHD PAEPATAPTT VAYPRAGGPN
AEVSLHLLDL DGGWVDVHWD RETYPYLTAV HWTDGGPLIT VLRRSQQHGL VLAVDPRTGE
TQVHAELADP RWVEPVPGTP AHLPDGRVLV GGELAHDGYD ARCLFADGTL LTPPSLYVRR
VVGRLPAHPG AGPADLLVEA TEGEPSEQHL FRVRTTVGGG MDSRRITTDA GWHVAVVGGD
VLVVGSASLD HPGLRWTVWR GDREVARLRS FAATPPYAPL PLLERVTDRR LPAAVLYPEQ
HVSGRRLPVL LDVYGGPGHQ EVLAARSVWL ERQWWADAGF AVVVIDNRGT PGVAPSFEKA
IHRRMADMVL TDQVEGLTAL ADKHPDLDLG RVAVRGWSFG GWLAALAVLR RPELFRCGIA
GAPVTDWSLY DTAYAERYLG LPEDGADVYA HHSLVELAAA ATSTTEQAPP LLLVHGMADD
NVVAAHTLRL SAALLTNGHP HSVLPLTGAT HMAAGGAGEH LLKLELAFLR THLD
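
The gene sequence begins with GTG while the protein sequence begins with M. This is expected under translation table 11, where GTG can serve as an initiation codon and is then read as methionine. The following is a minimal sketch of that translation step, not the pipeline used to produce this record: the function name `translate_cds` is hypothetical, the codon table is the standard amino-acid assignment in NCBI's TCAG ordering, and `START_CODONS` lists only a subset of the table 11 initiation codons.

```python
from itertools import product

# Standard amino-acid assignments in NCBI's TCAG codon ordering
# (table 11 uses the same assignments; it differs in allowed start codons).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: illustrative subset of table 11 initiation codons.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading an accepted start codon as M and stopping at '*'."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    usable = len(seq) - len(seq) % 3    # ignore any trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")         # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":                   # stop codon terminates translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
first_codons = "GTGGACTTTC CGGAGCTGGC CGCGCGTACC"
print(translate_cds(first_codons))  # prints MDFPELAART
```

The printed residues match the first ten residues of the protein sequence above. Passing the full gene sequence to the same function should reproduce the full protein sequence under the same assumptions.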