Gene Acid345_0056 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0056 
Symbol 
ID4069990 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp52837 
End bp55080 
Gene Length2244 bp 
Protein Length747 aa 
Translation table11 
GC content59% 
IMG OID637982056 
Productpeptidase S9B, dipeptidylpeptidase IV-like 
Protein accessionYP_589135 
Protein GI94967087 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAAAGCAC ATCTAGCTAC GTTCCTTCTC TTCACTTCCT CTTTTACGTT GGTAAGCGCG 
CAGGAAGCGC CGAAGCCGAA ACAGCTAACC ATTGAAGCGA TCTTCGCGAA GGGCGGAGTG
CTCGGCCGCG CGCCGGAGTC GGTGGAGTGG AGTCCGGATG GAACGAAGGT CTCGTTCGTG
CAGCGTGATG ATTCGGGCGA TAACGGTGCG CTCTATTACG TGGATGTGAC CACCGGAGCG
AAGCCAGCTG TGCTTGTCGC GCAGGAGAAG CTGACCGAGA TGAAGCCGCC GGCGAAGACG
AAGTCCGATG ACCGCGAGAA GGACAATCGC GAGCGGTATT CGGTCGCGGC CTATCATTGG
GCGCCGGATT CCAAGCACAT CCTCTTCGAC TCGGGCGGTG CGCTGTGGAA CTATGACCTC
GCGGCGGGGA AGTCGGCGAT GATTGCGTCG GCCGAAGGTG GGCTTGGCGA TCCGAAGTTC
TCGCCGTCGG GAGATCGCAT TTCCTATCTG CGCGAGCACG ATCTGTATGT ATCGGGACTT
GATGGAAAGG CGAAGCGAAT TACCGAAGGT GGCAATGCGA ACGTCCTGAA CGGCGAAGTG
GATTGGGTTT ACGCGGAGGA GCTGAACGTC CGCAGCAATT ACTTCTGGTC ACCGAATGGG
AAGCAGATTG TGTATCTGCA AATGGACCAG ACGAAGGTGC CGACCTATCC GATTACCGAT
TACATCCCGA CGCAGGCGAC GGTGGATGAG GAGAAATTCC CCAAGCCGGG CGATCCGAAC
CCGTCGGTGA AACTTGGCGT GGTGAGCGCG ACGGGCGGGA AGACGAAGTG GATCGAGCTG
CCGGCAACGG ATGCATATAT CTCGCGCTTT GGGTGGGTGC ACGATGGACT GCTGTATGCG
TTCGTGCTGA ACCGTCCGCA GAACAAGCTG GATTTGTACC TGGTGGATGC GAAGTCGGGG
CGCACGCAGG TAGCGATGAC GGAGACGAGC CCGTCGTGGA TTGAGACCAA TGATGAGTAC
AAGTTCATTG CTAATGGAGA GAAGCTGCTG TGGACGAGCT GGCGCGATGG GCACACGCAC
ATCTACTTGT ATGACCTCGA CAAGAGCAAT GCGCTGGCGC CGCTGAAGCT GGAGCGGCAA
CTGACGCGCG GGGATTGGGA CGTCGTCTCC ATCGATGGCG TGAATGAGAA GACGGGGATC
GTCTACTACA GCTCGGACCA GGAAGACGAG CGGCAGCGGC AGGAGTATCG CGTGAACCTG
GCCGATGGCG TGAGCGAGAA GATCACCAAG GACCACGGCA CGCATGAAGC GAAGTTTGCA
CCCGAGGCGA ACTGGTTCGT GGACAACTAC TCGGCGCTGA CGACTCCGCC GGCGCTTGCG
GTGTGCACGC TGAAGGATGA GTGCACAACG TTCTGGAGCG GACGGAGCGT GAAGGACTAC
GCTCTGCTGG TACCGCAGTT TGTAGATGCA AAGGCCGATG ATGGCACGGT GATGCACGGG
GTTCTGTTGA TGCCAACAGA AGGCGTAGCG ATGGTGAACG GCAAAGTGCC GTTGATCACC
AATCCGTACG GCGGACCGGG CGTTCCGGGG GATTGGGATT CCTGGGGATC GGTTGATCTC
TTCGACCAAT ATATGGCGAA GCGCGGCTAC GCGATCCTGA AGATGGAGAA CCGCGGCATG
GCGGGACGCG GCGAGAAGTT CGCGGCGCCG ATTATGCACC ACATGTGCGA GCTTCCGCTG
AAAGACCAAC TGGCTTCCGT CGAGCAGGTG CTGAAGCAGT TCCCGCAGCT CGATCGCGCC
CGGCTTGGGT GGTGGGGATG GAGCTATGGC GGCACCATGA CCGCCTGGGC GCTCGAGCAT
TCCGACTGGT TCAAGGTGGG CGTGAGCGTG GCGCCGGTGA CCGACTGGCG CAACTACGAC
TCGATCTACA CCGAGCGCTA CATGGGCATG CCGAAGGAAC AGGCGGCGGA CTATGACCGG
ACGTCGGTGG TGCTGAACGC GAAGCAGATT CATGGCAGGC TGCTGGTGGT GCACGGCACG
AGCGACGACA ACGTGCACAT GCAGAACTCG ATGCAATTCA TGTATGCGTT GATCAATCAC
GGCGTGCCGT TCGATGTGCA GATTTATCCG CGAAAAACGC ATTCGATCTC GGGAGAAGAA
ACGAGGGTGC ATCTCTTTCA CCGCATTCAA AGGCAGTTTG ACGATGTGCT GATGCCGAAG
ACGGCTGCGT CATCTACTCC GTAA
 
Protein sequence
MKAHLATFLL FTSSFTLVSA QEAPKPKQLT IEAIFAKGGV LGRAPESVEW SPDGTKVSFV 
QRDDSGDNGA LYYVDVTTGA KPAVLVAQEK LTEMKPPAKT KSDDREKDNR ERYSVAAYHW
APDSKHILFD SGGALWNYDL AAGKSAMIAS AEGGLGDPKF SPSGDRISYL REHDLYVSGL
DGKAKRITEG GNANVLNGEV DWVYAEELNV RSNYFWSPNG KQIVYLQMDQ TKVPTYPITD
YIPTQATVDE EKFPKPGDPN PSVKLGVVSA TGGKTKWIEL PATDAYISRF GWVHDGLLYA
FVLNRPQNKL DLYLVDAKSG RTQVAMTETS PSWIETNDEY KFIANGEKLL WTSWRDGHTH
IYLYDLDKSN ALAPLKLERQ LTRGDWDVVS IDGVNEKTGI VYYSSDQEDE RQRQEYRVNL
ADGVSEKITK DHGTHEAKFA PEANWFVDNY SALTTPPALA VCTLKDECTT FWSGRSVKDY
ALLVPQFVDA KADDGTVMHG VLLMPTEGVA MVNGKVPLIT NPYGGPGVPG DWDSWGSVDL
FDQYMAKRGY AILKMENRGM AGRGEKFAAP IMHHMCELPL KDQLASVEQV LKQFPQLDRA
RLGWWGWSYG GTMTAWALEH SDWFKVGVSV APVTDWRNYD SIYTERYMGM PKEQAADYDR
TSVVLNAKQI HGRLLVVHGT SDDNVHMQNS MQFMYALINH GVPFDVQIYP RKTHSISGEE
TRVHLFHRIQ RQFDDVLMPK TAASSTP