Gene Acid345_2277 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_2277 
Symbol 
ID4073271 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2699778 
End bp2701883 
Gene Length2106 bp 
Protein Length701 aa 
Translation table11 
GC content59% 
IMG OID637984293 
Productdipeptidyl-peptidase 7 
Protein accessionYP_591352 
Protein GI94969304 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTATCTGT GCCGGGTCCC CTGCAAGCTC CTCTCCCTCC TCGTTATTGC CTTCTGCACT 
TTTACGACTT TCGCACAAGC CGATGAAGGC ATGTGGCTTT TCACCAACCC GCCTACGGCG
GCGATCCAGA AGAAATATGG CTTCGCCATC ACCCAGGAAT GGCTGGACCA CCTCCGGCTG
TCTTCATCGC GCGCGCCGGG CGGGTCGAGT GAGTTCGTTT CGCCCGACGG ACTGCTGATG
ACGAACCATC ACGTGGCGCA GAGCTGCATC CACGACCTTT CGAGCGGCGG CAAGGACTAC
ATGAAGGACG GCTTCTACGC TGCGACGCGC GAGCAGGAAC CGAAATGCCC GGGTATCGAG
TTTCTTGTGC TGACGGATAT CAAAGACGTC TCGGAACAGA TCCACTCGGC CGTGAAGGCG
CGCATGGCCT CCGCGGAATC CGGCAAAGCC ACGCGGCAGG CGATGTCGGC CGCCGAGAAA
GCCTGCTCTA CCGAGGGCTT CAAGTGCGAC GTCGTGACGC TCTACGCGGG CGCGATGTAT
CACCTTTACA AGTACAAGAA GTACACCGAC GTCCGGCTGG TGTTCGCGCC TGAGTTCCAA
ATGGCGTTCT TCGGCGGCGA TCCTGACAAC TTCACCTTCC CTCGCTACGA CCTCGACATT
ACGTTCTTCC GCATGTACGA GAACGGTAAA CCGGCGCATA CCGAAAACTA TCTGAAGTTT
GCCAAGAAGG GCGTGAAGGA GGGCGACCTA CTCTTCGTGA GCGGGCACCC CGGTCGTACC
AGCCGCCTGC TGTCCATCGC CGAACTTGAG TACCTGCGCG ACGTGCAGTA TCCGTGGCAG
ATCAAGAACC TGACCCGTCG CGTCAACCTG CTGCTTGCGT TTTCGAAGGA AGGAGCTGAG
CAGGCGCGCG AAGCCGAGCA CGATCTGTTC AGCTTCCAGA ATTCGCAGAA GGCCCTCACC
GGCTACAACA CGTTCTTCGC GAACAAAGAA GGCTGGGCGA AGAAGCAGGC TGACGAAAAA
CAGTTCCGCG ATTACGTAAA AGCCCACGCC GAACGCGAGA AAGAATTTGG CGATCCGTGG
ACTGAGGTGA ATCAAGCCGA GGGCACGCAG CGCGGGATGT TCTTCGATTA CCAGTACGTC
GAAATGCTCG CCGGGCTGCG CGGTTCGCTG GCTGCCGACG CGCGATTCAT CGTCCGTGCA
GCTCAGCAGC GGACCTTGCC GAATGACCAG CGCCTTCGTG GCTATACCGA CTCTGCCCTG
GCGACGCGCG AGCAAGAGCT GTTCTCCGAC GCGCCCGCGT ACAAAGACCT GAACAAAGCA
ATGCTGGCGG ACAGCCTCGC CGATATGCAG GAACAGGAAC CGAAGAATCC GTCGTTGCTG
AAAGTGCTTG CCGGGAAATC CCCGAAAGAC CGCGCGGCAG AATTGATTGA CGGCAGCAAG
CTCGACGACG CGGCCTACCG CAAGAAACTG TACGACGGCG GATTGAAGGC GGTGGAAGAA
AGTACCGATC CGCTGGTCGT GATGATGCGC AGCATCGAAC CGCAGGCGCT CGCGCTTCGC
CAGAAGTTCG ACGACGAAGT GGATCCGAAG CTCCGCGATG GCGGGGCGAA AATCGCGAAA
GTTCGCTTTG CGATCTTTGG CCAGACCCAG CCACCCGACG CCACCTTTAC TCTGCGGCTG
AGTTACGGGC CGGCGAAGGG CTACGAAGAA AACGGGAAAC ATATTCCGTG GAGCACCACC
ATGGACGGCG CGTACAAGCA CGCCGCCGAT CACGGAAACA AATCGCCCTA CGAATTACCG
AAGAGCTGGA TGGACGCCAA GGGCAAGTTC GACGGCAACA CGCCCTTCGA CGTCGTTACG
ACCGCGGACA TTATCGGCGG AAATTCTGGC AGCCCGGTGG TCAACACCAA CGGAGAACTA
GTCGGCATCA TCTTCGACGG CAATATCGAG TCGCTGCCGC TGAACTTTAT GTACGACGAT
CTGCAGGCCC GCGCAGTGCA CGTGGATTCA CGCGCGATCC TCGAGTCGCT GCAGAAGATC
TACCACGCCG ATGCGCTGTA CGAAGAAATC ACGGGCGCTT CGGGTATCAG CGCGGCCAAA
CAGTAA
 
Protein sequence
MYLCRVPCKL LSLLVIAFCT FTTFAQADEG MWLFTNPPTA AIQKKYGFAI TQEWLDHLRL 
SSSRAPGGSS EFVSPDGLLM TNHHVAQSCI HDLSSGGKDY MKDGFYAATR EQEPKCPGIE
FLVLTDIKDV SEQIHSAVKA RMASAESGKA TRQAMSAAEK ACSTEGFKCD VVTLYAGAMY
HLYKYKKYTD VRLVFAPEFQ MAFFGGDPDN FTFPRYDLDI TFFRMYENGK PAHTENYLKF
AKKGVKEGDL LFVSGHPGRT SRLLSIAELE YLRDVQYPWQ IKNLTRRVNL LLAFSKEGAE
QAREAEHDLF SFQNSQKALT GYNTFFANKE GWAKKQADEK QFRDYVKAHA EREKEFGDPW
TEVNQAEGTQ RGMFFDYQYV EMLAGLRGSL AADARFIVRA AQQRTLPNDQ RLRGYTDSAL
ATREQELFSD APAYKDLNKA MLADSLADMQ EQEPKNPSLL KVLAGKSPKD RAAELIDGSK
LDDAAYRKKL YDGGLKAVEE STDPLVVMMR SIEPQALALR QKFDDEVDPK LRDGGAKIAK
VRFAIFGQTQ PPDATFTLRL SYGPAKGYEE NGKHIPWSTT MDGAYKHAAD HGNKSPYELP
KSWMDAKGKF DGNTPFDVVT TADIIGGNSG SPVVNTNGEL VGIIFDGNIE SLPLNFMYDD
LQARAVHVDS RAILESLQKI YHADALYEEI TGASGISAAK Q