Gene Acid345_1151 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1151 
Symbol 
ID4069960 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp1434838 
End bp1437090 
Gene Length2253 bp 
Protein Length750 aa 
Translation table11 
GC content57% 
IMG OID637983161 
Productpeptidase S9, prolyl oligopeptidase active site region 
Protein accessionYP_590228 
Protein GI94968180 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.16756 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCAAGTTT TTCCTCAAGA CAAAACCCTC AACAGGAAGG ACACCAAGGG CAACAAGGGT 
TTTGCTTGGG GCGATGACCT GCCGATCACA GGAACTCTTC GAATCTATTC CAGATTTCAG
AATCTCTTCG TGGTCTCCTT GGTGAAAGAT TTCGCTCTGT TATCCTCCTC AACCATGCGC
AACAGGATTC TTCTCTCGGC TCTTCTGATC GCATCCACCT TCGCATTTCC TCAATTCACC
ATCGAGCAGG TTCTTAGTTC TCCATTCCCG AGTGAACTCG TCGCTGCCAA AGATGCTTCG
CGCGTCGCTT GGGTCTTCGA CAATCGGGGC GAGCGCAACG TCTGGATCGC CGACGCTCCC
GACTTCAAAG CGCGCCAACT CACCCATTAC CACGGCGACG ACGGCCAACC GATCGCTGCG
CTCACCATCA GCAACGACGG CAAGACGATT CTTTACGCTC GCGGAACCGA AGTAAATGGC
GGCGGCTCTT CCGCAAATCC TCAGAGTCTG ACCGCAGGCG CAAAGCAGCA GGTCTTCGCT
ATTGATGTAG CGAAGGGAGA GCCGCGCACA CTTGGCGACA TGGGCTGTGG CGAAGAAGGT
TGCGAGGACA TCCAAATCTC TCCTGACGGC AAGTGGGCCG CATGGGCGAA CAAGACCGGC
ATCCTTCTGG CACCCATCGA CGGCAAACAG CAGGCCAAGA AGCTCAATGA GCTTCGCGGC
GAACTCTCCG AACCACGGTG GTCGCCGGAC GGAAAGCGGC TCGCGTTCGT CGTCAATCGC
ACCGACCATA GTTTTATCGG GCTGCAGGAA ATCGGCGGAA CCGAAGTCCG CTATCTAGCT
CCAAGCACCA ATCGTGACGA TCTTCCGCGA TGGTCCGCGG ACGGCAAGCA GATCGCCTTC
GTTCGTCAGC CCGGAGTGCA GGCGAAATTG CCGTTGATCC CGGTACGTCC GCACCCCTGG
TCGATCTGGG TCGCCGACGC CAGCACCGCA CAGGGCAAAG AGGTCTGGCA CAGCGGCGAT
CAACCTCGAG ACTCATTCCC GATGTTCATC GGTACGTCGT TTTATTTCGC GGGTGATCGC
ATTGTCTTCG TCTCCACGCG CGACAACCGC AATCACCTCT ACTCGGTTCC GGCCGCCGGG
GGTGAAGCGA CGCAACTCAC GCAAGGCAAT TTCGAAGTCG AAGACGTGAC CCTTAGCAAA
GACCAAAAGT GGATCACCTA CTCGTCGAAT GAGTACACGA GTGATTCCAA GGACGAAGAC
CGGCGTCATC TCTGGCGCGT CCCGGCAACT GGCGGTCAGC GCCAGCAACT CAGCAGCGGT
GAGACGATCG AATGGTCGCC CACGCTTGTC GGCGACAAAG TCATCTGTCT CGGATCAAGT
GCGACTTCGC CCGCCATGCC CTACGAAGTA ACCGGCAGCA GCCGCAGGCT GATTGCCGCC
GACATGCTGA AAGATTTCCC CTCGAACCAG TTGGTAACAC CCCAGCAAGT CATCTTCGAT
AGTGACGGCC TGAAGATCCA CGGCCAACTA TTCGTCCCCA AGGATGGCAA ATCAACCCAT
CCCGCGCTGA TCTTCACCCA CGGCGGACCG GTCCGACAGA TGATGCTCGG CTTCCACTAC
ATGGACTACT ACCACAACGC CTACGCCATG AATCAATACC TGGCGAGCAA GGGCTATGTG
GTGCTGTCCG TGAATTACCG CCTCGGCATC ATGTACGGAT ACGACTTCCT CAACCCACCC
AACACAGTTT GGCGCGGTGC GGCCGAGTAC AACGACGTTG TCGCTGGAGC GAAGTACCTG
CAGTCCCTGT CTAACGTCGA CAAATCGAAG ATCGGCCTCT GGGGCGGATC GTACGGAGGA
TTCCTCACGG CCATGGGCCT AGCCCGCAAC TCCGACATCT TCTCTGCCGG CGTGGACTTC
CATGGCGTAC ACGACTGGTC GGCCTTCATT GGTGAGTGGG AGAACAATGC GACGGCTGCG
CCCGATGCGA AGGAAGCCCA GAAACTGGCC TTCGATTCAT CGCCTGAGGC ATCGATCAGC
ACGTGGAAGT CTCCAGTGCT GTTGATTCAC GGCGACGACG ACCGTAACGT ACCGTTCTCG
CAAACGACCA CCCTCGCGGA GAAACTGAAG AACCAGGGCG TGGAGTTTGA AGAGCTGATC
TTCCCTGATG AGATCCACGG GTTCCTGATG TTCAAATCGT GGATCAAGGC CTACTCGGTG
GAGGAGCAGT ATTTCGACAA GAAACTTAAG TAA
 
Protein sequence
MQVFPQDKTL NRKDTKGNKG FAWGDDLPIT GTLRIYSRFQ NLFVVSLVKD FALLSSSTMR 
NRILLSALLI ASTFAFPQFT IEQVLSSPFP SELVAAKDAS RVAWVFDNRG ERNVWIADAP
DFKARQLTHY HGDDGQPIAA LTISNDGKTI LYARGTEVNG GGSSANPQSL TAGAKQQVFA
IDVAKGEPRT LGDMGCGEEG CEDIQISPDG KWAAWANKTG ILLAPIDGKQ QAKKLNELRG
ELSEPRWSPD GKRLAFVVNR TDHSFIGLQE IGGTEVRYLA PSTNRDDLPR WSADGKQIAF
VRQPGVQAKL PLIPVRPHPW SIWVADASTA QGKEVWHSGD QPRDSFPMFI GTSFYFAGDR
IVFVSTRDNR NHLYSVPAAG GEATQLTQGN FEVEDVTLSK DQKWITYSSN EYTSDSKDED
RRHLWRVPAT GGQRQQLSSG ETIEWSPTLV GDKVICLGSS ATSPAMPYEV TGSSRRLIAA
DMLKDFPSNQ LVTPQQVIFD SDGLKIHGQL FVPKDGKSTH PALIFTHGGP VRQMMLGFHY
MDYYHNAYAM NQYLASKGYV VLSVNYRLGI MYGYDFLNPP NTVWRGAAEY NDVVAGAKYL
QSLSNVDKSK IGLWGGSYGG FLTAMGLARN SDIFSAGVDF HGVHDWSAFI GEWENNATAA
PDAKEAQKLA FDSSPEASIS TWKSPVLLIH GDDDRNVPFS QTTTLAEKLK NQGVEFEELI
FPDEIHGFLM FKSWIKAYSV EEQYFDKKLK