Gene Acid345_1815 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1815 
Symbol 
ID4070502 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2193851 
End bp2195812 
Gene Length1962 bp 
Protein Length653 aa 
Translation table11 
GC content58% 
IMG OID637983824 
Productpeptidase S9, prolyl oligopeptidase 
Protein accessionYP_590890 
Protein GI94968842 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCAGCGTC ATTTCTTTAC CGCTCTAGCA CTCGTTGCCC TCTCGTTCGG CGCGCTCGCG 
CAGCCAGCTT CCAATCAGAA CGACACCGTG AAGCCCGGGG ACAATCTCGT CCTCGAGAAT
ATTCCGCCGG TTCCAACTTC GATTGATGAG CAGACACGTC GCTACGGAGA CTTTCGCACG
GCAGCGCATT TGGATTGGCA TCCGCAGCGG CACGAGATGT TGATCTCCAC GCGTTTCGCG
GATGTGCCGC AGGTGCATTG GGTGAAGACG CCAGGCGGGG AGCGACGGCA GATGACGTTT
TATCCGGACC GCGTGCTCGG TGCCCAATTT GAGCCCGGCG GACGTTACTT TGTCTTCGCG
AAAGATGTTG GCGGTGGCGA GTGGTTCCAG TTCTATCGGT TCGATGTGCA AAGTGGCGAG
ATCACGCTGC TGACGGACGG GAAGTCGCGA AACACAGGCA TGGTCTTCGC GCATGAGGGC
ACGCGGATTG CGTACTCCTC GACGCGACGC ACCGGGGCTG ACAACGATTT GTGGGTGATG
GATACGGCAG ATCCGAAGAC GGACAAGCTG CTCACGCAGT TGCAGGGCGG CGGCTGGGAC
CCGGCAGATT GGTCGCATGA CGGAAAGCAG ATCCTGCTGG TGGAAGGGAA GTCCATCAAC
GAGAGCAATC TTTGGCTGGT GGATGTTGCC AGTGGCGAGA AGAAGCAAAT CACGGCCAAC
CAGACGTCAG AAAATCAGGT GAGCTACGAC AATGCGCGGT TCTCGCGCGA TGGGCGCGGG
ATTTATGTGT CCACCGATTT CGACTCCGAG TTCCACCGGC TAGCGTATTT CGATCTGGCG
ACGATGAAGC CGAAGTTCCT TACGACAGAC ATCAAGTGGG ACGTCGAGGA ATTCGATGTC
AGCGAGGACG GCAAGACGCT CGCGTTCGTG ACGAACGAGA ATGGGATCGG CAAGCTGTAC
CTGATGGACA CGGCCACAGG GAAGTATCGG TCGGTGACGG CATTGCCGAT TGCGATCATC
ACCGGAGCAA AGCTGCATCG CGATGGCCGC GTCGTCGCGG TGAATGTGAC CTCGGCAAAG
TCGCCGACCG ATGTGTACTC GGTAGACGTC GCCAGCAGAA AAGTTGAGCG CTGGACGGAG
AGCGAAACCG GCGGGCTGAA CGCCAGCAAT TTCGTTGAAG CACAGTTGGT GAAGTGGAAG
ACTTTTGATG GAAAAGAGAT CAGCGGTTTT CTCTACCAGC CGGATGCGAA GAAGTTTCCG
GGCAAGCGGC CGGTGATCAT CAATATCCAT GGCGGACCGG AGGGACAGAT ACGGCCGGGC
TATCTCGGGC GCAACAACTA TTACCTCAAC GAAATGGGAG TGGCGCTGAT CTTCCCCAAC
GTGCGGGGAT CCACCGGATA CGGGAAGACG TTCACAAAGC TGGACAACGG TTTCAAGCGC
GAGGACACAT ATAAAGACAT CGGCGCACTG TTTGATTGGA TCGGCACGCA GCCAGCACTC
GACGCCAGCC ACGTGATGGT GACCGGCGGC AGCTACGGCG GACACATGAC TTGGGCGGTG
ACGGCGTACT ACAACGACAA GATCCGATGT TCGCTACCGA TCGTGGGGAT GTCGAACCTG
GTGACGTTCC TTGAGCACAC GGAAGCGTAT CGGCGTGATT TGCGGCGGGT GGAGTATGGC
GATGAGCGGG ATCCTAAGAT GCACGAGTAT CTCGAGAAGA TTGCGCCGAT GAACCATCTC
GACGCTATGC ACAAGCCGAT CTTCGCGGTG GTGGGGAAGA ACGATCCGCG GGTGCCGTGG
ACGGAGTCGC GGCAGATTCT CGACAAACTC AACGCCCAGG GAACGCCAAC GTGGTTCCTG
ATGGCGAACG ACGAAGGCCA CGGCTACGCG AAGAAGAAGA ACCAGGATTT TCAGTTCGAC
GCGACAGTGA TGTTTGTGAA TAGCTGTCTG CTGGAGAAAT AG
 
Protein sequence
MQRHFFTALA LVALSFGALA QPASNQNDTV KPGDNLVLEN IPPVPTSIDE QTRRYGDFRT 
AAHLDWHPQR HEMLISTRFA DVPQVHWVKT PGGERRQMTF YPDRVLGAQF EPGGRYFVFA
KDVGGGEWFQ FYRFDVQSGE ITLLTDGKSR NTGMVFAHEG TRIAYSSTRR TGADNDLWVM
DTADPKTDKL LTQLQGGGWD PADWSHDGKQ ILLVEGKSIN ESNLWLVDVA SGEKKQITAN
QTSENQVSYD NARFSRDGRG IYVSTDFDSE FHRLAYFDLA TMKPKFLTTD IKWDVEEFDV
SEDGKTLAFV TNENGIGKLY LMDTATGKYR SVTALPIAII TGAKLHRDGR VVAVNVTSAK
SPTDVYSVDV ASRKVERWTE SETGGLNASN FVEAQLVKWK TFDGKEISGF LYQPDAKKFP
GKRPVIINIH GGPEGQIRPG YLGRNNYYLN EMGVALIFPN VRGSTGYGKT FTKLDNGFKR
EDTYKDIGAL FDWIGTQPAL DASHVMVTGG SYGGHMTWAV TAYYNDKIRC SLPIVGMSNL
VTFLEHTEAY RRDLRRVEYG DERDPKMHEY LEKIAPMNHL DAMHKPIFAV VGKNDPRVPW
TESRQILDKL NAQGTPTWFL MANDEGHGYA KKKNQDFQFD ATVMFVNSCL LEK