Gene Acid345_4515 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_4515 
Symbol 
ID4070193 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp5354900 
End bp5356957 
Gene Length2058 bp 
Protein Length685 aa 
Translation table11 
GC content58% 
IMG OID637986554 
ProductPgPepO oligopeptidase 
Protein accessionYP_593589 
Protein GI94971541 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG3590] Predicted metalloendopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.857143 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.954997 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAAAT TCCTATCGGT CGCGACCTCT GCCGTCCTGG CGGTTTCCAT GGGCTTCGCC 
CAGAAACCAG CCAACGATCA GCAGGCCGAG AAGACCGCGG CCAAGAAGGG CACGAAGGGC
TTCGATATCA ACGCCCTCGA TCGCAGCACT GATCCTTGTA CCGACTTCTA CCAGTTCGCA
TGCGGTAGCT GGATCAAGAA CAACCCGATT CCTTCCGACC AAGCCCGTTG GGGACGCTTC
TCAGAACTCC TCGAGCGCAA TCAGATGATC CTGCGCGACA TCCTCGAGAA ACAGCGTGCA
GCCAATGCCA ACCGTGACGC GATCGACCAG AAAATCGGCG ATTACTACGA CGCTTGCATG
GACGAAAAGG GCATCGACGC CAAAGGCCTC GATCCTCTGA AGTCGACACT CGACAGCATC
GCCGCGGTGA AAGACAAATC AGAATTGCCC GCGCTCCTTG GCAAACTGCA CCGAACCGGC
CTCGAGCCGC TGTTCGGGTT CGGTCCGGAG CCGGATTTCA AGAACGCGAA AATGATGGGC
GCTTCGGTGG ACCAGGGTGG CCTCGGCCTG CCGGAAAAAG ACTACTACTC GCGTGACGAC
GCCAAGTCCG TCGAACTTCG CAAGGCCTAT GTTGAACACA TCACCAACAT GTTCAAACTC
GCCGGTGAGT CTGCAGACCA GGCCGCGAAA GACGCGCAGA CGGTGATGAC CTTCGAGACC
ACGCTGGCGA AGAACTCTAT GAGCGTAGTC GAACGTCGCG ACGTGCAGAA GCTATACAAC
CCCAAGACGA AGGCGGAATT CATCGCGTTG ACGCCGGCTT TCGATTGGAA CAAATACCTC
GTCGCACTCG ATGCCCCGTC GTTCGAAAAG ATCAACCTCG ACTCGCCCAA TTACATTGCG
AAGCTGAACG AGGTTGTCCA AAGCAACTCG CTCGACGACA TCAAGACGTA CCTTCGTTGG
CAGACTTTGC ACGGCGCTGC CCGCGCGCTC CCAACGCCCT TCGTGAACGA GAATTTCTCG
TTCTATGGGA AGACGCTCAC CGGTGCGAAA GAGATCCGTC CGCGCTGGAA ACGTTGCGTA
CAGTTCACGG ACAATCAACT CGGTGAGGCG CTAGGCCAGG CATATGTAAA AGTCGCTTTC
CCGCCCGACG CAAAGGACCG CATGGAAAAG ATGGTCCACA ATCTCGAGGC TTCGATGAAG
ACCGATATCG AAGGCCTCGA TTGGATGACG GCAGAAACCA AGAAGGCCGC CATCGTCAAA
CTTTCGATGA TCAATGACAA GATCGGCTAT CCCGACAAGT GGCGTGACTA CAGCAGGTAC
AACGTTGTTC GTGGCGACTT CCTGGGCAAC ACCATGCGCG GCAACGAGTT CGAAACGCAA
CGTCAGCTCG ACAAGATCAA CAAGCCCGTG GATCGCACCG AGTGGGGCAT GACGCCGCCG
ACCGTGAACG CCTATTACAA CCCGCAGGAA AACAATATCA ACTTCCCGGC GGGCATTTTG
CAGCCCCCGT TCTTCGACAA TAAGCTCGAT GATGGCGTGA ACTACGGCGC GATCGGTGCT
GTGATCGGCC ACGAAATGAC CCACGGGTTC GATGACGAAG GCCGTGAATT CGACGGCAGC
GGCGATCTCC GCAACTGGTG GACCGAAGCT GACGGCAAAG CCTTTGAGCA GCGCGCCCAG
TGTTTGGTGG ACGAGTATGA CAGCTTCATC GCCACCGACG ACGTGCACGT TCGCGGCAAG
CTGACCCTCG GCGAAAACAC TGCCGACAAC GGCGGTCTCC GCGTCGCGCT AATGGCGTTG
GAATCCACGT TCAATGGCAA GGAGCCAGCG AAGATTGACG GCTTCACGGC CCAGCAGCGC
GCGTTCCTTG GCTTCGCGCA AGTGTGGTGC GAGAACCAGA CTCCACAAGC CCTTCGCGTG
CAGGCCCAGA CCAACCCGCA CTCTCCTGGC AAGTGGCGTA CCAACGGTGT CATGCGCAAC
ATGCCCGAGT TCCGCAAGGC GTTCGGTTGC AAGGAAGACG CGCCCATGGC GCCGACAAAC
GCATGCCGCG TCTGGTAA
 
Protein sequence
MKKFLSVATS AVLAVSMGFA QKPANDQQAE KTAAKKGTKG FDINALDRST DPCTDFYQFA 
CGSWIKNNPI PSDQARWGRF SELLERNQMI LRDILEKQRA ANANRDAIDQ KIGDYYDACM
DEKGIDAKGL DPLKSTLDSI AAVKDKSELP ALLGKLHRTG LEPLFGFGPE PDFKNAKMMG
ASVDQGGLGL PEKDYYSRDD AKSVELRKAY VEHITNMFKL AGESADQAAK DAQTVMTFET
TLAKNSMSVV ERRDVQKLYN PKTKAEFIAL TPAFDWNKYL VALDAPSFEK INLDSPNYIA
KLNEVVQSNS LDDIKTYLRW QTLHGAARAL PTPFVNENFS FYGKTLTGAK EIRPRWKRCV
QFTDNQLGEA LGQAYVKVAF PPDAKDRMEK MVHNLEASMK TDIEGLDWMT AETKKAAIVK
LSMINDKIGY PDKWRDYSRY NVVRGDFLGN TMRGNEFETQ RQLDKINKPV DRTEWGMTPP
TVNAYYNPQE NNINFPAGIL QPPFFDNKLD DGVNYGAIGA VIGHEMTHGF DDEGREFDGS
GDLRNWWTEA DGKAFEQRAQ CLVDEYDSFI ATDDVHVRGK LTLGENTADN GGLRVALMAL
ESTFNGKEPA KIDGFTAQQR AFLGFAQVWC ENQTPQALRV QAQTNPHSPG KWRTNGVMRN
MPEFRKAFGC KEDAPMAPTN ACRVW