Gene Acid345_1491 details


Gene Information

Locus tag: Acid345_1491
Symbol:
ID: 4071661
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 1809398
End bp: 1811434
Gene Length: 2037 bp
Protein Length: 678 aa
Translation table: 11
GC content: 60%
IMG OID: 637983500
Product: PgPepO oligopeptidase
Protein accession: YP_590567
Protein GI: 94968519
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG3590] Predicted metalloendopeptidase
TIGRFAM ID:
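
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count of a CDS, assuming the last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the record above.
gene_len = feature_length(1809398, 1811434)
print(gene_len)                  # 2037
print(protein_length(gene_len))  # 678
```

Both results agree with the Gene Length and Protein Length fields listed above.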


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGCGTCTAC CAGCACTTCT GCTCTCCTGC TGTTTTCTCG TTAGCGTCGC TATCGCCCAA 
CAACCTACCG AACCCACCCT CCCGTACACC CCAGGTCTCG ATATCACTGC GATGGACAAA
TCCATCGATC CCTGCCAAGA CTTCTACACC TATTCCTGTG GCGGCTGGAT GAAAAAGAAC
CCCATCCCAC CCGACCAGAC CAGTTGGGGC GTTTACGGCA AGCTCTATGA GGACAACCTC
ACCTTCCTGC GCGAAATCCT CGTTCAGGAC GCACGCGAGA AAGACCGCAC TCCCGTCGCG
CAAAAGATCG GTGATTTCTA CGGCGCGTGC ATGGAAGAAG CCACCATCAA CGACGCCGGC
GCCAAGCCAA TGCAGGCCGA CCTCGACGCC GTTACCTCTC TCATGAGCAT TAAGGACCTT
CCGCCCACGC TCGCCAAGCT CCACATCGGC GGCGTAAGCG CGCTCTTTGG CGGCGGCTCC
ATGCAGGACC CCGATAACTC CGAGATGCAG ATCGTCGGCC TCGACCAAGG CGGCCTCGGC
TTGCCCGACC GCGACTACTA CCTCAACGAC GACGCCAAGT CCAAGGCAGA TCGCGCCAAG
TACCTCGAGC ACGTTCAGAA AATGTTCGAG CTCCTCGGCG ACAGTCCTGA CAAGGCGAAG
GCAGAATCGG CCGTCGTGAT GAAGATCGAA ACCGAGTTGG CAAAGCACTC GCTTACTCGC
GTCGATCGCC GCGATCCTTA CAAGGTCAAA AACAAGATGA GCCCGGCGGA ACTCGCGAAG
CTCTCGCCAA ACTTCGACTG GGCCGCATAC TTCAGCGCCT CCGGCCTGCC GAAGATGGAC
GTCCTGAACC TCGGCACCAA GGACTTCTTC AAAGACGTTA GCGACCAGAT GAAGTCCGTC
AGCCTCGCCG ACTGGAAAAC CTACCTCCGT TTCCACGTCG CGAACTCGCG CTCGCCCTAT
CTCTCCAAGC CGTTCGTGGA CGAGAACTTC GCCTTCTACC GTGCCTACCT GCGCGGCGCC
AAAGAGCAGC AGCCGCGCTG GAAGCGCTGC GTCGAATGGA CCGACATGCT CCTCGGCGAA
GCCCTCGGCC AGGAATACGT GAAGCGCACC TTCTCGCCGG AACTCAAAGA GTCCACGGTC
GATATGACGC GCCGTATCGA AGACGCCATG GCCGTCCGCA TCCAGCAACT CGATTGGATG
AGCCCGAAAA CGAAAGAACA GGCGATGGTC AAGCTGAAGT CCATCCGCAA CAAGATCGGC
TATCCCGACA AGTGGCGCGA CTATAGCTCC GTGGACATCA AGCCCCTCGA CTTCTACGGC
AACGTCTCGC GCGCCATCGC GTTTGAGTCC CATCGCGACT GGAACAAGGT CGGCAAGCCG
GTAGACCGCG GCGAGTGGGG CATGACGCCG CCCACCGTCA ACGCCTACTA CAATCCGCAG
ATGAACGACA TCAACTTCCC GGCCGGCGTT CTACAGCCGC CACTCTACGA CGCCAAGATG
GATGATGCCC CGAACTACGG CAACACCGGC GGCACCATCG GCCACGAACT GACCCATGGC
TTCGACGACG AAGGCCGTCA GTTCGACGCC CAGGGCAACC TCAAAGATTG GTGGACGAAG
CAGGACGCCG ATGAGTTCGT GAAGCGCGCC AACTGCGTTG TGGACCAGTA CGCGACCTAC
GTTGTCGTCG ACGACATCCA CATCAACTCC AAGCTGACGG AAGGCGAAGA CGTTGCCGAC
CTCGGCGGCG AAATCCTCGC CTACGTCGCG TGGAAGGACA AAACGAAGGA CATGAAGCTG
GAAGATCGTG ACGGCCTTAC GCCTGACCAG CGCTTCTTCG TCGGCTACGC GCAGTGGGTC
TGCGAAAACG ATCGTCCGGA GAACCTGCGC GTCCACGCCA AAACCGATCC GCACTCACCG
GGTAAGTACC GCATCAACGG CGTGGTCGTA AACATGCCGG AGTTCGGCAA AGCGTTCGCC
TGCAAGGCTG ACGCACCGAT GGTGAAAGCC GCTGACAAAG TCTGTCACGT CTGGTAG
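
The gene sequence is printed in blocks of ten bases, so the spacing has to be removed before any computation. As a rough sketch of how the GC content field could be recomputed from such a listing (the helper names `clean_sequence` and `gc_percent` are hypothetical, and the database's own rounding is an assumption not confirmed by the record):

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned DNA sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence listing, as an example input.
first_row = "TTGCGTCTAC CAGCACTTCT GCTCTCCTGC TGTTTTCTCG TTAGCGTCGC TATCGCCCAA"
row = clean_sequence(first_row)
print(len(row))  # 60 bases per full row
print(round(gc_percent(row), 1))
```

Passing the whole listing through `clean_sequence` first and then calling `gc_percent` would give the value to compare against the GC content field.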
 
Protein sequence
MRLPALLLSC CFLVSVAIAQ QPTEPTLPYT PGLDITAMDK SIDPCQDFYT YSCGGWMKKN 
PIPPDQTSWG VYGKLYEDNL TFLREILVQD AREKDRTPVA QKIGDFYGAC MEEATINDAG
AKPMQADLDA VTSLMSIKDL PPTLAKLHIG GVSALFGGGS MQDPDNSEMQ IVGLDQGGLG
LPDRDYYLND DAKSKADRAK YLEHVQKMFE LLGDSPDKAK AESAVVMKIE TELAKHSLTR
VDRRDPYKVK NKMSPAELAK LSPNFDWAAY FSASGLPKMD VLNLGTKDFF KDVSDQMKSV
SLADWKTYLR FHVANSRSPY LSKPFVDENF AFYRAYLRGA KEQQPRWKRC VEWTDMLLGE
ALGQEYVKRT FSPELKESTV DMTRRIEDAM AVRIQQLDWM SPKTKEQAMV KLKSIRNKIG
YPDKWRDYSS VDIKPLDFYG NVSRAIAFES HRDWNKVGKP VDRGEWGMTP PTVNAYYNPQ
MNDINFPAGV LQPPLYDAKM DDAPNYGNTG GTIGHELTHG FDDEGRQFDA QGNLKDWWTK
QDADEFVKRA NCVVDQYATY VVVDDIHINS KLTEGEDVAD LGGEILAYVA WKDKTKDMKL
EDRDGLTPDQ RFFVGYAQWV CENDRPENLR VHAKTDPHSP GKYRINGVVV NMPEFGKAFA
CKADAPMVKA ADKVCHVW
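
The gene begins with TTG, yet the protein begins with M. That is consistent with translation table 11 (the bacterial, archaeal and plant plastid code), in which TTG is one of the alternative initiation codons and an initiation codon is read as methionine. The sketch below is a hand-rolled illustration of that rule, not the pipeline that produced this record; `translate_cds` is a hypothetical name, and it assumes an unambiguous, in-frame DNA coding sequence.

```python
# Standard amino acid assignments (shared by tables 1 and 11), in NCBI
# order: first, second and third base each cycle through T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Initiation codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(seq: str) -> str:
    """Translate a CDS, reading a table 11 start codon as M, up to the first stop."""
    seq = "".join(seq.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("TTGCGTCTAC CAGCACTTCT GCTCTCCTGC"))  # MRLPALLLSC
```

The output matches the first ten residues of the protein sequence listed above.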