Gene Acid345_1078 details


Gene Information

Locus tag: Acid345_1078
Symbol:
ID: 4070038
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 1352782
End bp: 1354833
Gene Length: 2052 bp
Protein Length: 683 aa
Translation table: 11
GC content: 60%
IMG OID: 637983087
Product: PgPepO oligopeptidase
Protein accession: YP_590155
Protein GI: 94968107
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG3590] Predicted metalloendopeptidase
TIGRFAM ID:
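
The start, end, and length values listed above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. Neither assumption is stated in the record, although both agree with the listed numbers. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the last codon of the CDS is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1352782, 1354833)
print(length)                  # 2052
print(protein_length(length))  # 683
```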


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00182798
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0175164
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACTTCA AAGCTGCACT TCTTACGTGC TCTCTGCTAG CTGCTCCTCT CATCGCTCAA 
AACACTCTAC CGCCCGAGTC CCACGGAATC GCGATTTCGC ACATGGATAC TTCCGTTGTT
CCGGGCGACG ACTTCTACGA GTACAGCAAC GGCGGTTGGC TGAAGGCGAC CACTATTCCT
GCCGACCGCG GCGGGGTTGG TGTGTTCAGC GTTCTGCGCG ATCTGAGCGA CAAGCGCACC
TCTTCGCTCA TTGAAGAGAT GTCGAAGTCG AAAGCGGCGC CCGGCTCCAA CCAACGCAAG
GTAGCCGATC TTTACAACTC TTACATGAAT GAAGCGGCGA TCGAGAAGAC GGGCGTCGCT
CCGCTGAAGT CGCACCTCGA CGCCATCGCA GCGATCACAG ACAAAAAGAC GCTCGCGCAC
GCTCTCGGCG AAACGCTCCG CGCCGACGTC GACGCGCTCA ACAACACCAA CTTTCACACC
CCGAATCTCT TCGGCCTCTG GGTAGCGCCG GGTTTCAACG ATCCCGACCA CTATGCGCCC
TACCTGCTGC AAGGTGGACT TGGACTGCCC GATCGCGAGT ACTACCTCTC GGACTCCAAG
CAATTGACCG AGGTCCGCGA GAAGTACGTA GTACACGTCG CGGCGATGTT GAAGCTCGCC
GGCTTCGACG ACGTCGATAC CCGCGCCAAA CGGATTCTCG CCCTGGAACA CGCGATCGCC
GAAAAGCATC TCTCGCTCGC CGCGAACGAC GACATCCACA AGGCCAACAA CACCTGGACA
AAAGCCGATT TCGCATCCAA AGCTCCGGGC CTCGACTGGA CTGAGTACTT CCGCGGCGCC
GGGCTTGAGA ACCAGAAAGA TTTCATCGTC TGGCAGCCGA CCGCCTTTAC TGGTGAAGCC
GCACTCGTGC AGTCGGAATC CCTCGATGCG TGGAAAGACT GGCTCGCCTA CCATTTGATC
GAAAACTACG CCGGCGTTCT TCCCAAAGCC TTTGCTGACG AGCGCTTCGC CTTCTTCGGC
GGCACGCTGC AGGGCACCAC CCAGCAGCGT CCGCGTTGGG CCCGCGGTGT GAACGTCGTG
AACGGTCTGC TCGGGGACGC CGTCGGTCAG GAATATGCGC GGCGCTATTT CCCGCCGGAA
GCCAAGCAGC AAGCGCAGGC AATGGTTGCC AACATTATCG AGTCCTTCCG CCAACGCGTT
CGCAACCTGT CCTGGATGGC GGAGTCTACG AAGAAGGAAG CCGAAGCCAA GCTCGACACG
CTCTACGTTG GCATCGGTTA TCCGGAAACA TGGAAAGACT ACTCGTCTTA CGATGTGAAA
GCCGACGACA TTTTCGGCAA CATCTGGCGT GGCAGCATCT TCGATTATCA ATACGACCGC
GCGAAGGTTG GCAAGCCCGT TGACCACCAC TGGTGGTCCA TGACGCCCCA AACCGTCAAC
GCCGTCAACC TGCCGCTCCA AAATGGGCTG AATTTCCCCG CAGCGATTCT GCAGCCGCCC
TTCTTCGACC CGCAAGCGCC GGCCGCTGCC AACTACGGCG CCATCGGCAC CGTCATTGGT
CACGAGATCA GCCACACCTT CGATACCGAG GGTTCTGCCT TCGATTCCAA GGGCCGCGTG
CGCGACTGGT GGACTCCCGC CGACTTCGAG CACTTCAAGA AGGCGACCAA CGCTCTTGCC
GAGCAATACA GCACCTATAA GCCGTTCCCC GATCTATCGC TAAACGGCGA ACAGACCTTG
GCGGAGAACA TCGCGGACGT TGCCGGCATC GCCGCCGCCT TCGACGGCTA TCATGCCTCG
TTGAAGGGCG CCGCCGGGCC AACGCAGAAC GGATTCACCG CCGACCAGCA GTTCTTCATC
GCCTTCGGGC AGAACTGGGG CTCGAAGGCA CGTGAAAATG CGCTTCGCCA GCAGGTGCTC
ACGGATCCCC ATTCCCCCGG CCAATACCGC GCACTCACCG TTCGTAACAT CGACGCGTGG
TATCCCGCTT TCAAAGTGAA GCCTGGTGAA AAGTTGTTCC TCACACCCGA GGAGCGTGTC
CGCATCTGGT AG
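
The gene sequence is printed in space-separated blocks of 10 bases, so the whitespace has to be removed before the length or GC content can be computed. The following is a minimal sketch of that step. The helper names are hypothetical, and only the first row of the sequence is used for illustration; a single row is not expected to reproduce the gene-wide GC content listed in the record.

```python
def clean_sequence(text: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


# First row of the gene sequence, copied from the record.
first_row = "ATGAACTTCA AAGCTGCACT TCTTACGTGC TCTCTGCTAG CTGCTCCTCT CATCGCTCAA"
seq = clean_sequence(first_row)
print(len(seq))                    # 60
print(f"{gc_fraction(seq):.1%}")   # GC content of this row only
```

Passing the full block of sequence lines to `clean_sequence` gives a string whose length and GC fraction can be compared with the listed gene length and GC content.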
 
Protein sequence
MNFKAALLTC SLLAAPLIAQ NTLPPESHGI AISHMDTSVV PGDDFYEYSN GGWLKATTIP 
ADRGGVGVFS VLRDLSDKRT SSLIEEMSKS KAAPGSNQRK VADLYNSYMN EAAIEKTGVA
PLKSHLDAIA AITDKKTLAH ALGETLRADV DALNNTNFHT PNLFGLWVAP GFNDPDHYAP
YLLQGGLGLP DREYYLSDSK QLTEVREKYV VHVAAMLKLA GFDDVDTRAK RILALEHAIA
EKHLSLAAND DIHKANNTWT KADFASKAPG LDWTEYFRGA GLENQKDFIV WQPTAFTGEA
ALVQSESLDA WKDWLAYHLI ENYAGVLPKA FADERFAFFG GTLQGTTQQR PRWARGVNVV
NGLLGDAVGQ EYARRYFPPE AKQQAQAMVA NIIESFRQRV RNLSWMAEST KKEAEAKLDT
LYVGIGYPET WKDYSSYDVK ADDIFGNIWR GSIFDYQYDR AKVGKPVDHH WWSMTPQTVN
AVNLPLQNGL NFPAAILQPP FFDPQAPAAA NYGAIGTVIG HEISHTFDTE GSAFDSKGRV
RDWWTPADFE HFKKATNALA EQYSTYKPFP DLSLNGEQTL AENIADVAGI AAAFDGYHAS
LKGAAGPTQN GFTADQQFFI AFGQNWGSKA RENALRQQVL TDPHSPGQYR ALTVRNIDAW
YPAFKVKPGE KLFLTPEERV RIW
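
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table from the standard amino-acid assignments, which translation table 11 shares with table 1 (the two differ in which codons may act as start codons). It handles only a sequence that begins with ATG, as this gene does, and makes no attempt to treat alternative start codons specially. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate a DNA sequence (whitespace allowed), dropping trailing stops."""
    seq = "".join(text.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))
    return protein.rstrip("*")


# First row of the gene sequence, copied from the record.
first_row = "ATGAACTTCA AAGCTGCACT TCTTACGTGC TCTCTGCTAG CTGCTCCTCT CATCGCTCAA"
print(translate(first_row))  # MNFKAALLTCSLLAAPLIAQ
```

The output matches the first two blocks of the protein sequence listed above (MNFKAALLTC SLLAAPLIAQ). The same function can be applied to the full gene sequence for a complete comparison.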