Gene Acid345_2466 details

Gene Information

Locus tag: Acid345_2466
Symbol:
ID: 4072090
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 2914278
End bp: 2916455
Gene Length: 2178 bp
Protein Length: 725 aa
Translation table: 11
GC content: 63%
IMG OID: 637984483
Product: glutamate carboxypeptidase II
Protein accession: YP_591541
Protein GI: 94969493
COG category: [R] General function prediction only
COG ID: [COG2234] Predicted aminopeptidases
TIGRFAM ID:
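
The coordinates and lengths listed above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a stop codon that is not counted in the protein length; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2914278
end_bp = 2916455
gene_length_bp = 2178
protein_length_aa = 725


def span_length(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residue count, assuming the final codon of the CDS is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for this record under the stated assumptions.
assert span_length(start_bp, end_bp) == gene_length_bp
assert expected_protein_length(gene_length_bp) == protein_length_aa
```

Under these assumptions, 2916455 - 2914278 + 1 gives 2178 bp, and 2178 / 3 - 1 gives 725 aa, which agrees with the listed values.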


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0461903
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.618125
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGACCGC TCGTTTCGCT CATCGTGTTC TGGACGCTGC TCATCGCTCC CTTCCTTGGG 
ACGGAGCTCG GTGACCCGAC GAAGTCTGTC TTCGAGCCGC CTAACGCGAT CGCCGGATTC
GAGAAGCCCA TAGCGCAACT CGCACTCGAG AAGAAATTCC TCGCCGTCCC CGATCCCAAG
CACGCCGAGC AAGACATGCG CGCTCTCACC GCGACGCCTC ATCTCGCCAC CACTCCCGAA
GATCGCAAAA CCGCCGACTA CGTCCTGCAA AAGTTCAAAG AAGCCGGGCT GCAAGCTTCT
ATTGAGGAAT ACAAGGTTTA CTTCGGAATC CCCGAGTCCG TCAGCATTGA CATCGTTGCT
CCTGAAGGCG TCGTCCTTCA CGGCCCCTCA CCTGAGCGCG TGGACGGCGA CGATGGCTCC
AAAGACTCCC GCATTCTTCC GGCCTTCAAC GGCTACACGC CTTCAGCCGA TGTCACTGCC
GAAGTCTTGT ACGCGAATTA CGGCCGCCCC GAAGACTTCG ACACCCTCGC GCAAGCCGGC
GTAGATGTAA AAGGGAAGCT CGTCATCATT CGCTACGGCG AAATCTTCCG CGGCGTAAAA
GTTCTGCAGG CGCAGCAGCG CGGCGCCGCC GGCGTCATCC TCTATTCCGA TCCCATGGAC
GACGGCTACT TCAAGGGTGA CGTCTATCCC AAGGGCCCGT ACCGTCCCTC CAGTGCCGTG
CAGCGTGGCG ATGTCCACTT TATGTTCCGC TATCCCGGCG ATCCCACCAC ACCCGGAGCG
CCTTCCAGCG CCACTGCCGA ACGCACCGAA GCCGCGCAAT CCGCCAGCCT GCCCAAGATC
CCGGTTCTCC CGCTCTCTTA CGCCGACGCC TCACCGATTC TCCAGCACCT CGCTGGCCCC
GAGTCTCCGC GCTCGTGGCA AGGCGCGTTG CCGTTCACAT ATCACCTCGG TGTTGGCCCG
GTGAAGGTCC ACCTCAAGAT CGCTGTCCAC TACGAGTACC GCACCATCTG GAACGTCATC
GGCACCGTCA AGGGCGCCGA GTATCCCAGC GACATCGTCC TGTCCGGTAA TCATCGTGAC
GCCTGGGTCT TCGGCGCCGC CGATCCCGGC AGTGGCACCG TCGCCCAGCT TGAGGCCGTG
CGCGGCATCG GCCAGCTTCT AAAAGCAGGC TGGCGTCCCA AGCGCACTAT TATCTTCGCC
AGTTGGGACG CCGAAGAGCA GGGAATGGTC GGCTCCACTG AATGGGTCGA GCAACACGCG
CAGGAACTCA GCGGGGCCGT CGCCTACTTC AACATGGACG TCGCGGTCAC CGGCCCCAAC
TTCGCCGCCG CGTCGGTGCC CAGTCTCAAG CAGTTCATGC GCGATGTCGC GAAGTCCGTT
CCCAGCCCGC AAGGCGGCAG CGTCTACGAC GCCTGGACCG AGCGCACTTC CGGCAAATCG
CCGCAGCGCA ACGAAGTCTT TCCCGATGTC AACGGCTCCG CACGCCACTC TTCCGCCGCG
CCCCCGCATC AGGATGTCGC CGTCGGCGAT CTCGGCAGCG GCTCCGACTA CACCCCGTTC
CTCGAGCACA TCGGCGTCGC CTCCGCCGAC ATGGGTAGCC ACGGTCCCTA CGGCGTCTAC
CACTCCGCCT TCGACGACTA CACCTGGTTC ACCAAGTTCG CCGATCCAAA GTTCGCCTAC
GAGCAGCAAA TGGCGCGACT CCACGGCCTG CAAGTCCTCC GCATGGCCGA CGCCGATGTT
CTCCCCTTCG ATTACGAAGA TTACGGTCAG GAAATTGAAG CCTACATCGA ATACACGAAA
CAACGTGCCG CCGAATCCTT CCCCGAAGGC GGCCCAAAAT TCGATGAGCT GACCAAAGCC
TCCAAGCGCC TGCAATCCGC GGGGAGCATG CTGCTAGGCG CGGTGAAAGC CGGTCGCGCC
GCCAGCCCCG CCCGCATCAA CACCGCGTTG CGCGATGCCG AACGCGCCTT CCTGACCAAC
GGCCTCCCCA GCCGCCCCTG GTTCCGTCAC GCCATCTACG CCCCCGGCGA GTCCACCGGC
TACGAAGCCA TCGTCCTCCC CGGCATCACC GAAGCCATCG AGCACCACGA CCAGCAAGCC
CTAGTCGAAC AGATCCAACT CACCACCGCC GCCATCAACC ACGCGTCAGA AATCCTGGAA
TCCGCCCACA GCTTCTGA
 
Protein sequence
MRPLVSLIVF WTLLIAPFLG TELGDPTKSV FEPPNAIAGF EKPIAQLALE KKFLAVPDPK 
HAEQDMRALT ATPHLATTPE DRKTADYVLQ KFKEAGLQAS IEEYKVYFGI PESVSIDIVA
PEGVVLHGPS PERVDGDDGS KDSRILPAFN GYTPSADVTA EVLYANYGRP EDFDTLAQAG
VDVKGKLVII RYGEIFRGVK VLQAQQRGAA GVILYSDPMD DGYFKGDVYP KGPYRPSSAV
QRGDVHFMFR YPGDPTTPGA PSSATAERTE AAQSASLPKI PVLPLSYADA SPILQHLAGP
ESPRSWQGAL PFTYHLGVGP VKVHLKIAVH YEYRTIWNVI GTVKGAEYPS DIVLSGNHRD
AWVFGAADPG SGTVAQLEAV RGIGQLLKAG WRPKRTIIFA SWDAEEQGMV GSTEWVEQHA
QELSGAVAYF NMDVAVTGPN FAAASVPSLK QFMRDVAKSV PSPQGGSVYD AWTERTSGKS
PQRNEVFPDV NGSARHSSAA PPHQDVAVGD LGSGSDYTPF LEHIGVASAD MGSHGPYGVY
HSAFDDYTWF TKFADPKFAY EQQMARLHGL QVLRMADADV LPFDYEDYGQ EIEAYIEYTK
QRAAESFPEG GPKFDELTKA SKRLQSAGSM LLGAVKAGRA ASPARINTAL RDAERAFLTN
GLPSRPWFRH AIYAPGESTG YEAIVLPGIT EAIEHHDQQA LVEQIQLTTA AINHASEILE
SAHSF
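
The gene and protein sequences above are printed in blocks of ten characters. The sketch below shows one way to strip that layout, compute a GC fraction, and translate the coding sequence. It is an illustration only: the helper names are hypothetical, and the codon table holds the amino-acid assignments of translation table 11, which match the standard code. Alternative start codons, which table 11 permits, are not handled; this gene starts with ATG, so that does not matter here.

```python
# Codon table built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq_text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq_text.split()).upper()


def gc_fraction(dna):
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
first_blocks = "ATGAGACCGC TCGTTTCGCT CATCGTGTTC"
print(translate(first_blocks))  # prints MRPLVSLIVF
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the listed GC content and protein sequence to be compared in the same way.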