Gene Acid345_4003 details


Gene Information

Locus tag: Acid345_4003
Symbol:
ID: 4071139
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 4729802
End bp: 4731487
Gene Length: 1686 bp
Protein Length: 561 aa
Translation table: 11
GC content: 59%
IMG OID: 637986030
Product: peptidase S10, serine carboxypeptidase
Protein accession: YP_593077
Protein GI: 94971029
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2939] Carboxypeptidase C (cathepsin A)
TIGRFAM ID:
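
The gene length and protein length listed above follow from the start and end coordinates. The sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive and that the coding sequence ends with a single stop codon; the function names `gene_length` and `protein_length` are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(4729802, 4731487)
print(length, protein_length(length))  # 1686 561
```

Both results agree with the Gene Length and Protein Length fields of this record.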


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.927019
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGCAAT CCTGGCTGAA CGCCCTACTG TGCTGCGCGA TTTTGGGCGC AACCGCAGGG 
GCACAGCAGA AGAATCCTCC GAAACCTCAA GTTGAAAAGA GTTCGCCCAG CGAAAACATC
GCAAACCGAC CGGCGAACGC GCCGGAAGAA CCGCGCGCTC AGCGCGAAGA ACGACGGCAG
GAAGACGCTC CGCAACAACC ACGCGGTGAG GGGCAGAAGC CGTCGATGAA GTGGGACATG
ACGGAGACAG CTCCGGTGGT GACCCACCAC GAAATCAACG TAAACGGCCG GGCTCTGCGA
TACACCGCCA CGGTCGGACG GTTGCCGATT AAAGACCTCA CTGGCACCAC CGAAGCGCTG
ATGTTCTACG TCGCTTACAC GCTCGATGGA CAGGATGCGA CGAAGCGTCC AGTCACGTTT
GCGTTTAATG GCGGTCCGGG ATCGGCGTCG ATTTGGCTGC ACATGGGAGC GCTCGGCCCG
CGCCGCGTCG CGCTGCAGCA GGACGGCATG ATGCCGCCGT CGCCGTATCA CCTCATCGAC
AACCCCGGCA CGCCGCTCGA AAAGACCGAC CTGGTATTGA TTGACGCCAT CGGCACCGGC
TTCAGCCGCC CCGCGGACCT GGAAAAGGGC AAGAAGTTCT GGAGCGTGAA GGGCGACATC
GAAGCGTTTG GCGAATTCAT TCGCCTCTAC ATCACGCGGA ATGAACGTTG GGCTTCGCCG
CTCTACATCT TCGGCGAAAG CTATGGAACC ACGCGCGCAG CGGGAATTTC GGGCTATTTA
GTGGATCGCG GCATTGCCTT CAACGGCATT TGCTTGCTCT CGGAAGTGCT GAACTTCGAG
ACGTTGGAAT TCAGCAAGAG CAACGACCTT GGCTATCAGC TCACGCTGCC GTCGTACACC
ATGATCGCCG GATACCACAA GATGCTCGCC CCGGAGCTTC TCCAGAACAT GGAGAAGACG
AAGTCTGAGG TGGAGCAGTT TGCGAATGGT GAATACGCGC AGGCGCTGCA AGCGGGCGAC
AGTCTCACTG CCGACCAGCG GGCGCACATC GTTGAGCAGC TCGCGAAATA CACGGGACTC
AAGAAGGACT TCATCGAGCA GTCGAACATG CGGATCGATG TCCGCGGCTT CACGCATAAT
TTGCTCATCG ACCAAAAGCT GCGCGTCGGA CGCCTCGACG GCCGCTATAC CGGACCGGAT
CCAAACGGCC TGATGGATAC GCCGTTCTAC GATCCGACGG GCTCGGCAAC CGATCCACCG
TTCACCGCGA CGTTTAACAA CTATCTGCGG AATGACCTGG GCTACAAAAC CGACATGCCT
TACTACGTAT CGGCACGCGA CATGGCGGGC GCAACCGAGC CCGGGCAGCG CGGTGGTGGA
CCGTTCCAGT GGGAGTGGGG ATCCGCAATT GAAGGCTATC CCGACACTGC GACCGCATTA
CGTGCAGCGA TGGTGAAGGA CCCGTACTTG AAGGTGTTGG TGATGGAGGG CGATTACGAC
CTCGCCACGC CGTATTTCGC GGCGAACTAC ACCATGAATC ATCTCGACCT GACGCAGCAG
TACCGCAAGA ATATTTCGTA CGCGCGGTAT GCGGCGGGTC ACATGGTTTA TCTGCCGATG
GATGGGCTTG CGAAGATGAA GAAGGACTAC GACTCGTTCC TCGATCAAAC CGCGAACCGT
CAGTAA
 
Protein sequence
MKQSWLNALL CCAILGATAG AQQKNPPKPQ VEKSSPSENI ANRPANAPEE PRAQREERRQ 
EDAPQQPRGE GQKPSMKWDM TETAPVVTHH EINVNGRALR YTATVGRLPI KDLTGTTEAL
MFYVAYTLDG QDATKRPVTF AFNGGPGSAS IWLHMGALGP RRVALQQDGM MPPSPYHLID
NPGTPLEKTD LVLIDAIGTG FSRPADLEKG KKFWSVKGDI EAFGEFIRLY ITRNERWASP
LYIFGESYGT TRAAGISGYL VDRGIAFNGI CLLSEVLNFE TLEFSKSNDL GYQLTLPSYT
MIAGYHKMLA PELLQNMEKT KSEVEQFANG EYAQALQAGD SLTADQRAHI VEQLAKYTGL
KKDFIEQSNM RIDVRGFTHN LLIDQKLRVG RLDGRYTGPD PNGLMDTPFY DPTGSATDPP
FTATFNNYLR NDLGYKTDMP YYVSARDMAG ATEPGQRGGG PFQWEWGSAI EGYPDTATAL
RAAMVKDPYL KVLVMEGDYD LATPYFAANY TMNHLDLTQQ YRKNISYARY AAGHMVYLPM
DGLAKMKKDY DSFLDQTANR Q
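
The protein sequence can be checked against the gene sequence by translating codons in frame. The following is a minimal sketch, not the database's own pipeline. It assumes the codon-to-amino-acid assignments of translation table 11, which to my understanding match the standard code and differ only in permitted start codons. The names `BASES`, `AMINO_ACIDS`, and `translate` are illustrative.

```python
# Codon table in the conventional T, C, A, G ordering (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"  # codons starting with T
    "LLLLPPPPHHQQRRRR"  # codons starting with C
    "IIIMTTTTNNKKSSRR"  # codons starting with A
    "VVVVAAAADDEEGGGG"  # codons starting with G
)


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # remove the 10-base grouping spaces
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        a, b, c = (BASES.index(base) for base in dna[i:i + 3])
        residue = AMINO_ACIDS[16 * a + 4 * b + c]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence in this record.
first_line = "ATGAAGCAAT CCTGGCTGAA CGCCCTACTG TGCTGCGCGA TTTTGGGCGC AACCGCAGGG"
print(translate(first_line))  # MKQSWLNALLCCAILGATAG
```

The output matches the first 20 residues of the protein sequence. Translating the final six bases, `CAGTAA`, gives `Q` followed by a stop codon, consistent with the last residue listed.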