Gene Acid345_4468 details

Gene Information

Locus tag: Acid345_4468
Symbol:
ID: 4070951
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 5301446
End bp: 5303317
Gene Length: 1872 bp
Protein Length: 623 aa
Translation table: 11
GC content: 60%
IMG OID: 637986507
Product: peptidyl-dipeptidase A
Protein accession: YP_593542
Protein GI: 94971494
COG category:
COG ID:
TIGRFAM ID:
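
The start and end coordinates, the gene length, and the protein length in this record are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the coordinates are 1-based and inclusive, and that the last codon is a stop codon that is not counted in the protein length; both assumptions match the values listed above but are not stated by the record itself. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length_bp(5301446, 5303317)
print(length)                     # 1872
print(protein_length_aa(length))  # 623
```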


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.19029
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGACGAC CTCACTACGA CGCGAGCACC GCCGCTTGTA CAATTCCCGC CGGAGGTACC 
ATGCTGCGAA AAGCATTCCT AACTGCTGTC CTTACTGCTG CAATCGTTCC GCTTTCTGTG
TTCGCGCAAT CCACTCCCAC CGTCGCCGAC GCCGAAAAGT TCGTGAAAGA CGCCGAGACC
AAGCTCGACG ACCTCGGTGT AAAAGCCCAG CGCGCTGAGT GGGTCGCGGA AAACTTCATT
ACCGACGACA CCCAGGAAAT CGCGGCCGAA GCCAACGAGA TCGCGAACGC TGAAGCGACG
AACTTCGCCA AGCAGACGAT CCAGTTTGAG AAGCTGCAAC TCCCGCCGGA GTTGGCGCGC
AAGATGTTGC TCCTCAAACT CGCGGCCATT GCGCCTAGCA ACCCCAAAGA CCTCTCGGAA
CTGACCCGGG TGCAGGCCTC GATGGCCGCC GACTACGGCA AGGGCAAGTA TTGCCCCACC
ACCGGCAAGC ACGCCGGCGA GTGCCTCGAC ATCACCAAGA TCGAGCACAT CATGGAAACC
TCCACCGATC CCGACGAACT GAAGGACCTC TGGATCGGCT GGCACAAGGT CGGCGCACCC
ATGCGCCAGC GCTATGCGCG CTTCGTCGAG CTCAGCAATA ATGGTGCCCG CGAAATGGGA
TGGGCCGACA CCGGCGCCTA CTGGCGTGCC GGCTACGACA TGCCTCCCGA CCAGTTCAGT
GCTGAGTTGG AACGGCTGTG GCAGCAGATG CGTCCGCTGT ACGTTTCTCT GCACACCTAC
GTCCGCAACC AGCTGGTAAA GAAGTATGGG GAGCAGGCCG TGAAAGACGG CATGATCCGC
GCCGACCTCC TCGGCAACCC CTGGGCCCAG GAGTGGGGCA ACATCTACCC CCTCGTCGCC
CCGCCCACCA AGCATCCGCA GCTCGACGTC ACCCAGATTC TGCAAGACAG AAAAGTTGAC
GAGCTCGGCA TCGTTCACTA CGGCGAGAAT TTCTTCAAAT CGCTGGGCTT CCCGGCGCTG
CCGCAAACTT TCTGGGAGCG CTCTCTCTTC CTGAAACCGA AAGACCGCGA CGTAATCTGC
CACGCCAGCG CATGGGACAT CGACAACAAA GATGACCTCC GCATCAAGAC CTGTCTCCAG
GTTCGCGCCG ATGACTTCGT CACTGTCCAC CACGAACTTG GCCACAACTT CTATCAGCGC
GCCTACAAGG CCCAGTCGCC GCTTTTCGAG AACGGCGCCA ACGACGGCTT CCACGAGGCC
ATTGGCGACA CCATCGCGCT CTCCATCACG CCGGAGTACC TGAAGCAGGT CGGCCTCATC
GACACCGTCC CGCAACAGGA TGATGTCGCT CTGCTGCTGC GCCAGGCGCT CGATAAGGTC
GCGTTCCTAC CCTTCGGATT GCTGATCGAC CAGTGGCGCT GGAAGGTCTT CAACGGGCAG
ATCAAGCCGG AAGACTACGA GAAGTCGTGG GTGCAGATGC GCGAGCACTA CCAAGGCGTC
TACCCCCCGA CCGACCGCAC CGAAGCCGAC TTCGATCCGG GCGCGAAGTT CCATGTGCCG
GCGAACGTCC CGTACACGCG GTACTTCCTC GCGCGCGTTT TGCAATTCCA GTTTTATCGC
GCGATGTGCA AAGAGGCCGG ATTCACCGGT CCGCTGCACC AGTGCTCGTT CTACAACAAC
AAGAAAGCCG GCGCGAAACT GGATGCCATG CTTGAGATGG GCGCCAGCAA GCCCTGGCCG
GAAGAGCTCA AAGTCCTGAC CGGCGAAGAC AAGATGGACG CCGGGGCCAT GCTCGATTAC
TTCGCGCCGC TAAAAAAATG GCTCGACGAA CAGAACAAGG GGCAGCAAGC CGGCTGGACT
GAACAGAAGT AA
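
A GC-content figure like the one in the record is conventionally the percentage of G and C bases in the sequence. The sketch below shows that calculation; the helper name `gc_content` is hypothetical, and the usage line runs on only the first ten bases of the gene sequence, so its result describes that short prefix and not the whole gene.

```python
def gc_content(dna: str) -> float:
    """Percentage of G and C bases; whitespace in the input is ignored."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First ten bases of the gene sequence only (illustrative prefix).
print(gc_content("ATGCGACGAC"))  # 60.0
```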
 
Protein sequence
MRRPHYDAST AACTIPAGGT MLRKAFLTAV LTAAIVPLSV FAQSTPTVAD AEKFVKDAET 
KLDDLGVKAQ RAEWVAENFI TDDTQEIAAE ANEIANAEAT NFAKQTIQFE KLQLPPELAR
KMLLLKLAAI APSNPKDLSE LTRVQASMAA DYGKGKYCPT TGKHAGECLD ITKIEHIMET
STDPDELKDL WIGWHKVGAP MRQRYARFVE LSNNGAREMG WADTGAYWRA GYDMPPDQFS
AELERLWQQM RPLYVSLHTY VRNQLVKKYG EQAVKDGMIR ADLLGNPWAQ EWGNIYPLVA
PPTKHPQLDV TQILQDRKVD ELGIVHYGEN FFKSLGFPAL PQTFWERSLF LKPKDRDVIC
HASAWDIDNK DDLRIKTCLQ VRADDFVTVH HELGHNFYQR AYKAQSPLFE NGANDGFHEA
IGDTIALSIT PEYLKQVGLI DTVPQQDDVA LLLRQALDKV AFLPFGLLID QWRWKVFNGQ
IKPEDYEKSW VQMREHYQGV YPPTDRTEAD FDPGAKFHVP ANVPYTRYFL ARVLQFQFYR
AMCKEAGFTG PLHQCSFYNN KKAGAKLDAM LEMGASKPWP EELKVLTGED KMDAGAMLDY
FAPLKKWLDE QNKGQQAGWT EQK