Gene Acid345_2579 details

Gene Information

Locus tag: Acid345_2579
Symbol:
ID: 4070542
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 3047545
End bp: 3048834
Gene Length: 1290 bp
Protein Length: 429 aa
Translation table: 11
GC content: 58%
IMG OID: 637984596
Product: membrane dipeptidase
Protein accession: YP_591654
Protein GI: 94969606
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2355] Zn-dependent dipeptidase, microsomal dipeptidase homolog
TIGRFAM ID:
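
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the source database.

```python
# Values copied from the record above.
START_BP = 3047545
END_BP = 3048834


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of codons minus one, assuming a single terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


print(gene_length(START_BP, END_BP))  # 1290
print(protein_length(1290))           # 429
```

Both results agree with the Gene Length (1290 bp) and Protein Length (429 aa) fields.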


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGAAGAT TCGCATTCCT ACTGTTGCTC TGCGGGCTAT GTGCCGCACA ATCTCCCACC 
CCTAAGCAGC CGGCCAAAGC AGCCCCCGTC GGCTGGAAGG CCATTCACGA CTCTGCCCTC
GTTGTTGATA CCCATGCCGA CACTCCGCAA CCTTGGCTGG ACAAGAACAT CAACGCCGCC
GACCCCGACT CGAAGCTGAT GGTCACGATT CCGGCGGCCA AAGCCGGCAA CCTCGGTGCC
GAGTTTTTCT CGATCTGGGT GGATCCAGTA AAGTTCAAAG GCCATTACCC CGACCGCACT
CTCGCGCTCA TCGATGCCGT CTATCAGCAG GTCCAGCGCA ATCCGAAAGA CATGATGTTC
GCCACCAGCG TGAAGGACAT CTACGCCGCT CGCCGCGAAC ATAAGCTCGC CTCGTTGATG
GGAATCGAGG GTGGCCATTC CATCGCCAAC GATCTCGGAC TACTGCGCGA TTACTACCGC
CTCGGCGTGC GTTATATGAC GCTCACCTGG TCGAACACCA ACGACTGGGC TGACTCCTCC
GGTGACGTGG ACGACAAGAA CATTCAGCAC CACGACGGTC TTACCGATTT CGGCCGCGAC
GTGGTCCGTG AGATGAACCG CATCGGCATG ATCGTGGACA TCTCCCACAC CTCTGACCGC
ACCTTTTACA AGACGCTGGT CGTCGCCCGC GCTCCCGTAA TCGCCTCGCA CTCTTCTTCT
CGCGCGCTCA CCAACGTTCC CCGCAACATG ACCGACGACA TGCTCCGCGC CCTCAACCGT
AACGGCGGTG TCGCCATGGT CAACTTCAAC TGCGGATTCA TCAGCAACGA ATACGCAGCC
GCAGAGAAGA AACTCGAAGC GGAAGACCAC TCCATCGCCG ACCTTAAGAA GAAAGCTGCC
GAACCGGGTT CAAATATCAC CGAAGCCGAC ATCCAGAAGG CGGAAGACGC GTTCTACGCC
AGTATTCCCC GCCCTCCGCT CAGCAACTTG ATTGACCACA TTGATCACAT GGTGAAAATC
GCCGGGATAG ATCACGTCGG ACTGGGTTCA GATTTCGACG GCGTTAGCTG CACCCCCGAG
GGTATCGATT CGGCCGCCGA TCTTCCGAAA ATCACGCAGG CACTCCACGA CCGCGGCTAC
AATGCAGAGC AGATTAAAAA GATTCTCGGC GGCAATATCC TCCATGTCTT TTCCGAAGTG
GAGAAGACGG CCGCGCAGCT TCAGGCGGAG TCGCCCGAGA ACAAAGACAC GCGGCACGAG
GTAAAGCTCG ACGCCCAGCC CAAGAAATAG
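
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before any computation. As a rough sketch, the helpers below remove the whitespace and compute the GC fraction, which can then be compared with the GC content field of the record. The helper names are hypothetical, and only the first sequence line is used here as sample input.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above; paste the full block to check the whole gene.
first_line = "ATGCGAAGAT TCGCATTCCT ACTGTTGCTC TGCGGGCTAT GTGCCGCACA ATCTCCCACC"
seq = join_blocks(first_line)
print(len(seq), round(100 * gc_fraction(seq)))
```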
 
Protein sequence
MRRFAFLLLL CGLCAAQSPT PKQPAKAAPV GWKAIHDSAL VVDTHADTPQ PWLDKNINAA 
DPDSKLMVTI PAAKAGNLGA EFFSIWVDPV KFKGHYPDRT LALIDAVYQQ VQRNPKDMMF
ATSVKDIYAA RREHKLASLM GIEGGHSIAN DLGLLRDYYR LGVRYMTLTW SNTNDWADSS
GDVDDKNIQH HDGLTDFGRD VVREMNRIGM IVDISHTSDR TFYKTLVVAR APVIASHSSS
RALTNVPRNM TDDMLRALNR NGGVAMVNFN CGFISNEYAA AEKKLEAEDH SIADLKKKAA
EPGSNITEAD IQKAEDAFYA SIPRPPLSNL IDHIDHMVKI AGIDHVGLGS DFDGVSCTPE
GIDSAADLPK ITQALHDRGY NAEQIKKILG GNILHVFSEV EKTAAQLQAE SPENKDTRHE
VKLDAQPKK
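
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table from the usual TCAG ordering. It assumes that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in their permitted start codons, which does not matter here because this gene starts with ATG). `translate` is an illustrative helper, not a library function.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TTT, TTC, TTA, TTG, TCT, ... order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the block spacing and newlines
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence above.
print(translate("ATGCGAAGAT TCGCATTCCT ACTGTTGCTC"))  # MRRFAFLLLL
print(translate("CAGCCCAAGAAATAG"))                   # QPKK
```

The two fragments reproduce the first ten and last four residues of the listed protein sequence. Passing the full gene sequence block to `translate` allows the same comparison over all 429 residues.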