Gene Acid345_3491 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3491 
Symbol 
ID4069067 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4117281 
End bp4118561 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content56% 
IMG OID637985513 
ProductPro-Hyp dipeptidase 
Protein accessionYP_592566 
Protein GI94970518 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1228] Imidazolonepropionase and related amidohydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.163826 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCCCCG GCTACTCAGC GCGACATCTT GCTTACATCC GAATTCGACC CACCGCGCTG 
GCGTTGCTCG TGTGGACGCT CTCTTGGGCT CTGACCGGTT TCTGCTCTGA TCTGGCGCTT
GTTCACGCAA AGATTTATCG TTCTCCCACT CAACCCGCGA TTACTGACGG TGTAATTCTT
GTGCGTGGGA GCCGCATTGT TGCAGTCGGC CCAGGCGCCA GGGTCAAAGT CCCGATTCAT
GCGAATGTCA TCGACTGCCA AGGCCGAGCT GTGACGGCTG GCTTCTGGAA CAGCCATGTG
CACATTCTTT TCCCGGGCAT TCTCCATGCA GAAAAACTCA CTTCTCGGCA TGTCAGTTCC
GAACTGCAGG AGATGTTTAC TCGCTGGGGT TTCACAACCG TGTTTGACAT CGCGTCAGTC
CAGCAAAACA CCACTCTCAT TCGCCGTCGC ATAGAGAGTG GTGAGGTTAC CGGCCCCACG
ATCCTTACCG TTGGTGAGCC GTTCTGGGTC AAGGGCGGAA CGCCGATCTA CATTAAAGGA
TTTCTTGAAG CCAACCACAT CGTCATGCCC GAGGTTTCAT CCCCCGAACA AGCCGTCGTT
AGGGTGCGCC AGCAGATCAA TGGAGGTGCG GACGCGATTA AGATTTTTGC GAATTCCGTC
GAACGCGACA GGATATTGAC GATGCCATCG GATTTGGCAA AGGCGATCGT CGCTGAGGCC
CACCGCGCTG GCAAACCGGT TTTTGCTCAT GTCTCCAACG ACCAGGGGAT CGAAGTCGCG
CTGCAGAGCG GCGTCGATAT ACTCGCCCAC ACCACTCCCG CCGGGGATCT GTGGAGCGCG
CCTTTTGCCG AGCGCCTGGT AGCTGCCCAC ATCGCACTCA CTCCCACCCT GACTCTGTGG
GATGTAGAAG CCAAGAAAGG TGGCGTCTCA TCCGAGCAAG CTGAGAAATG GATGTCCAGG
GCGGCCGAGC AGTTGAAGGC CTTTTCTGAG GCGGGAGGAG AAGTGCTATT CGGTACCGAT
GTCGGCTATA TCGAACAGTT CGATACCTCC GAAGAATTTA CGTGGATGGC CCGTGCCGGG
TTGAATTTCC AGCAGATTTT GGCTTCTCTC ACCACGAATC CATCTGCGCG TTTTGGGTAT
TCGAGTCACC GCGGGCGCAT CGCAGAAGGG ATGGATGCCG ATCTTGTGGT GCTGAATGGG
GATCCTGGCA AAGATGTCAT CGCCTTTTCT AAAGTTCACC AAGTGATTCG CGGCGGGCAG
TTGATCTACC AAGCACGGTA G
 
Protein sequence
MIPGYSARHL AYIRIRPTAL ALLVWTLSWA LTGFCSDLAL VHAKIYRSPT QPAITDGVIL 
VRGSRIVAVG PGARVKVPIH ANVIDCQGRA VTAGFWNSHV HILFPGILHA EKLTSRHVSS
ELQEMFTRWG FTTVFDIASV QQNTTLIRRR IESGEVTGPT ILTVGEPFWV KGGTPIYIKG
FLEANHIVMP EVSSPEQAVV RVRQQINGGA DAIKIFANSV ERDRILTMPS DLAKAIVAEA
HRAGKPVFAH VSNDQGIEVA LQSGVDILAH TTPAGDLWSA PFAERLVAAH IALTPTLTLW
DVEAKKGGVS SEQAEKWMSR AAEQLKAFSE AGGEVLFGTD VGYIEQFDTS EEFTWMARAG
LNFQQILASL TTNPSARFGY SSHRGRIAEG MDADLVVLNG DPGKDVIAFS KVHQVIRGGQ
LIYQAR