Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_3491 |
Symbol | |
ID | 4069067 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 4117281 |
End bp | 4118561 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637985513 |
Product | Pro-Hyp dipeptidase |
Protein accession | YP_592566 |
Protein GI | 94970518 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1228] Imidazolonepropionase and related amidohydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.163826 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCCCCG GCTACTCAGC GCGACATCTT GCTTACATCC GAATTCGACC CACCGCGCTG GCGTTGCTCG TGTGGACGCT CTCTTGGGCT CTGACCGGTT TCTGCTCTGA TCTGGCGCTT GTTCACGCAA AGATTTATCG TTCTCCCACT CAACCCGCGA TTACTGACGG TGTAATTCTT GTGCGTGGGA GCCGCATTGT TGCAGTCGGC CCAGGCGCCA GGGTCAAAGT CCCGATTCAT GCGAATGTCA TCGACTGCCA AGGCCGAGCT GTGACGGCTG GCTTCTGGAA CAGCCATGTG CACATTCTTT TCCCGGGCAT TCTCCATGCA GAAAAACTCA CTTCTCGGCA TGTCAGTTCC GAACTGCAGG AGATGTTTAC TCGCTGGGGT TTCACAACCG TGTTTGACAT CGCGTCAGTC CAGCAAAACA CCACTCTCAT TCGCCGTCGC ATAGAGAGTG GTGAGGTTAC CGGCCCCACG ATCCTTACCG TTGGTGAGCC GTTCTGGGTC AAGGGCGGAA CGCCGATCTA CATTAAAGGA TTTCTTGAAG CCAACCACAT CGTCATGCCC GAGGTTTCAT CCCCCGAACA AGCCGTCGTT AGGGTGCGCC AGCAGATCAA TGGAGGTGCG GACGCGATTA AGATTTTTGC GAATTCCGTC GAACGCGACA GGATATTGAC GATGCCATCG GATTTGGCAA AGGCGATCGT CGCTGAGGCC CACCGCGCTG GCAAACCGGT TTTTGCTCAT GTCTCCAACG ACCAGGGGAT CGAAGTCGCG CTGCAGAGCG GCGTCGATAT ACTCGCCCAC ACCACTCCCG CCGGGGATCT GTGGAGCGCG CCTTTTGCCG AGCGCCTGGT AGCTGCCCAC ATCGCACTCA CTCCCACCCT GACTCTGTGG GATGTAGAAG CCAAGAAAGG TGGCGTCTCA TCCGAGCAAG CTGAGAAATG GATGTCCAGG GCGGCCGAGC AGTTGAAGGC CTTTTCTGAG GCGGGAGGAG AAGTGCTATT CGGTACCGAT GTCGGCTATA TCGAACAGTT CGATACCTCC GAAGAATTTA CGTGGATGGC CCGTGCCGGG TTGAATTTCC AGCAGATTTT GGCTTCTCTC ACCACGAATC CATCTGCGCG TTTTGGGTAT TCGAGTCACC GCGGGCGCAT CGCAGAAGGG ATGGATGCCG ATCTTGTGGT GCTGAATGGG GATCCTGGCA AAGATGTCAT CGCCTTTTCT AAAGTTCACC AAGTGATTCG CGGCGGGCAG TTGATCTACC AAGCACGGTA G
|
Protein sequence | MIPGYSARHL AYIRIRPTAL ALLVWTLSWA LTGFCSDLAL VHAKIYRSPT QPAITDGVIL VRGSRIVAVG PGARVKVPIH ANVIDCQGRA VTAGFWNSHV HILFPGILHA EKLTSRHVSS ELQEMFTRWG FTTVFDIASV QQNTTLIRRR IESGEVTGPT ILTVGEPFWV KGGTPIYIKG FLEANHIVMP EVSSPEQAVV RVRQQINGGA DAIKIFANSV ERDRILTMPS DLAKAIVAEA HRAGKPVFAH VSNDQGIEVA LQSGVDILAH TTPAGDLWSA PFAERLVAAH IALTPTLTLW DVEAKKGGVS SEQAEKWMSR AAEQLKAFSE AGGEVLFGTD VGYIEQFDTS EEFTWMARAG LNFQQILASL TTNPSARFGY SSHRGRIAEG MDADLVVLNG DPGKDVIAFS KVHQVIRGGQ LIYQAR
|
| |