Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_1826 |
Symbol | |
ID | 4072887 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 2205264 |
End bp | 2206544 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637983835 |
Product | Pro-Hyp dipeptidase |
Protein accession | YP_590901 |
Protein GI | 94968853 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1228] Imidazolonepropionase and related amidohydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAGGC TGATTGCGGT TCTCTTGTTC CTCTGTACGC TTGCCTTCGC GCAATCCACT CCCAACAACG TGGTCGTCGT GAAAGCCGGC CATCTGCTCG ACGTCAAAAC TGGTCAGTAC CAGAACAACG TCAACATCGT CATCGAGAGC GGCGTAATCA AGAGCGTCGG CTCAGGCGCT CCTCCGGCAG GCGCGAAGGT CATCGATCTC TCCAACGCGA CGGTGCTGCC CGGACTCACA GACGCGCACA CGCACCTCAC CTATGACGCC GGCGATGTCG GCCTGAAGGG CCTGACGATC TCGCCGGAAA AAGAAGCTTT GAATGGCGCA GGAAATGCAC GCATCACGCT CCTAGCCGGA TTCACGACCG TCCGCAATGT TGGAGCGCGT GCTTTTGCCG ATGTTGCTCT CCGTGATGCC ATCAACGATG GCCACGTCCC TGGCCCGCGC ATGCTCGTCA GCGGTCCGGC ATTGAGCATC TCGGGCGGCC ACGGCGATAA CAATCTCCAA CCCTGGGAAG ACCACTCCTA CGGCGACGGG GTTGCCGACG GCGTTGACGC CGTGCAGCAC AAGGTCCGCG ACAACATCAA GTACGGCGCC GACGTCATCA AGTTCATGGC CACCGGTGGC GTTATGTCGA AGGGCGATAA CCCCGAGCAC TCGCAGTACA CGCTCGAGGA AATGAAAATG ATCGTCAGCG AGGCGCACCG CTTCGGCCGC AAGGTCGCCG CGCACGCACA CGGCGTGCAG GGCATTATGT GGGCGACTGA GGCCGGAGTG GATTCTATCG AGCACGGCAC CTATATCAAC GATGAAGCCA TCGCGCTCAT GAAGCAGCAC GGCACCTACC TTGTCCCCAC GCTCTATCTC ACCGAGTGGC TGCCCGAGAA CGCCGACAAG ATCGGCATTC CGCCATACGT GAAAGCCAAA ATGAATGTCG TGCTACCCCT GCTCCGCAAG AACATTTCGC ACGCGTTCGC CAGCGGCGTG AAGGTCGCTT TTGGCACCGA CGCCGCCGTT TATCCGCACG GCCTCAACGG TCATGAATTC AAGACCTACG TCGATCTTGG CATGACGCCA CTCCAGGCCA TTCAAAGCTC GACCATCGGC GCGCCCGACC TGCTCGGCAT GACCGACAAG ATCGGTACCG TCGAAGCCGG CAAGTTCGGT GACCTGATCG CCGTCACCGG CGACCCGTTG AAGGACATCA CAGAACTGCA GAGAGTGAAG TTCGTGATGA AGGGCGGCGA GGTGTACAAA GACGAAATCC ACGCCCACTA A
|
Protein sequence | MKRLIAVLLF LCTLAFAQST PNNVVVVKAG HLLDVKTGQY QNNVNIVIES GVIKSVGSGA PPAGAKVIDL SNATVLPGLT DAHTHLTYDA GDVGLKGLTI SPEKEALNGA GNARITLLAG FTTVRNVGAR AFADVALRDA INDGHVPGPR MLVSGPALSI SGGHGDNNLQ PWEDHSYGDG VADGVDAVQH KVRDNIKYGA DVIKFMATGG VMSKGDNPEH SQYTLEEMKM IVSEAHRFGR KVAAHAHGVQ GIMWATEAGV DSIEHGTYIN DEAIALMKQH GTYLVPTLYL TEWLPENADK IGIPPYVKAK MNVVLPLLRK NISHAFASGV KVAFGTDAAV YPHGLNGHEF KTYVDLGMTP LQAIQSSTIG APDLLGMTDK IGTVEAGKFG DLIAVTGDPL KDITELQRVK FVMKGGEVYK DEIHAH
|
| |