Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_1504 |
Symbol | |
ID | 4069251 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 1835543 |
End bp | 1837003 |
Gene Length | 1461 bp |
Protein Length | 486 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637983513 |
Product | peptidase M20C, Xaa-His dipeptidase |
Protein accession | YP_590580 |
Protein GI | 94968532 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2195] Di- and tripeptidases |
TIGRFAM ID | [TIGR01893] aminoacyl-histidine dipeptidase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.774685 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTTCTT CTACTGCCGT GGCGGTAGCC GAACTGGAAC CGAAAGCCAT CTGGAAGCAC TTCGAGGCGC TGACCAAGAT TCCTCGCCCC TCGACGAAGG AAGATGCCGC CCGCAAGTAC GTGATCGGTA TCGCCGAAAA GCACGGTCTG AAACATGTGA TGGATGCGGC GGGGAACCTC GTGGTGAGCA AGCCGGCGAC GAAGGGCCGC GAACATGCCC CGATGGCCGC GCTGCAAGGG CACCTCGACA TGGTGTGCGA GAAGAACGAA GGCACGCACT TTGACTTCGA TAAGGACGCG ATCAAAGTCA TCCGCGGCGA AGATTGGCTC TACGCCGACG GCACCACGCT CGGTTCCGAC AACGGCGTAG GCGTCGCGAC TGCGCTGGCT GTGATGGAGA GCAAGGACAT CGCGCATGGG CCGCTGGAAT TCGTGTTCAC GATTGACGAA GAGACCGGTT TGACCGGCGC TTCGGACTTC CAGCCGGGGC TGCTGAAGTC GAAGTACTTC CTCAACCTCG ACGGCGAAGA GAAGGGAACG CTTTGCATTG GTTGCGCGGG CGGTGTGAAC ACTCCGGCGC ACCGCAAGGT GACGAAGAAA GCCGCACCCG CCGGTACGGC GCTGCGCGTG AAAGTTTCCG GACTGAAGGG CGGCCACTCC GGCGTGGACA TCCACCTTGG ACGCGGCAAC GCGGTGCGCA TTCTGGGGCG CGTGTTGGAA ACTCTGCTGC GCGACGAACA CGCGAATCTC GCCGACATCA AGGGCGGCAG CGCGCACAAC GCGATTCCGC GCGAGGCCTA CGCGGTTGTA GTGATCGATC CGAAGCGCGA AGAAGAAGTG AAGACCGTGG TGGCACGCAT CGCGGAGGAC GTGAAGGCGG AGTTGGGCGC GTTCGATCCC GACGTGAAGA TCTCGGTGGA AAACGTGGCG GCACCGAAGG AGATCATGGA GCACGCGGAC GCGACGAAAG TTGCGGACCT GCTGGCGACG GTCCATCACG GCGTGCTTGC AATGAGCCCG GACATCAAGG GGCTGGTGCA GACGTCCACG AATCTTGCGA CGGTTTCCCT GAATGGCGAT ACGGTTGAGG TTGTCACCAG CCAGCGCAGT TCGATCGAGA GCAGCAAGAA CGCGATTGCA CGCATGGTCG CTGCGCTCTG CAAAAACACG GGATTCCACG CGGAGCACAC CACGGGGTAT CCGGGATGGA AGCCGGAGCC GAACAGCGAC ATCGTAAAGA TCTCGCGCAA GGTGCACGAG GAGGTCCTCG GCAAGGACCC GGAACTGGTA GCGATGCATG CTGGCCTGGA GTGCGGCGTG ATCGGCGAAA AGCACCACGG CATGCAGATG ATTTCGTTCG GGCCGCAGAT CGAGAACCCG CACAGCCCGA ATGAACGCGT CCAGATTTCC TCGGTTGAGA GCTTCTGGAA GTTCCTGCGC GTGCTGCTCG AGCGGATTTA G
|
Protein sequence | MSSSTAVAVA ELEPKAIWKH FEALTKIPRP STKEDAARKY VIGIAEKHGL KHVMDAAGNL VVSKPATKGR EHAPMAALQG HLDMVCEKNE GTHFDFDKDA IKVIRGEDWL YADGTTLGSD NGVGVATALA VMESKDIAHG PLEFVFTIDE ETGLTGASDF QPGLLKSKYF LNLDGEEKGT LCIGCAGGVN TPAHRKVTKK AAPAGTALRV KVSGLKGGHS GVDIHLGRGN AVRILGRVLE TLLRDEHANL ADIKGGSAHN AIPREAYAVV VIDPKREEEV KTVVARIAED VKAELGAFDP DVKISVENVA APKEIMEHAD ATKVADLLAT VHHGVLAMSP DIKGLVQTST NLATVSLNGD TVEVVTSQRS SIESSKNAIA RMVAALCKNT GFHAEHTTGY PGWKPEPNSD IVKISRKVHE EVLGKDPELV AMHAGLECGV IGEKHHGMQM ISFGPQIENP HSPNERVQIS SVESFWKFLR VLLERI
|
| |