Gene Information |
Locus tag | Acid345_1442 |
Symbol | |
ID | 4071631 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 1740708 |
End bp | 1741961 |
Gene Length | 1254 bp |
Protein Length | 417 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637983451 |
Product | peptidase M20 |
Protein accession | YP_590518 |
Protein GI | 94968470 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2195] Di- and tripeptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGGCAG TCGAACCACT TCGGGTAGAG GCCACAAGCA AGCGCGGTCG GTTAAGCGAG ACCCAATGGG CGCGTTCCGC CCAGCAGCGC GTCGTTCGCC TTGCCCAATT ACAGGTGGTG CACGACGCCT TCGACTGGCT TCACCGCAAG GAATTTCAGC TTCAGGAACG CCAGATTGAG CTCGCAAAAA TTCCCGCTCC TCCGTTTGGT GAGGCCGCCC GTGCAGACTG GCTACAGTTC CGCTTCCGCG AGCTGGGATT GAAAGACATT CATACCGACG CCGTCGGTAA CGTTCTTGCC ATCCTGCCCG GCGCCAACAC CTCTGCCGGC TACGTCTCGA TCAGCGCGCA CCTTGACACC GTTTTCCCTG CCGAGACCAA ACTCGATATC AAACGTGATC GTGACCGCAT GGTTGGTCCC GGCATTTCGG ACAACGCCGC CGGCGTAGTA GCCATGCTTG CCATCGCTGG CGCCATCAAA GACGGCCGCA TTCGCCCCTC AGCGCCAATC CTTTTTATCG GCAATGTTGG GGAGGAAGGC GAAGGCAATC TCCGCGGCGT CCGCCATATT TTCGACGACA CGCGCTGGAA GGACGCTATC GAGTTCTCCA TCATCCTCGA CGGTGCAGGA ACCGATTCGA TCATCGCGGA AGGTCTTGGC AGTCGTCGCT ATGAAGTCCG CGTTAATGGC CCTGGCGGCC ATTCCTGGAG CGATTTCGGA GCACCCAACC CGATCGTCAT ACTTTCCAAG ATCATCCAGG AATTCAGTGC CACTTCCCTT CCCTCGAAAC CGAAAACTAC CTACAACATC GGCGTGATCC AGGGCGGCAC TTCCGTCAAC TCCATCCCGG AGATGGCCTG CATGCGCGTG GACTTGCGGT CGGTAGCAGT CGCCGAAATC GAAAGATTGG AAGAAGCTCT GCGCCGCGCC GCGGACTCCG CAAAGACCCG CGACGTGGAA ATCGAAATCC GCACCATCGG TGATCGTCCT GCCGGCGAAC TCAAAAGCGA TTCGCCCATG CTGGCCATGG TCCGCGCCGT AGACGCGCAT CTCAGCATTA CCTCACGCAT CCAGCGCGCC TCCACGGACG CCAACATCCC GCTCTCACAA GGACGCGAGG CGATCGCACT CGGCGCAGGT GGAACCGGAG CCGGCGCCCA CACTCTTCAC GAGTGGTTCG ATCCCTACGG ACGCGAAGTA GGCTTGAAGC GTATCCTGCT TCTGACGCTG GCTCTCACCG GCATCGACGA GTAA
|
Protein sequence | MSAVEPLRVE ATSKRGRLSE TQWARSAQQR VVRLAQLQVV HDAFDWLHRK EFQLQERQIE LAKIPAPPFG EAARADWLQF RFRELGLKDI HTDAVGNVLA ILPGANTSAG YVSISAHLDT VFPAETKLDI KRDRDRMVGP GISDNAAGVV AMLAIAGAIK DGRIRPSAPI LFIGNVGEEG EGNLRGVRHI FDDTRWKDAI EFSIILDGAG TDSIIAEGLG SRRYEVRVNG PGGHSWSDFG APNPIVILSK IIQEFSATSL PSKPKTTYNI GVIQGGTSVN SIPEMACMRV DLRSVAVAEI ERLEEALRRA ADSAKTRDVE IEIRTIGDRP AGELKSDSPM LAMVRAVDAH LSITSRIQRA STDANIPLSQ GREAIALGAG GTGAGAHTLH EWFDPYGREV GLKRILLLTL ALTGIDE
|
| |
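The coordinate and length fields in this record are mutually consistent, and the relationship can be checked with a few lines of arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS length counts the terminal stop codon (the gene sequence above ends in TAA). The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count, assuming the CDS includes one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


# Values copied from the record above (Acid345_1442 on NC_008009).
start_bp, end_bp = 1740708, 1741961

length = gene_length_bp(start_bp, end_bp)
print(length)                     # 1254, matching "Gene Length"
print(protein_length_aa(length))  # 417, matching "Protein Length"
```

The strand does not affect either length. It matters only when extracting the sequence itself, since a minus-strand gene is read as the reverse complement of the replicon.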