Gene Information |
Locus tag | Acid345_4618 |
Symbol | |
ID | 4070775 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 5470932 |
End bp | 5472461 |
Gene Length | 1530 bp |
Protein Length | 509 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 637986658 |
Product | hypothetical protein |
Protein accession | YP_593692 |
Protein GI | 94971644 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0323] DNA mismatch repair enzyme (predicted ATPase) |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0329503 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.488499 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGTATT CCCACCCGAA CGACCCCAAT GCGGGTCGCC TGATTGAATC TCTCCGTCAC CTTGGCTATG GAAACTATGA AGCGGTAGCC GACATCGTTG ACAACTCAAT TGATGCGGAT GCTCAGAACA TCAACATCCG AGTACAGACT AAGTCAAATC AAATCATCAT TAGCATTGCC GACGATGGGC GAGGTATGTC GAAATCCATC CTCGACCAAG CTATGCGCCT GGGATCGCTG ACCGACCGCA ATGCCGAGTC GGATCTCGGC AAATTCGGCA TGGGGCTGGT GACAGCAAGC CTTTCGATGG CAAAGAAGCT ACATGTCGTC TCACGTGGCG ACGATGGGTG CTGGTCGAGC GCATGGGATG TCGACGAGAT CGTTGCGCAG AATGCGTTTC TCAAGCACTT TGAAGCTGCA ACATCCGACG AGGAAGAACT TCTAGCTGAA GAGATCGGTA AGAAGAAAAC CGGAACGCTG GTGCTGCTTT CAAAATGCGA CAACCTTGCC AATAAAAACA CCAGCTCATT TGCGTCTAAT CTGAGATCGC ATCTTGGGCG CGTACATCGC TACTTCATCG GTGCTGGTAG AGTTGTGACC GTGAATGGCG AGCCTGTGGA AGCGATCGAT CCACTTCAAC TCGCGGATCC AGACACGGAA ACCGTGCTCG ATGATGTCAT CTCGGTAACA TTGACAGACG ACGGCGAGAA GAAGACTGAC AACGTTAGGG TCAGAGTTGT GCTCATCCCG GAATCTCCCG TCACTGACCT CGATGTCGGC AAGTCTCTCA AGGCTCAGGG TTTCTATGTA ATGCGCAATC AACGCGAGGT GATGAACGCG GCCGCCCTCG GGTTCTTCAC CAAGCACAAC GATTTCAACC GAATGAGGGG TGAACTGTTT TTCCCAGGCA CTCTGGACCG CCTTGTTGGA ATCGAGTTCA CGAAACGGCA GGTTGAATTC GAACAGAGTC TTCAGGATCA ACTAAACAAC GTTCTGATAC CGGTCTGTCG AACAATCAAA AGGCGCGAAG CAACCAAGAA GCGAGTTCAA AGCGGCGAAG CACAGTTGAA GTTGCACGCT CAATCGATGA AGGTCATCGC GGAAAAAGAC AAACTTTTGA TCAAGCCGAA GGCCGTCATT GAAAAGCGTT CATCACCGCG TAACGGCAGC GGTGTGCAAG TCGATGACGC TCTAGATACA AATAAAGAAC GCAAAAACTT CAATCGTTCA CAGCTGGTTG AAACGAGGCT CAATTGCGTC ATTCGAGAAG AAAGACTCGG GCCGAACGGC CAAATTTATG AATGCGAGAT GGAGGGAAGA AAGCTCGTCA TTCGCTATAA CGTTGAGCAT CCCTTCTACC AACGGTTCGT GACCGACAAC ATGGATGAAG CTCGCGCTGT CACTGCCACC GATTTTTTGA TTTACAGCAT GGCTTCGGCG GAGTTGAAGT TTCTGGATGA AGGTGATCTG GAGGCTGTGA ATAACTTCAA GGCCGTGCTT TCCGCTAACT TGCGAACGCT TCTGAACTAA
Protein sequence | MRYSHPNDPN AGRLIESLRH LGYGNYEAVA DIVDNSIDAD AQNINIRVQT KSNQIIISIA DDGRGMSKSI LDQAMRLGSL TDRNAESDLG KFGMGLVTAS LSMAKKLHVV SRGDDGCWSS AWDVDEIVAQ NAFLKHFEAA TSDEEELLAE EIGKKKTGTL VLLSKCDNLA NKNTSSFASN LRSHLGRVHR YFIGAGRVVT VNGEPVEAID PLQLADPDTE TVLDDVISVT LTDDGEKKTD NVRVRVVLIP ESPVTDLDVG KSLKAQGFYV MRNQREVMNA AALGFFTKHN DFNRMRGELF FPGTLDRLVG IEFTKRQVEF EQSLQDQLNN VLIPVCRTIK RREATKKRVQ SGEAQLKLHA QSMKVIAEKD KLLIKPKAVI EKRSSPRNGS GVQVDDALDT NKERKNFNRS QLVETRLNCV IREERLGPNG QIYECEMEGR KLVIRYNVEH PFYQRFVTDN MDEARAVTAT DFLIYSMASA ELKFLDEGDL EAVNNFKAVL SANLRTLLN
| |