Gene Information |
Locus tag | Acid345_1143 |
Symbol | |
ID | 4069914 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 1423782 |
End bp | 1426862 |
Gene Length | 3081 bp |
Protein Length | 1026 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637983153 |
Product | hypothetical protein |
Protein accession | YP_590220 |
Protein GI | 94968172 |
COG category | |
COG ID | |
TIGRFAM ID | |
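The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the reported gene length includes the terminal stop codon; the function names are hypothetical and not part of the source record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Expected protein length in aa, assuming the CDS ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Values taken from the record above.
length_bp = gene_length(1423782, 1426862)
print(length_bp)                  # 3081
print(protein_length(length_bp))  # 1026
```

Under these assumptions the computed values agree with the Gene Length (3081 bp) and Protein Length (1026 aa) fields.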
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0637346 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGATGAAGT TCGCCTTCCT GCTGTTGGTG TGCTCATTCT TTCTGGTTGC GACGAATGAA GCTGGTGCGA GTGCGATGAC GACTGTCGTT CTGGAAGAGC CGGGATTTCC GACGGCGGAT GCGGCGGCGC CGGACATGGC GCGATTACAT GCGCTCATAT CCGACGCGAA GTTTGTGGTG GCGGACCAAC TGGTTGCTTC GTTAGCGGAC AGGGCGACCA CGCTGCTGGT GCTGCCCTAT GGGTCGGCGT TTCCGGAAGC GGTGTGGCCG GCAATTGATA GCTATTTGTA TCGCGGCGGG AACCTGTTGG TGATTGGCGG GCGACCGTTT ACGCGTGCGG CGTTTCGCGG AAAATCAGGA TGGGAGTTGC GCGAGTACAG CGTGCGCTTC GTGCTCGGAC TGAATATCGA TCAATACCAG GAGACGCCGG GCAGCGACGG AATGCAGTTC GAGGCGAATC CGGACGTGAT GGTGAAGGCG TCGCAGTTTA GGTGGAAGCG CGGATTTAGC CCGGTGATCC GGCTGTCATC CAGCGATGTT TATAAGCGGC AGGGATCGGC GGGAGAACTC GACGCGCGGC TCGATGCACT GGCGTGGGGA CTGCGTGATG GACGAAAGTT CGCAGCGCCG GCAATTGGGA TTGATCATGT GCGCGGCAAG TGGGGCGGTG GGCGCTGGGT GTTCGTGAAT TCGGAGATCA CGTCATCGGT ATATGCAGGC GATCTCATTC GCGATCTTGT GGAGTACACG GAGCGCGGAG CGCAGGAGTT CACGGCGCGT CCGACGCTGC CACTCTATGC GGAAGGCGAG CCGGTGCAGG TGGAATTGAA CTGGTCGGCA AACTCTTCTT CGGCCTCGTT GAGAGCAGAA GTGTCGATCG CGCCGGAAGA CAAGCCGGAG CAGAAGGTGG CTCGCACCGC GACACTCGCA AATGGCGGCG CAGTGGTGGA GTTCCCGCCG GTGCAGGAAA AAGGATTGTA TCGAATTGAA TCACGATTGT TCGATGGCGA TCGAACTGTC GCGGCGTATC ACTCTGGCTT TTGGATGCGT GACCTCGAGT ATCTTCGCTC GGGGCCGAAA CTTGGCGTCA ATAAAGACTA CTTTGAACTC GATGGAAGGC CCTTGGCGGT TGTCGGCACG ACTTACATGG CGAGCGACGT GCAGCGGTTG TTCTTCGATC ATCCGAATGT GTATGTGTGG GACAAAGAGT TGGGGCAGAT CAGCGGTGCC GGCTTGAACA TGATTCGCAG CGGCTGGTGG ACGGGCTGGG ACAAGCTGTG CGACGAGACG GGACGTCCGT ATGAGCGGAC GCTGCGAACG CTGGAAGCCT ATCTGATGAC CGCGCGTAAA CATGGACTGC CGGTGCAGTG GAACTTCCTC GCATTTCTCC CCGAGGTGCT CGGTGGAGAG AACCCATATC TTGACCCCGT GGCGGTGCGG CGGCAGAAGG CGTTCTATTC TGGCGTGGCC GCGCGCTTCC ACGATGTGCC TTTCGTGGCG TGGGACCTGA TTAACGAGCC GAGCATTTCA CAGTTCGTGT GGAAGACGCG GCCGAACCAG GATTGGATCG AACTGCAGCA GTGGAACGAG TGGTTGAAGC AAAAGTACCC GGATCGCGCT GCGCTGGCGG ATGCATGGAA CATGCCGCAA CTTGGCGACA CCGCGCCGGT CCCGACAGAG TCTGAGTTTG CGCCGCGCGC CATGTACGCT GGCCCGAATT CGCTCAAGAT TTACGACTTT TATGTGTTTG CGCAGGAGAA GTTCGCGGGA TGGGCGCAGC AGATGCGGGA GGCCATCCGC 
GCCACCGGCG CACAGCAACC GATCGTGGTG GGACAGGATG AGGGTGGATA CAACGATCGT CCGAACCCAG CGTTCTTCGG CAACGCCGTG GACTTCACGG CGAACCATTC GTGGTGGGAG AACGATTCGC TGTTGTGGGA TTCGCTCGTG GCGAAGCAGC CGGGGAAAGC GATGCTGATC CAGGAGACTG GATTGCAGCG TGAACTGAAC ATGGATCAGA CCGCGCGGCG CACAGTTGAA AGCGAAGGCG CACTTTTCGA GAGGAAGATG GCGCTCTCGT TCGCGCAGGG AAGCGGTGCG ATCCAGTGGC TGTGGAACAC CAACACCTAC ATGACCGAAG GCAACGAAGC GCCGATTGGT GCGTTACGCG GAGATGCGAC GGAAAAACCG GAAGCCACAG TGCTGCGCAA CTTCGGAACG TTTGTAGCCA AGGCGCGTGA GGTGCTTCGC AATCCGGTGC AGCCGGATGT GGCGATCGTG ACGTCGCAGG CAATGCAGTT CTCAGTGATC GGCGATGCAC AACTGGAGGC CCAACGGAAA GCGGTGCGTG CGCTGGCGTA TGCAAACCAC GTGGCGCCGT ATGTAATTGC TGAGAACCAG ATTGCGAAGC TGGGCAATCC GAAGCTGGTC GTGCTGCCTT CGCCACAGGC GCTGAACGAG AAGACATGGC AGACGCTGGT GGCGTACGTG AAAAACGGCG GAAGTTTGCT CGTTACAGGT GGCGTGGGAC GTGACGAACA TTGGCACGTT GTCGATCGTT TTAACGCGCT CGGCATCAAG GGGGCGACGG AGCCGCTGAC GTACAAAACT GCCTCCGTGA AGCTTGGCGC TAACGAAGTC CGCATGAGCT TCGATCAGAC CAAGCAGGCG TGGTTGGAGA CGGCGCGCTT TGCGGACGAA AAGAAAATTA GCGAAGCCTC GTTGGGTCGC GGAAAGATCT ACTGGGTTGC ATATCCGGTA GAGCTAGCTG AGGGACTCGA TGCAGCGGCG CAGGTTTACA AGTATGCGTT GGCGCAAGCG GGAGTGCAGC CGCTGTATGG CGTCGAAGGT GTAGTTTCTC CGGGTGTATT GATCTATGCG ACGGTGCTCG AGGATGCCGT AGCCTACCTA TTCGTGTCGG ATGACGCGGC GGATACGAAT ATCGCGGTGC GCGATCGCAC GACCGGAGCA CGCTTGCAGG TGACGTTGCC GTCGCAACGG GCGGCGATTC GGATCATCCG CAAGAAAGAT AAGTCGGTGG TTGCGGAGTA TTCAAACGGC GGCGTTCTAG AGGATGAGTA A
Protein sequence | MMKFAFLLLV CSFFLVATNE AGASAMTTVV LEEPGFPTAD AAAPDMARLH ALISDAKFVV ADQLVASLAD RATTLLVLPY GSAFPEAVWP AIDSYLYRGG NLLVIGGRPF TRAAFRGKSG WELREYSVRF VLGLNIDQYQ ETPGSDGMQF EANPDVMVKA SQFRWKRGFS PVIRLSSSDV YKRQGSAGEL DARLDALAWG LRDGRKFAAP AIGIDHVRGK WGGGRWVFVN SEITSSVYAG DLIRDLVEYT ERGAQEFTAR PTLPLYAEGE PVQVELNWSA NSSSASLRAE VSIAPEDKPE QKVARTATLA NGGAVVEFPP VQEKGLYRIE SRLFDGDRTV AAYHSGFWMR DLEYLRSGPK LGVNKDYFEL DGRPLAVVGT TYMASDVQRL FFDHPNVYVW DKELGQISGA GLNMIRSGWW TGWDKLCDET GRPYERTLRT LEAYLMTARK HGLPVQWNFL AFLPEVLGGE NPYLDPVAVR RQKAFYSGVA ARFHDVPFVA WDLINEPSIS QFVWKTRPNQ DWIELQQWNE WLKQKYPDRA ALADAWNMPQ LGDTAPVPTE SEFAPRAMYA GPNSLKIYDF YVFAQEKFAG WAQQMREAIR ATGAQQPIVV GQDEGGYNDR PNPAFFGNAV DFTANHSWWE NDSLLWDSLV AKQPGKAMLI QETGLQRELN MDQTARRTVE SEGALFERKM ALSFAQGSGA IQWLWNTNTY MTEGNEAPIG ALRGDATEKP EATVLRNFGT FVAKAREVLR NPVQPDVAIV TSQAMQFSVI GDAQLEAQRK AVRALAYANH VAPYVIAENQ IAKLGNPKLV VLPSPQALNE KTWQTLVAYV KNGGSLLVTG GVGRDEHWHV VDRFNALGIK GATEPLTYKT ASVKLGANEV RMSFDQTKQA WLETARFADE KKISEASLGR GKIYWVAYPV ELAEGLDAAA QVYKYALAQA GVQPLYGVEG VVSPGVLIYA TVLEDAVAYL FVSDDAADTN IAVRDRTTGA RLQVTLPSQR AAIRIIRKKD KSVVAEYSNG GVLEDE
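The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that formatting, compute a GC fraction, and translate a coding sequence. It is an illustration only: the function names are hypothetical, and the codon table is the standard amino acid assignment, which translation table 11 shares for all codons (table 11 differs only in which codons may act as starts).

```python
# Codon table built in the conventional TCAG order; amino acid assignments
# are those of the standard code, which table 11 also uses.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase the result."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean_sequence(raw)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(raw: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    seq = clean_sequence(raw)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First five codons of the gene sequence above, followed by its TAA stop codon.
print(translate("ATGATGAAGT TCGCCTAA"))  # MMKFA
```

Applied to the full gene sequence, `gc_fraction` could be compared with the reported GC content and `translate` with the listed protein sequence; neither full-length result is asserted here.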