Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_2741 |
Symbol | |
ID | 4069432 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 3244127 |
End bp | 3246541 |
Gene Length | 2415 bp |
Protein Length | 804 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637984758 |
Product | hypothetical protein |
Protein accession | YP_591816 |
Protein GI | 94969768 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.732341 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCAGTC GCCTGACATG CCTTTGCATG TTCTTGTGGG TTTGCGTCTT CGGGTTTGCA CAGCAAGCGT CCGCACCGCT TTCCGCCGCG CCACAAGTGA TTGTCCCTCG TCTGATCCGA TTCTCCGGTC ACTTGAAAGG CGTATCCGGA ACTGTGGGCG TTACCTTCAC GCTGCACAAG TCGGAAGAAG ATGATGTCGC GCTATGGACG GAAACCCAGA ACGTTCAACT CGACCGCACG AATAAATATG ACGTACTGCT CGGTGCGACC AAAGCGGAAG GTATTCCGAT GGAGCTGTTT ATCTCCGGCG AGGCGCAATG GCTGGGAATT CGCGCGGAAG GCCAAGCCGA GCAGACGCGG GTGTACCTGG TCAGCGTGCC GTATGCGCTA CGGGCAGCCG AGGCAGATTC GCTTGCCGGA CATCCTCCGA GCGAATTTGT GACCAATGAG AAGTTGGCGT CGGTAGTGAA GCAGGAAGTG CAGGAGCAAG CGTCATCGAC GGGTGCGGAG CGCACGGCGA CGGGTGCGAT CGCAAACGCA CTCGCGGGCA CGCCGACGAA TTTCAGCGGC AGCACGACGG ACCAGGTAGT AGGCGTCACG CAGAGCGGTA CAGGGAGCGG CGTTTCGTCG AATGCGACCA CGGGATACGC GCTCTACGGA AGAAGCGCGG GCACGGCAGT GTATGGGAAC AGTACCGCAA CCGCAACGGC CGGCTACGGA GTGTACGGCA CGTCGAGTTC TCCGCTGGGC TATGGAATCT TCGGATCGAA CAGCGCGACG ACGGGAACAG CGGTGGGCAT ACGCGGGACG TCTACATCGA ATGCCGGCAT CGCAGTTTAT GGCACTGCAA ACACGGCGAC AGGGACGGCA ACCGGCGTAA AGGGCATCAC GCAATCGCCC GATGGTTACG GCGTGTTTGG ACAGAACACA TCCCCGACAG GAACGGGAGT GGGTGTTCGC GGGACATCCG CTTCGACGAC CGGCATTGCG CTGTACGGCA CGAACACGGC GAGCAGCGGC GTCACGAAGG GCGTGTTCGC GTCGGTCTCG AGCGCGAGTG GAACGGCGGC GGTATTTCAG AACACCGCGA GCGGCAAACT GTTCAGCGGC GTAGTAGGCG CAGGGACCGA GGTATTCAGT GTGAACGGCG CCGGAGACAT CTCGGGAGCG GGCAGCCTGA ACGTTGGCGG CACCGCGACG TTTGGCGGAC TGGTGGGCTT CGCGAGCGGA CAGCAATTCC CCGGCACTGG AGCGCTCAAC ATTGCCAACA CCTTTACGAT GCCGCAGACC ATCACCGGCA GCACCCAGAC CATGCTCCAG GTTACGGGAT CCGCCTCTTC AGCCTCAGTC ATTTACGGGC ACTCAACGGA CTTAGGCAGT TTCTCGTCCT CGGGCGTACA GGGCGCCTCT GACGGACCGG TGGGACTCGG AGTGTACGGG GTCAGCTACG GCATCAGCGG AATCGGCGTG GAAGGAAACG GAAAGATGAA CGGTGTGCTC GGGTTCTCCA CAACAACCAG CAAGCTATGG AACACCTACG CCGCTGTAGG TGTTCATGGC GACACCGGGG CCACCAACGG CGTTGGAGTA TTAGGCACGG TAGACGGGGG ATACGGGGTA AAAGGCATTA ACAATGGAAC GCTGGCAAAT ACGGCGGGAA CCTATGGCGC CGCTGGCCCG GCTTCGGGGT TCGGCGGCAT AGCGGGCGTT TGGGGAGACT CAGCGAACCA TGTCGGCACC ATGGGCTCAA GTGTGAACTT CGCGGGCGTG TACGGCGTGA GTAGCCTGAA TAGCGGAGTG CAGGGAGTGA ACAACAGCCA GGGCTACGGG GTGCTTGGCA CGGCGGCGAA TGTCGCTCTC GACTATGGCA CTGGCGTGCA AGGGGAAAGC TTCGGGAAAG TAGTGTTTCC CAGCGGCTAC GGGTCGGACG GTGTACGCGG AATCACGCAC ACTACCAGCG GCGCAGGTGT CTCCGGCATC AATGATGCAG CGGGTGGGGT CGGTGTCTAC GGCTTGTCGA CCAATGGCGG CTTCGGCTTC GCAACTCCCA GCAACGTGCA ACAGAACCGC TCCATGGGCG GCTGGGTCAA AGCGATGGTG TACGTGGATC CCACGCTTGC AAGCGGCCAG ATCGTCCGGT GCTTCAACTC GCAGCTCACG GGAGCGGCGA GTTCCACTCC GCCATGTGGT TTTACCTATT CCACCTACGG TACGCCAGGT TGTCCTATCG TTGACTTTGG GTTCAAGGTC GACGACCGCT TTGTTTCGGT CGTGGTGCAG TCTGATGGCG AACACACTGC TGCGATATCG CTGTTCCCAG TGTGCCACCC CAAGACGGCG AACAACATTT GCCTTTCGAC CGTTGACACA GCTGGATTTG GCTTAGACGA ACCGTTTTAC TTGGTCGTTT ATTGA
|
Protein sequence | MSSRLTCLCM FLWVCVFGFA QQASAPLSAA PQVIVPRLIR FSGHLKGVSG TVGVTFTLHK SEEDDVALWT ETQNVQLDRT NKYDVLLGAT KAEGIPMELF ISGEAQWLGI RAEGQAEQTR VYLVSVPYAL RAAEADSLAG HPPSEFVTNE KLASVVKQEV QEQASSTGAE RTATGAIANA LAGTPTNFSG STTDQVVGVT QSGTGSGVSS NATTGYALYG RSAGTAVYGN STATATAGYG VYGTSSSPLG YGIFGSNSAT TGTAVGIRGT STSNAGIAVY GTANTATGTA TGVKGITQSP DGYGVFGQNT SPTGTGVGVR GTSASTTGIA LYGTNTASSG VTKGVFASVS SASGTAAVFQ NTASGKLFSG VVGAGTEVFS VNGAGDISGA GSLNVGGTAT FGGLVGFASG QQFPGTGALN IANTFTMPQT ITGSTQTMLQ VTGSASSASV IYGHSTDLGS FSSSGVQGAS DGPVGLGVYG VSYGISGIGV EGNGKMNGVL GFSTTTSKLW NTYAAVGVHG DTGATNGVGV LGTVDGGYGV KGINNGTLAN TAGTYGAAGP ASGFGGIAGV WGDSANHVGT MGSSVNFAGV YGVSSLNSGV QGVNNSQGYG VLGTAANVAL DYGTGVQGES FGKVVFPSGY GSDGVRGITH TTSGAGVSGI NDAAGGVGVY GLSTNGGFGF ATPSNVQQNR SMGGWVKAMV YVDPTLASGQ IVRCFNSQLT GAASSTPPCG FTYSTYGTPG CPIVDFGFKV DDRFVSVVVQ SDGEHTAAIS LFPVCHPKTA NNICLSTVDT AGFGLDEPFY LVVY
|
| |