Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_3218 |
Symbol | |
ID | 4070430 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 3807170 |
End bp | 3810397 |
Gene Length | 3228 bp |
Protein Length | 1075 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637985239 |
Product | hypothetical protein |
Protein accession | YP_592293 |
Protein GI | 94970245 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.158023 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.188502 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGAGCGA AGCTCCTGCT GCTGGTTGTA TTGCTCGCGT CCAGTTTCAC CCTCCTAGCT CAAACATTCA GAGGCGGCAT CGAAGGCACA GTCACAGATG CCTCTGGTGC AGCCATCCCC GGCGCACAAG TCACCGCCAA CGATCCCGCA ACCGGCACCT CGCGTAGCGC GACGACCGAT GGATCCGGCA ACTACGTGTT TACGGAAATG CCGCTTGGCG CTTATGACGT TACAGTCGAG CACGATGGAT TCCGCAAGCA GGTAATTCGC GGGGTGAAAG TGGAAGTCGG CGCTCCGAAC CGCGCCAACG CCACGCTCAC TCCGGGCCAG GTGAAGGAAA CGATTGACGT CTCGGCAGAG ATTCCGGTAA TCGAGACGCA GGCCGACACC ACGGGCGACA CGATTTCCGG CGACCAGGCG AAGGACCTGC CGGTCAACGG GCGTGATTTC ACCAAGCTCT TCCAGTTAGT CCCAGGCGCA GCGGGTGACC CGAGCGGTAT CAATGACTCG CCTGGTTCAT TCGGCATGGT CAGCATCAAC GGCAACCGTG GACGCTCCAA CAACTACCTG CTCGACGGTA CCGACATGAA TGACGGTTAC CGCAATCTGC CGGCCATCAA CGAGGGCGGC GTGTTCGGTA CACCTGCAAC GATTCTTCCC ATAGACGCCC TTGCCGAAGT TCCGGTGATT TCGAATACGG AAGCCGAATA CGGACGCAAC TCCGGCGCGG TGGTGAACAG CGTCACGCGC TCTGGGACGA ACGCGCTGCA CGGCAGCGTG TACGAGTACT TCCGCAATAA CGGACTGGAT GCGCGCAACT ACTTCAACAG CTCCGGCCCG CAGGACGCGT TCCACAACAA TCAGTTTGGC GGATCGCTGG GCGGCCCGAT CATCAAGGAC CGTACGTTCT TCTTCTTCTC TTATGAAGGA CAGCGTGAAA GCGGCGGCAT TCCAACGCCC GAGACAGTGC CGACGCTGGA CCAAATCGGC GCCTACACTG CCGGTGGCGG CGTGGTAAAT CCCGTGATCG CGAGCCTGCT CGCTCGCAAT CCGTGGGGAA CCTTGCCGCA ATCAGACGGC AACGTGCTCT TGACGAATCC GTTCACGAAC ACAGTCGATA GCCTGATCGC AAAGATCGAT CACCACTTCC TGGGCGCCGA TAAGCACGAT CTCATCACGG GCCGTTATTA CTACGGCAAC AGCAGCCAGA GCTTCCCGTT GGCGCTGGTC GGCGGCGGCG TAACTCCAGG TTTCAACACC ACGACGCCAA CGCGGGTGCA GATTGTTTCG CTGTCGTACA CGCACATCTT TTCGCCGAAG TTCCTGATGG AGTTCCGCGG CGGCTGGAAC CGTTTTGCGG AGCAGTTCTT CGCGCAGGAC AAGAGCTTCG ATCCGGCGTC GATCGGGTTG TACTCGGCAT CGCCGAGCGC TACGGCGAGG GACGGGGGAT TGCCGCTGAT GACGTTCGGC GATGGCACTG GCAGCATCGG CGCGAACCTT TCTGTGCCGC GCGGACGCGT CGATACAAAC ACGCAGTTCT TTACCAATGC GTCGTATAGC ACCGGCAAGC ACAATTTCAA ATGGGGCTAT GAATTTCGCC GCACGTTCGT GAATGGCTAC TTCGACGCGG CCTATCGCGG CCGAATCCAC TTCAACTCGT TCGACGATTT TCTCGCGGGC ACGCCTGCGG ATTCAGGAAA CCACTCGGCC ACGGGGTACT CCGCGCGCCA CACCTTCGAG AACAACCACG CATTCTATTT CCAGGACAAC TGGCGGCTGA CCAACCGGCT GACGGTCAAC TACGGATTGC GCTGGGATTA TTTCGGCGTG ATCGGTGAGC AAAACAACCT GTTCAGCTTC CTCGACGTCC CGACCGGAAA CCTGAAACAG GTGGGAGCAA ATGGCGGCCC GAGCACGCTC TACCCGAAGG ACTTCAACAA CTTTGGGCCC CGCCTGAGCC TCGCCTATGA CGTCTTCGGT ACCGGCCATA CGGTGGTCCG CGCCGGCTAT GGAATGTTCT ACGACGCATT CTCGCAGGAC TTTTTCGTAG GACAGTTGCC GTGGAACACC TTCAATCCCG GTCCCGCATA CAACGCGGTT CCCGGCGCCG AGATTGACTT TACCGGCAGC GTGAATCCGA TCGATCCGAA TCCTGCAAAC CACACGCCGA TATTCACCGG CTACGGTGCC ACGGATGTGT TCAGCGTGGA CCAGCACCTG CGGACGCCGT ACATCCAGAG CTACAACGTC AACGTTGAAC AAGAGATCCG CAACGGTGTG GCGGTGAGCC TGAGCTACGT TGGATCGCAG GGCCGCAAAC TGTTCCGCTA CATCGATCTT AACCAGGTCA ATCCGGCCGA TGGCTCGATC GCGTATCCGC AGTATTACTA CGTGAACCAG TTCCAGTCAT CGGCGGCTTC GGGTTACAAC GCGCTCCAGG CGCAGTTCAA AATCTCGAGC TGGCACGGAC TGACCTCGAC GATGAACTTC ACGTGGGGCC ACTCGATCGA CAATGCCAGC GACGGTCAGG ACTATGTGAC CAACGCTACG CAGCCGGACA ACAGCTTCAA TCCTGGCGCC GAGAGAGCTA ACTCTAACTT CGACTTGCGT AAGGCGTTCA AGTGGTATTA CACGTACGAA CTGCCGAAGT TCGAGACAGC GAAGTGGATC ACGAACGGGT GGGCGCTCAA CGGTGTACTG TCGCTCGCTG ATGGGCAGCC GTTCAACGTG ACCTGGCTCG ACAACTTCAA TTACGACATC AACGGAACGG GCGAGTACTT CGGCCGCCCG GACTTGGTTG GAGATCCTTG GGCAGGCACG CATGGACCGG CCAATTTCCT TAACCTCTCG GCGTTCGCAG CGCCTTGCAA CTGGGACAAC GTGAACGGTG GCTGTATCGA CGGCCAGCAC ATTGGGAGCT TGAGCCGCAA CGCGTTCCGC GGTCCGGCGT ACAAGAATTT CGACTTCTCA GTGTCGAAGA CGTTTGCCTT CACGGAACGA GTAAACGCTC GCTTCGGCGC GGACTTCTTC AACATCTTCA ACCATCCGAA CTTCTCCAAC CCGGTGCTTC CGAATTACGT GGTGGACGCG GCTTACAACG GAGACGCGAG CGGCGTGGGA CATGGATTCC TGCCGATCAC GGCGACTCCT GATGTAGGCG GTGGCAATCC GTTCCTCGGT GGCGGCGGCC CGCGCGACAT CCAGTTGTCG CTCAAAGTCA CGTTCTAA
|
Protein sequence | MRAKLLLLVV LLASSFTLLA QTFRGGIEGT VTDASGAAIP GAQVTANDPA TGTSRSATTD GSGNYVFTEM PLGAYDVTVE HDGFRKQVIR GVKVEVGAPN RANATLTPGQ VKETIDVSAE IPVIETQADT TGDTISGDQA KDLPVNGRDF TKLFQLVPGA AGDPSGINDS PGSFGMVSIN GNRGRSNNYL LDGTDMNDGY RNLPAINEGG VFGTPATILP IDALAEVPVI SNTEAEYGRN SGAVVNSVTR SGTNALHGSV YEYFRNNGLD ARNYFNSSGP QDAFHNNQFG GSLGGPIIKD RTFFFFSYEG QRESGGIPTP ETVPTLDQIG AYTAGGGVVN PVIASLLARN PWGTLPQSDG NVLLTNPFTN TVDSLIAKID HHFLGADKHD LITGRYYYGN SSQSFPLALV GGGVTPGFNT TTPTRVQIVS LSYTHIFSPK FLMEFRGGWN RFAEQFFAQD KSFDPASIGL YSASPSATAR DGGLPLMTFG DGTGSIGANL SVPRGRVDTN TQFFTNASYS TGKHNFKWGY EFRRTFVNGY FDAAYRGRIH FNSFDDFLAG TPADSGNHSA TGYSARHTFE NNHAFYFQDN WRLTNRLTVN YGLRWDYFGV IGEQNNLFSF LDVPTGNLKQ VGANGGPSTL YPKDFNNFGP RLSLAYDVFG TGHTVVRAGY GMFYDAFSQD FFVGQLPWNT FNPGPAYNAV PGAEIDFTGS VNPIDPNPAN HTPIFTGYGA TDVFSVDQHL RTPYIQSYNV NVEQEIRNGV AVSLSYVGSQ GRKLFRYIDL NQVNPADGSI AYPQYYYVNQ FQSSAASGYN ALQAQFKISS WHGLTSTMNF TWGHSIDNAS DGQDYVTNAT QPDNSFNPGA ERANSNFDLR KAFKWYYTYE LPKFETAKWI TNGWALNGVL SLADGQPFNV TWLDNFNYDI NGTGEYFGRP DLVGDPWAGT HGPANFLNLS AFAAPCNWDN VNGGCIDGQH IGSLSRNAFR GPAYKNFDFS VSKTFAFTER VNARFGADFF NIFNHPNFSN PVLPNYVVDA AYNGDASGVG HGFLPITATP DVGGGNPFLG GGGPRDIQLS LKVTF
|
| |