Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_3831 |
Symbol | |
ID | 4071115 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 4529575 |
End bp | 4533000 |
Gene Length | 3426 bp |
Protein Length | 1141 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637985854 |
Product | hypothetical protein |
Protein accession | YP_592905 |
Protein GI | 94970857 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0885568 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.451604 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGCTCT TCGGGCAACG TATGGTGCTG CATATCGCTT GTGTGTCGTT GGTACTCATC GGGCTCGTGA ACACAGTTCC GGCCCAAACC GCTTCCACAG GCGCCATCGC CGGTACGGTG ACGGATCCGG CTGGAGCCGT CATCCCGAAC GCCACGGTCA CGGCGACCGA CGCCCGCACG GGCGAAACTC GCACCACTAC AACCTCCAAC ACCGGCGCCT ACGTGGTTTC CCTCCTTAAT CCCGGCACGT ATGTGTTGGC GGTCACTAAG ACCGGCTTCA AGCGCGCGGA GCGGCCCGAC ATCACCGTCC ATATCACGGA GACCGTCGCT GACAACGTCC AGATGGCAGT TGGGTCGCAG AATGAAACCG TTTCCGTGAA CGATATGGGC GAACTGCTGA AGACCGAAGA TAGCTCTCTT GGTAACGTCG TGGATCAGCG TCAAGTCGCG AACCTGCCCT TGGTCACGCG CAACTATCAG CAAATCCTCG GCCTATCACC GGGCGTCTCG GCTGAGATTT TCAATGCCGG CGAGATCGGT CGCGGCGGCG TGGATGGTGC TCTCGTGACC GGCGGCGCCA GTTATTCCGA CAACAATTTT CAGATGAATG GCGTCAACGT CAACGACCTC CAGGGGAGCG GTCACTTCAG CGGTGGCGTG TCCGCTCCGA ACCCTGACAC CATCGAGGAA TTCAAGGTCC AGACTGGCCA ATACGACGCT TCGTTCGGAC GGAACGCGGG CGCCAACGTG AACGTGCTGA CGAAGTCCGG CACCAATCGC TGGCATGGCA GTGGTTGGGA GTTCTTCCGC AACGAGGCCA TGAATGCCAA TGATTACTTC CGCAAACAGA CCGACCAGCC TCGTGCCGAG CTGCGGCAAA ACCAGTTCGG TTTCACGTTT GGCGGCCCTA TCGTCAGGGA CAAACTTCTT TTCTTCACGT CCTATCAGGG AACGCGCCAG AACAACGGCA TCGATCCGAG CTGTTCGAGC AGCGTAACGC TGCCTGTGTT GACCGATGAT CGCTCCAATG CAGGGTTGGC AGCAGCCGTT GGAGCGACGA CAGCGTTCGG CGGTATGGAC CCGTATACCG GAAATCCAGT AACTGCGGCG AACATCAGTC CGCAGGCCGC GGCGCTCTTC AATGCGAAGC TTTCGAATGG GCAATACCTG ATTCCCAACC CGCAGGTCAT CAAAACCGAT CCTGCGACTG GCTTGCCCGA AGGCTTTTCC ACGTATAGCG TGGCGTGTCC CTATCACGAA GACCAGTTCA TGGTGAACCT CGACTGGCTG CAGAACTCCA AGAGCACGTT CCAGGAACGC TTCTTCTACG CGGACAGTGA AGCGACATCC ACGTTGCCGC AAACCCAGAC AGTTGGCGAT CAAGTTCCCG GTTCTCCCTC GAAGAACCCG CAGAACTTCC GCGATTTCTC GCTCAGCCAT ACCTATGTGT TCACCTCGGC ACTGGTGAAC CAGGCACAAA TTGGATTCAC CCGCAACCTG GCCGGCACCA ACCAGTCGTT CCCGCTGAAA TATTCCGACA TTGGTGTGAC TGCACCCGGA TTTGACGATG CACGTGCAAA CATCTCGGTG CTCGGCGGCT TCGATGAAGG CGGCAACGGC CAGACGACCG TCATCGCTCA GAACAACTAC ATCTTCCAGG ACACGCTCTC CTGGTTCCAC GGACGTCACT CGTTCCGCTT CGGCGGGAAC ATCACGCGTT CACAGGACAA TATCTCCGAG TTCGCGTTTG CCGGCTATAC GATCTTCCTC GACTATCCGG GCTTGATGAT TGGCGACGGT CCCTTCAATC CTTACCAGTC TGTCGACCTC GCGGGCATCA CCCAGCGCGG CTACCGCGTG TGGGACGGGT CGCTCTACGC GCAGGACGAT TTCAAAGTTA CCCAGAGACT CACCCTCAAT CTGGGCTTCC GCTATGAGCG ACTCGGTGAT GTCGGAGAGA ACGCGGGCAG AAATGCCAAC GTGAATCCTT CGCTGGTGAA TCCGAATCCA GGAGCCGCTG GAAGTCTCGA AGGCATCATT GTTGCCAGCA ACTTTTCGGG GCAGATTCCC GACGGAGTCA CTCGCGCCAG CAACGATCTC GCGATCAACG GTGACGGGCA GAACACGTGG AACCCACGCA TCGGTTTCGC ATGGATGTTG CCCGGCTCAG ATCGCTTCGT TTTACGCGGT GGTTACGGCC TTTACCGCCA GCGAATTACC GGCCAACCCT ACTTCCAGCT CGAGACCAAC CAGCCGTGGG GACAGTATCG AGCTGCCGTA GGGACTGCAG GCTTCGCCAA TCCGTTCGGC CCCGATCCCG GAGCATTCCC GCAGTTCTTC CCGTACTCAG CTCCCGTGGA ATACCTTCCC GGACAATTTG CTGCCACTAC CACGCTCTCT CCGTTTGCCT TGGCGCAGAA CCTCCGCCCG CCGCTGTTCC AGCAATACGG ACTGAATTTG CAGGCGCAGA TCACCAAGTC AACGGTGGTA CAGGTGGGCT ACGCCGGCTC GCACGGCACG CACATGCTCC TCTACAACAA CCTGAACCAG GCATCGGCGG CGAGTGCCGA CAATCCGGTG CGCGGTCAGA CCGACACTAC CTTAGGCAAC TTCTACGCCC GGATTCCCTA TGAAGGCTTC GGTGCCCTCT ACTACGACCA GAGCACCGGC TACTCGTGGT ACAACGCGCT GCAGGTCAGC GTGGAACATC GATTGAGCCA CGGATTGCAG TTCCTGGCCT CATATACCTA TGCCAAAGAC CTCACCAGCG TGTGGGGCGC CACGACCGGC GCGAACGGCG GAACACAGGT TGGCGATAAC TTCAACCCGA ACCGCGACCA CGGTCCGGAC ATCTTTATTC GTCCCCACCG TTTTGTGCTC TCGTACGTTT ACGAAATTCC CGGGTTCCAC GACCATGGCT GGGCGAGCGC GCTGCTGTCA GACTGGAAAG TTGCCGGCGT GACGACGCTT CAATCCGGAC ATCTCCTGCC GGCGCTCGAC GTGAATCCAA CGAACGTCTA CACCCAGGGT TATAACTACG ACTTCGCGAC CATGACACCC GGGTGTTCGC TGAGCAAAGG CGGTTCTGTT ACTGGTCGCC TGAACGGATG GATCGACACA ACCTGCTTCA CTTCCGCTCC TCCCGCATCG GCGGATGGCG GCACGGGTTT CGGAAACACT TCGCTGGGAC TGTTCAAGGG CCCGGCGCAA GCGAGCTCGG ACCTCTCGTT GATCAAGGTC TTCCCAGTAC GTCGGTTGAG TGAAGCTGCC AATTTCGAGT TCCGCGCGGA AGCCTTCAAC GTTTTCAACC AGGTCAATTT CGCCGATCCC GATAACGTCT TCACCGATGG TCCAAGTTTT GGAACCATCA CGAAGACGCT GTCCAACCCG CGCATTCTGC AGTTGGCGCT GAAGTTCTCT TTCTAA
|
Protein sequence | MTLFGQRMVL HIACVSLVLI GLVNTVPAQT ASTGAIAGTV TDPAGAVIPN ATVTATDART GETRTTTTSN TGAYVVSLLN PGTYVLAVTK TGFKRAERPD ITVHITETVA DNVQMAVGSQ NETVSVNDMG ELLKTEDSSL GNVVDQRQVA NLPLVTRNYQ QILGLSPGVS AEIFNAGEIG RGGVDGALVT GGASYSDNNF QMNGVNVNDL QGSGHFSGGV SAPNPDTIEE FKVQTGQYDA SFGRNAGANV NVLTKSGTNR WHGSGWEFFR NEAMNANDYF RKQTDQPRAE LRQNQFGFTF GGPIVRDKLL FFTSYQGTRQ NNGIDPSCSS SVTLPVLTDD RSNAGLAAAV GATTAFGGMD PYTGNPVTAA NISPQAAALF NAKLSNGQYL IPNPQVIKTD PATGLPEGFS TYSVACPYHE DQFMVNLDWL QNSKSTFQER FFYADSEATS TLPQTQTVGD QVPGSPSKNP QNFRDFSLSH TYVFTSALVN QAQIGFTRNL AGTNQSFPLK YSDIGVTAPG FDDARANISV LGGFDEGGNG QTTVIAQNNY IFQDTLSWFH GRHSFRFGGN ITRSQDNISE FAFAGYTIFL DYPGLMIGDG PFNPYQSVDL AGITQRGYRV WDGSLYAQDD FKVTQRLTLN LGFRYERLGD VGENAGRNAN VNPSLVNPNP GAAGSLEGII VASNFSGQIP DGVTRASNDL AINGDGQNTW NPRIGFAWML PGSDRFVLRG GYGLYRQRIT GQPYFQLETN QPWGQYRAAV GTAGFANPFG PDPGAFPQFF PYSAPVEYLP GQFAATTTLS PFALAQNLRP PLFQQYGLNL QAQITKSTVV QVGYAGSHGT HMLLYNNLNQ ASAASADNPV RGQTDTTLGN FYARIPYEGF GALYYDQSTG YSWYNALQVS VEHRLSHGLQ FLASYTYAKD LTSVWGATTG ANGGTQVGDN FNPNRDHGPD IFIRPHRFVL SYVYEIPGFH DHGWASALLS DWKVAGVTTL QSGHLLPALD VNPTNVYTQG YNYDFATMTP GCSLSKGGSV TGRLNGWIDT TCFTSAPPAS ADGGTGFGNT SLGLFKGPAQ ASSDLSLIKV FPVRRLSEAA NFEFRAEAFN VFNQVNFADP DNVFTDGPSF GTITKTLSNP RILQLALKFS F
|
| |