Gene Information |
Locus tag | Acid345_4013 |
Symbol | |
ID | 4071149 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 4741984 |
End bp | 4744356 |
Gene Length | 2373 bp |
Protein Length | 790 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637986040 |
Product | hypothetical protein |
Protein accession | YP_593087 |
Protein GI | 94971039 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
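The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 4741984
end_bp = 4744356
gene_length_bp = 2373
protein_length_aa = 790

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A CDS should be a whole number of codons (3 bases each).
codons, remainder = divmod(span, 3)

# Assumption: the last codon is a stop codon and contributes no amino acid.
expected_protein_length = codons - 1

print("span (bp):", span)
print("leftover bases:", remainder)
print("expected protein length (aa):", expected_protein_length)
print("matches record:", span == gene_length_bp
      and expected_protein_length == protein_length_aa)
```

Under these assumptions the span equals the listed Gene Length, and the codon count minus the stop codon equals the listed Protein Length.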
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGTTCA TGAGGGGGGC ATTTCTGAAC GCAGTCCTGG CGACAGCAGT CACCGCCGGG TTCGCGTTCG CGCAGGACGC GCCACCGCCG CCGGATTCTC AGTATCCGGC GACGAGCAAT TCTGCACAGA ATGATTCCGG ACAGTACGCC ACACCGGATT CCAACACCAT TCCGCAGTAT TCCGCGCCGT CACCAAAGTA TTCGTCACCT TCGGCAGATC AGCCCGAGGA AGGCGGATAT CCGCAATCGG CACAGTCGCA AAATGCGCCT CCGCCACCGG CAGACGCGCA GCAGGCGAAT GCGAACGGTG AGGACAACGA CGACTCGCAG GACCCATCGC GCCGCGTAGC GCGTATGCAG TTCATGGACG GGCAAGTTTC CATCCAGCCC GGCGGCGTGA ACGACTGGGT TGCGGGAACG CTGAACCGCC CGATGACCAC CGGCGATAAC GTTTGGACCG ACCAGAATTC GAAAGCCGAA TTGAACGTCG GTACCGGCAC GTTCCGCATG GGCGCGGAAA CCAGCGTCAC GCTGGCGAAT GTTGCCGATA AGACCACGCA GTTGCAGGTG CACCAGGGCA CGCTGAATTT GCGCGTGCGT CACCTGTACG ACGGCGAAAC CTACGAGATC GACACGCCGA ACATGGCGTT CACCGTGCAG AAGCCCGGCG ATTACCGCTT TGACGTGGAC CCGAACGGCG ACACTTCGTT CGTCACGGTT TGGAAGGGCG AAGGCAACGC CACCGGCGAC GGACCATCCG TAGCGGTGCG TCAGGGTGAA AAGGCGAAGT TCTCGAATGG AACTTCGATG GCGTACACGG TGGATCGCGC GCCCGGACAA GATGAGTTCG ACGAGTGGGC GGTCGCACGC GATCGCCACG ACGAGAATTC CACGTCGGCG AAATATGTAT CGCCTGACGT GATTGGTTCG AGCGATCTCG ACGACTACGG CACCTGGAAG AAGGACGACC AGTACGGAAA CGTCTGGATC CCGAACGACC AGAATGACAA CTGGCAACCC TATAGCGACG GCAATTGGGC CTATCAGCAG CCATACGGCT GGACGTGGAT CGGCGCCGAG CCTTGGGGCT TTGCTCCGTA TCACTATGGC CGTTGGGTGC AGGGCGGCTG GGGCTGGGGC TGGACGCCCG GACCGTACGC TTACTGGGGC GCGCCGTATT ACGCTCCCGC ACTCGTCGGC TGGTATGGCG GTGGTTTCGG TATCGGCATA GGCTTCGGCG GCGGCTGGGG TTGGTGCCCG CTGGGGTGGG GTGAGGCCTA TCATCCTTGG TACCACCACG GACATTCGTA CTTCAATCAC GTGAATGTGA CGAATACGCA CATCACCAAC ATCAACAACA TTCACAACAA CTACGGCAAC CATGGCCAGC CGGCGAATTA TCGCCACGGC TTGGTGGTCG CGAATGGTAA GGCTGTCACG AGCGGCATGA ACATTCGCAA TAACCGGATG AATGTGACCG CGCAGCAGCG GACCGCAATG TTGCAGCACC CGGTCAACAA TCGCAGCCTC GGAAACGAGC TGCGTCCGAC GGCACAGAGC CGCATCGGCG GCCAGACCCG CGCGTCCGTT GCACCTCCGG CGCGTACTGC GAACCGACCG ACGTACTCGC ACCTTGCGCC GCCGGCACGT GGGCAGAATG GCAATGCGGT GAATGTACGC GCCAACGGCG GCATTAACAC GACACGCCCG AGCGCCGGCT TGAACAACGG CCGCAACGGT GTAGTCGCGA ACAACGGCCA TGCACCTGCG CCGATGACGC GCAACGTACC GCATCCGCCG 
AGCGCTACTC CAGGCTCGTC CAACAACAGC CACTACGTTC CGCGTCCGCC GGCAAGTTCT GGCCGGCAGA TGCAGAACGC ACCGGCAGCG ACCACGAATT CGCGTCCGGG AGGCACCTAC GCAAGGCCGG GACAGAGCTA CTCTGCTCCG CCGAGCCAGT CACACTCGAG CCAGTCACAC AATGTGCCGC GGATGAACGG CCCGGCGCAG CAGTCGTCGC GCAGCTACGC GGCGCCTCCG AGCCGCAGCT ATAACAGCCC GAACTACGGG CGCAGCTACA GCCCAAGTCC CAGCTACGGG CGTTCGTACA GCCCGAGCCA GGGCTACGGC CGTGCGCCGA GCTACAGCGC ACCGCACAAC GGTGCACCCA GTGCCCAGCA GCATTACAGT GCACCGCATT ACAGTGCGCC AAGCGCGCCC CACTACAGCT CACCGAGCTA CGGTGGTGGC CACGCTTCGG CACCGTCGTA CCACGGTGGC GGCAGCTACG GGGGTGGTGG TGGTCACGCC AGCAGCGGTG GCGGCGGTCA CAGCAGCGGT GGTGGTGGCG GCTCGCACGG TGGACATCAC TAA
|
Protein sequence | MRFMRGAFLN AVLATAVTAG FAFAQDAPPP PDSQYPATSN SAQNDSGQYA TPDSNTIPQY SAPSPKYSSP SADQPEEGGY PQSAQSQNAP PPPADAQQAN ANGEDNDDSQ DPSRRVARMQ FMDGQVSIQP GGVNDWVAGT LNRPMTTGDN VWTDQNSKAE LNVGTGTFRM GAETSVTLAN VADKTTQLQV HQGTLNLRVR HLYDGETYEI DTPNMAFTVQ KPGDYRFDVD PNGDTSFVTV WKGEGNATGD GPSVAVRQGE KAKFSNGTSM AYTVDRAPGQ DEFDEWAVAR DRHDENSTSA KYVSPDVIGS SDLDDYGTWK KDDQYGNVWI PNDQNDNWQP YSDGNWAYQQ PYGWTWIGAE PWGFAPYHYG RWVQGGWGWG WTPGPYAYWG APYYAPALVG WYGGGFGIGI GFGGGWGWCP LGWGEAYHPW YHHGHSYFNH VNVTNTHITN INNIHNNYGN HGQPANYRHG LVVANGKAVT SGMNIRNNRM NVTAQQRTAM LQHPVNNRSL GNELRPTAQS RIGGQTRASV APPARTANRP TYSHLAPPAR GQNGNAVNVR ANGGINTTRP SAGLNNGRNG VVANNGHAPA PMTRNVPHPP SATPGSSNNS HYVPRPPASS GRQMQNAPAA TTNSRPGGTY ARPGQSYSAP PSQSHSSQSH NVPRMNGPAQ QSSRSYAAPP SRSYNSPNYG RSYSPSPSYG RSYSPSQGYG RAPSYSAPHN GAPSAQQHYS APHYSAPSAP HYSSPSYGGG HASAPSYHGG GSYGGGGGHA SSGGGGHSSG GGGGSHGGHH
|
| |