Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_4012 |
Symbol | |
ID | 4071148 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 4739525 |
End bp | 4741330 |
Gene Length | 1806 bp |
Protein Length | 601 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637986039 |
Product | hypothetical protein |
Protein accession | YP_593086 |
Protein GI | 94971038 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR03436] VWFA-related Acidobacterial domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.679014 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.800681 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCCACC GATGTCTCGT TTGGTTTAGT TCCCTCCTCG TCATTCCTTT CGCTTATTCC CAAACTGCTG CGCCCAATGC GAGCCCTGCG CCCACGTTCG AATCCAAAGT GCGTGCCGTG CTGGTGGACG TGGTCGTAAT TGGCGGGAAG GGGGAGCCAG TTCTCGGCTT GCACAAGCAG GACTTTCGGG TGACGGAAGA CGGCAAGCCG CAGACCATTT CCTCGTTCGA GGAGCATACG GGCGTACCGC CGACTGAGGT CAAACTGCCC CCGATGCCGC CCGGGGTGTT TACAAACTTT CCTGCGCTGC TGAAGGCCGA TACGGTGAAT GTGCTGCTCG TGGACGCGCT GAACACGCAG ACGCAAGACC AGTCCTTCAG TCGTTCGCAG ATGATCAAGT ACTTGAAGAC GATTCCGCCG GGCGCACACA TCGCGATTTT TACGTTGACG TCGCAACTGC GGATGCTTCA GGAATTCACG ACCGATTCTT CGGTATTGTT GGCGGCGCTG AATGACCCTG CGGTCGCTGG TCCCCATCAG TCGCCGTTAC TCCAGTCCCA AGTGGAAAAG GACGCTTACG AGCGGCTGGG AGCAATGGTC GCCCTCGCAC CGAGCGCACC GATGCAGAAC TTGGCCAAGG AGGCGGTCAA CCCGGCGCTC GCAGTGAAGC AGATGCTGGA GGAAACCGTT GTGCGGATCA CCGAGTCGCG GGTCCAGATC ACGCTGCGAG CAATGCAGCA ACTGGGCCGC TATCTCGGGA GCTTTCCGGG CCGCAAAAAT GTGATCTGGA TCTCCGGCTC GTTTCCCATC AACTTTATGG CGGATCCCAG TTTGCCGGAT CCGAACGCCG TGGTACGGGG ATTTCAAGGC GAAATTCAAA GGACGGCTGA CCTTCTTACC GCGGCGCAGG TGGCAGTTTA TCCGGTCGGG GCGGCAGGCC TGAGAGTGGA CGCGCTCTAC CAAGCCAACG CAAAAGAGAT TGGGTTTTAC AGCACGGGCG GGTTCGTTCA GGACCAGGTG CAAGGGCTGC ATGCGGGAAT TGACGAGCGG GCTGGCAACG ATCTGACGAT GGAGGAGATG GCCAAGGACA CCGGAGGTCA GGCTTTCTAC AACAGCAACG GGATTAACGA TGTTCTGACC CGTATTACGA ACAACGGTAT GCGCTACTAC GAGATCAGCT ATACGTCGAC CAACACGAAG GTGGACGGGA GTTACCGGCA TATCTCCGTG GAGCTGCTCA AAGGAAAGCA CAAGCTCTCT TATCGTCGCG GATACTACGC GCTGGATGCT GCGGCCGTTC GGCAATCGGA ACTTGAGGCC GCACCCGATC CTTTGCTGCC CCTGGTGGGA TTCGCCGTGC CTGATGTTGC GCAGATCCTC TATAAGCTGC GCGTGTTGCC GTCGAGCCCG CAACCGGCAG TTGATGCTAC CCCTGCGGGG AGCAACCGCG ACTTGAAAGG GCCAGTGACA CGCTACGACG TTGACTTTGC CGTCGCGCCG GATGACCTCA AGTACGACAT TGGTCCGGAC GGTACTCGGC ACGGCGACGT CGAAGTGAAA CTTGTCGCAT ATGATTCCAG CGGGAAGCCC GTGAACATGG TGAGTGGGAG GAAGGCGATG TCCCTGGATC CGCAAACGTA CGCTACTTTG CAGAAGGTGG GGCTTCAGAT CCACGAACAG ATCGATGTTC CGAGCAAGGG CGATTTTCAC CTTCGCACAG GCATTTACGA TTTGAAGTCG AGCAACGCCG GAACCCTCGG AATCAAAATG AAAGATGTGG CCGCGTCGCA GCAAGCGACG AAGTAG
|
Protein sequence | MSHRCLVWFS SLLVIPFAYS QTAAPNASPA PTFESKVRAV LVDVVVIGGK GEPVLGLHKQ DFRVTEDGKP QTISSFEEHT GVPPTEVKLP PMPPGVFTNF PALLKADTVN VLLVDALNTQ TQDQSFSRSQ MIKYLKTIPP GAHIAIFTLT SQLRMLQEFT TDSSVLLAAL NDPAVAGPHQ SPLLQSQVEK DAYERLGAMV ALAPSAPMQN LAKEAVNPAL AVKQMLEETV VRITESRVQI TLRAMQQLGR YLGSFPGRKN VIWISGSFPI NFMADPSLPD PNAVVRGFQG EIQRTADLLT AAQVAVYPVG AAGLRVDALY QANAKEIGFY STGGFVQDQV QGLHAGIDER AGNDLTMEEM AKDTGGQAFY NSNGINDVLT RITNNGMRYY EISYTSTNTK VDGSYRHISV ELLKGKHKLS YRRGYYALDA AAVRQSELEA APDPLLPLVG FAVPDVAQIL YKLRVLPSSP QPAVDATPAG SNRDLKGPVT RYDVDFAVAP DDLKYDIGPD GTRHGDVEVK LVAYDSSGKP VNMVSGRKAM SLDPQTYATL QKVGLQIHEQ IDVPSKGDFH LRTGIYDLKS SNAGTLGIKM KDVAASQQAT K
|
| |