Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_4346 |
Symbol | |
ID | 4071764 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 5154984 |
End bp | 5156651 |
Gene Length | 1668 bp |
Protein Length | 555 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637986379 |
Product | hypothetical protein |
Protein accession | YP_593420 |
Protein GI | 94971372 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.372687 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATTCAG ACTGCAAGTT TTATGTGTTT CGTGAAGGCC GCAGGACTGT TCCGGGCGAG CAGTTACTCA CCGGGTTGCG CGGGAGTTTG TTCCGGGCGA AAGACGAAGA CTCGTGGACC GATGCGATGC TCCGGTGCGG CGAGCTCGAA TGCGCCCTGG AAGATGCGGG CGATCCTCAG GCGCGGCATG TGGCGGCGGT GAGCAACGCG TGCGCCGATA AGCTCGTGCA CCGAGATTAT TTCGGAGGCC GCGAACTTGG CGGCTGGTTG CCGGTGCGGC TGGAAGGCGA GGTCACGGTC GCGACGCCGG AGGGGTTCGC GTATTACGCG CTGCATCCGC GACAGTATGC GGATGTGGCG GCGAAGTTCG GCGTCTTTCG GCATGGCGCG AAGACGCACG AGGCCGCGCC CGAAGTGGTG GTGATTGGGA TACGCAGCAT CGGGACGACG CTGAGCGCGA TGACGACGGC GGCGTTGCGA TTGCGGGGGC TGCACGTGGA GCGATTCACG GTGCGGCCAG CGGGGCATCC GTTCGACCGT AAAGTGAAGT TCAATCCGGC ACAATCGCAC ACCATCCGGA CGGCGCGATT GCAGGATGCA TTGTTCGTGA TCGTGGATGA AGGACCGGGG TTGAGCGGGT CATCGTTTTT GTCGGTGGCG GAGGCGCTAG TGGCGGAGCG TGTGCCGTCA GGACAGATTG TGCTGGTGCC GAATCATGCG CCGCATTTGC CGTGGTTGCG GGCCAACAAT GCGGCGGTGC GATGGCAGCG CTATCGCACG GTGACGCCGG CCGCGGGACG GTCTCCGGAG GGGGAGTGGA TAGGAGGAGG CGAGTGGCGG AAAAAAACGT TCGCGAATGA GAGTGAGTGG CCGGGCGTGT GGACGAGCAT GGAGCGTGCG AAGTTCGCGG ATGATCGCGT GCTGTGGAAG TTTGAAGGCA TCGGGCCGTA TGGTGCGCGT GCACGCAGTA CTGCCCGTGC GCTGGCGGAT GCTGGCTTCG CGCCGAACGC GGTGCGGGAT GAACATGGCT ATGTGGGTTA TGACCTCATA CGCGGCCGCG CGGCGAAGGC ACAAGACCTG AGTGATGAGA GATTGAAACG GATCGCGGAG TACTGCGCGT TTCGAAGCGA AGAGTGCAAG ACCGAAGTGA CCGAGGCGCA GCAGAAAGAT CTTGCGACGA TGCTGCGGGT GAACTACGAG CGCGGCTTCG ACCGGAAGCT CGCGCCGCAA TTCAGGAATC TGCCGGTGGA GCGGCCGACG GTTTGCGACG GCAAAATGTC GCCGCACGAG TGGCTGCTGA CGGAAGATGG ACGCATGCTG AAGCTGGATG CGACCTCGCA CGGCGACGAT CATTTCTTCC CCGGCCCGTG CGACGTGGCG TGGGACCTGG CGGGCGCGAT CGTGGAGTGG GGGATGGACC GTGCGACGGG CGAGCAATTC CTGCGCCAGT ACACGGCGCT GACCGGCGAC AACGTGACCG GGCGGATGAG GAATTACCTG CTGGCGTATG CGATGTTTCG CATGGCATGG ACGCACATGG CCGCTGCGGC GATGAAGGGC ACGGCCGAGG CGACACGGCT GATGCGAGAT TCAGATCACT ATCGCGAGTA CGTGTCGGGG CTGGTTATGG GAGCCGCAAA AGCGGTTCCA GTGGCGCGTG CTTCGTAA
|
Protein sequence | MNSDCKFYVF REGRRTVPGE QLLTGLRGSL FRAKDEDSWT DAMLRCGELE CALEDAGDPQ ARHVAAVSNA CADKLVHRDY FGGRELGGWL PVRLEGEVTV ATPEGFAYYA LHPRQYADVA AKFGVFRHGA KTHEAAPEVV VIGIRSIGTT LSAMTTAALR LRGLHVERFT VRPAGHPFDR KVKFNPAQSH TIRTARLQDA LFVIVDEGPG LSGSSFLSVA EALVAERVPS GQIVLVPNHA PHLPWLRANN AAVRWQRYRT VTPAAGRSPE GEWIGGGEWR KKTFANESEW PGVWTSMERA KFADDRVLWK FEGIGPYGAR ARSTARALAD AGFAPNAVRD EHGYVGYDLI RGRAAKAQDL SDERLKRIAE YCAFRSEECK TEVTEAQQKD LATMLRVNYE RGFDRKLAPQ FRNLPVERPT VCDGKMSPHE WLLTEDGRML KLDATSHGDD HFFPGPCDVA WDLAGAIVEW GMDRATGEQF LRQYTALTGD NVTGRMRNYL LAYAMFRMAW THMAAAAMKG TAEATRLMRD SDHYREYVSG LVMGAAKAVP VARAS
|
| |