Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_1354 |
Symbol | |
ID | 4070892 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 1642696 |
End bp | 1644639 |
Gene Length | 1944 bp |
Protein Length | 647 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637983363 |
Product | hypothetical protein |
Protein accession | YP_590430 |
Protein GI | 94968382 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAATTCG TGTGCGCAGT CGTCGTACTG CTTGCCATTA CCTTGGTTGG TTTTTCGCAA ACGCAGCCTG CGACGGAAGT GGTTGTACCG CATCTCATTC GTTTTGCCGG TACCGTAAAA GGCGCTACCG GCCGGGTTGC CATCACCTTC AGCTTGCATA AGAGCGATCG CGACGATGCC GCACTGTGGA CTGAAACACA GAACGTGCAG CTTGAAGACG GCAAGTACAC CATCCTGCTT GGCGCAACTA AAGCCGAGGG ACTACCCTTC GACCTGTTTA CTTCCGGTGA AGCGCAATGG CTCTCGATTC GCGTGGAAGG CCGTCCTGAG CAGCGCGTAC TGCTGGTCAG CGTGCCTTAT GCGTTGAAAG CCGCCGAGGC CGAGACCCTC GCTGGACATA GCGCCAGCGA GTTCGTGACG ACCGAAAAAG TCACCAACCT GGTGCAACAG CAGTTGCAGC AGCAACAAAC TGCACCCACG AAATCCGCAA CAACGAAGAA AGACGCGGGC GCAAAAGGCA ACGTCTTAAC GAGTACTGCC ACGAACTTCA CTGACAACAC CGCAAACCAG GTTGTCCTCG TCACACAGAA AGGCGCCGGC AATGGCCTGG TGTCGAATTC GATTTCTGCC AACGGCGTTT CCGGAACGAC CGCGTCGAGT GCGGGGGTTG GTGTCTCTGG CGCGAACACT GCGGCAACGG GCCTCGCGAT TGGTGTGCGT GGATCAACGG TTGCTGATAG CGGCATCTCC GTATACGGCA CAGCGGGCGG AACTAGCGGC ACGGCGACTG GCGTGAAAGG AATTTCGGCG GCGCCGAATG GCTACGGCGT CTTCGGCCAG AACACTGCGA CCACCGGGCT GGCAATCGGC TTCCGTGGCA CAACGGCTTC CACCAGCGGC ATCGGCATCT ATGGCACTTC GACGGCGACA ACAGGCAGCA CTGTCGGCGT ACGTGCGTCT GTGGCGAGCG TCAGCGGCAC CTCTGCCATC CTGCAAAACA CCGCGGGCGG AAAATTATTG AGCGGCCTCT CTGGCAGCGG ATCGAGCGAA GTCTTCAGCG TGCTTGGCAA CGGGAACCTC ACCGCCGGCG CAGCGAGTTT CAACGGGCCC GTCACCTTCG CTTCCGGCCA GACTTTCCCC GGAACGCTAC CCAATTTCGG CGTGCAAACC ATGAATGGAG ACCTGATCCT GCAAGGTTTA TCCAGTCCTA CGCCGTTGAA CGCAGCCAGT GACAGTACCG CCGGAACCTA CTTCACGCTC TTCAACAACA GCGGCAGCGG AAGTGGCTGG CAGTTCGTCA CCACCGGCAC GGGCGCCTCG CAAGGAGCCG GACACTTGCT GTTCTACGGA GGCCCAAATC CCCAGTCGGT GATCATCCAG GCGCCTGTAA GCGCCGGCGA TGTCACATCG AGCTTCCTGC ATTCGAACAC TACGGTGCGC GCGGAAACCG GGCTCTCCCT CGGCGGCAAT GCAACTTTGA AAGTCGATGC GCCGGGCATT GTCGGCGGGC AGTTCGTGGT TCAGAACGGC ACCATGAGCA TTAATCAGGA CGTTCCCGTG AGCAGCAATT CGCGCATGGT GTTCACCGGT TACCTGTTCG GCGATACCGG CGACTCCGGC CTGCTCGGGT CGACTCTCGG TTACATCCAT CCGGAACGCG ACATCGTGGT CACCGGAATT TTTGGATCCA CCAACAACAA GGGAGTTGGG AACTGCGGCA ACGACGCGAT CATCACGCTC GAGCAACCAG GAAATCCAAG TACACCCAAG GTCAACCTGG ACATCATCGA GGGCATCCCG ACGTGGTCAA ACATGTTCCT GAGTGTCCCA TTCAACTCGG TTTACGATCT TCAAATCGTG CTGACGCAAA ATTCAGGAGG CTGTGCGCCG TTTTCGCATA CCACGAATAA CCCGGTGATC TCGGTGGTCT ACTACATGAA GTGA
|
Protein sequence | MKFVCAVVVL LAITLVGFSQ TQPATEVVVP HLIRFAGTVK GATGRVAITF SLHKSDRDDA ALWTETQNVQ LEDGKYTILL GATKAEGLPF DLFTSGEAQW LSIRVEGRPE QRVLLVSVPY ALKAAEAETL AGHSASEFVT TEKVTNLVQQ QLQQQQTAPT KSATTKKDAG AKGNVLTSTA TNFTDNTANQ VVLVTQKGAG NGLVSNSISA NGVSGTTASS AGVGVSGANT AATGLAIGVR GSTVADSGIS VYGTAGGTSG TATGVKGISA APNGYGVFGQ NTATTGLAIG FRGTTASTSG IGIYGTSTAT TGSTVGVRAS VASVSGTSAI LQNTAGGKLL SGLSGSGSSE VFSVLGNGNL TAGAASFNGP VTFASGQTFP GTLPNFGVQT MNGDLILQGL SSPTPLNAAS DSTAGTYFTL FNNSGSGSGW QFVTTGTGAS QGAGHLLFYG GPNPQSVIIQ APVSAGDVTS SFLHSNTTVR AETGLSLGGN ATLKVDAPGI VGGQFVVQNG TMSINQDVPV SSNSRMVFTG YLFGDTGDSG LLGSTLGYIH PERDIVVTGI FGSTNNKGVG NCGNDAIITL EQPGNPSTPK VNLDIIEGIP TWSNMFLSVP FNSVYDLQIV LTQNSGGCAP FSHTTNNPVI SVVYYMK
|
| |