Gene Information |
Locus tag | Acid345_2589 |
Symbol | |
ID | 4070552 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 3058112 |
End bp | 3059302 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637984606 |
Product | hypothetical protein |
Protein accession | YP_591664 |
Protein GI | 94969616 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
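The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (the record does not state this, but it agrees with the listed gene length) and that the final codon of this unspliced CDS is a stop codon. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 3058112
end_bp = 3059302
gene_length_bp = 1191
protein_length_aa = 396

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS is unspliced (per the record) and ends with one stop codon,
# so the protein has one residue fewer than the number of codons.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # coordinates vs. Gene Length
print(span_bp % 3 == 0)                          # CDS length is a whole number of codons
print(expected_protein_aa == protein_length_aa)  # codons minus stop vs. Protein Length
```

Under these assumptions all three checks print True for this record.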
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGCTG CCCGTCTCCC GGCCCCGTCA GTCACGGAAT TCGATTCGCC AATCCTCGCC CTTCACCATT GGTTACGTGA GCGAAACTAT GCCGGACACG AACCCTATGA TCTGCTGAAT TCGCCGCTGC TTCGCAAGTG GGCTGTGCAT CAACCTTTCG CCACTCTCTT CATTCAGGGC GGCAAACGGA TCGGCGGCGT TCACCTCCGC CAGTGGCTCC ACGTTCCACC CAGTCATAAT CCCAAAGCTC TCGCACTAGT ATTGAGCGCA TTCTGCGATC TCGCGCGCTC GGGTTGGTTC TCCCGTCGCC ACGCGGAACA TGTCCGGAAC TTGCTGCTTG AACTCCGCAG TCCGCACGAA TCCGACTTCT GCTGGGGATA CGACTGGCAT TACGTTTCAT TGCGCGGCGC TCGCATGCCG GCGTTCTCGC CGAACTCCGT CGTCACCGTC TTCTGCGCCC ACGCTCTCCT CGACTTCGCC AACATCTACC AGGACGAAGA ATCAAAAGCG ATCGCACATT CCGCGACAAA CTGGCTCGCA ACCCGATTGA ATCGTTCTAC CGACACCGAT ACTGGCCTCT GCCTCAGCTA CACGCCCAAC GACCATACCC GGATTTTCAA CAACAGCGCG CTCGCAGGTG CGTTGTTCGC GAGGATCGCG AGCGACTCAC GACTGCCCCA GTACGGAAGT CTGGCTCGCC GTATCATGGA ATACCTAGGC AACGGCCAGG CGAAAGACGG ATCCTGGACC TACGGCGTCG CGCGCTCACA ACAGTGGATT GACACCTTCC ACACCGGATA CAACCTTTGT GCGCTGCTCG AATACCAGCA ACTCACCGGC GATACCAGCT TTTCGCAAGC CCTCGCCCGC GGTTATGACT TTTATTGTTC CCACTTCTTC TGTCCGGACG GCGCGCCGCG CTACTTCCAT AACCGCACTT ACCCAATTGA TATCCATTCC TGCTCGCAGG CGATCCTGAC CCTCTGTGCC TTCGCTGAGC TTGACCCCGA TGCCCTCTCA CGCGCCGAGC AAATCGCGCG CTGGACCATC CAGCACCTCC GCAACTCCGA CGGCTCTTTC GGCTACCAGA TTCATCCTCA TCGGGTTGAC CGCACTCCTT ACATCCGCTG GTCGCAAGCC TGGATGCTTC GCGCGCTCGC CCGCCTGCGC CTGACAATCG GAGGCGAATA A
|
Protein sequence | MNAARLPAPS VTEFDSPILA LHHWLRERNY AGHEPYDLLN SPLLRKWAVH QPFATLFIQG GKRIGGVHLR QWLHVPPSHN PKALALVLSA FCDLARSGWF SRRHAEHVRN LLLELRSPHE SDFCWGYDWH YVSLRGARMP AFSPNSVVTV FCAHALLDFA NIYQDEESKA IAHSATNWLA TRLNRSTDTD TGLCLSYTPN DHTRIFNNSA LAGALFARIA SDSRLPQYGS LARRIMEYLG NGQAKDGSWT YGVARSQQWI DTFHTGYNLC ALLEYQQLTG DTSFSQALAR GYDFYCSHFF CPDGAPRYFH NRTYPIDIHS CSQAILTLCA FAELDPDALS RAEQIARWTI QHLRNSDGSF GYQIHPHRVD RTPYIRWSQA WMLRALARLR LTIGGE
|
| |
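The gene and protein sequences above can be spot-checked against each other by translating the two ends of the gene sequence. This is a minimal sketch, not the database's own pipeline: it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two tables differ only in permitted start codons), and the helper names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon-to-amino-acid assignments in TCAG order, shared by translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0; spaces are ignored, '*' marks a stop."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 30 and last 21 nucleotides, copied from the Gene sequence field.
first_30 = "ATGAACGCTG CCCGTCTCCC GGCCCCGTCA"
last_21 = "CTGACAATCG GAGGCGAATA A"

print(translate(first_30))  # compare with the start of the Protein sequence field
print(translate(last_21))   # compare with the end of the Protein sequence field
```

The first call yields MNAARLPAPS and the second yields LTIGGE followed by a stop marker, which matches the first and last residues listed in the Protein sequence field.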