Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_1858 |
Symbol | |
ID | 4069200 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 2234570 |
End bp | 2235526 |
Gene Length | 957 bp |
Protein Length | 318 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637983867 |
Product | hypothetical protein |
Protein accession | YP_590933 |
Protein GI | 94968885 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG5653] Protein involved in cellulose biosynthesis (CelD) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.293436 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCGGATCA CGATCGCGAC ATCCCCAGCC GAGATCGAGC GCCTACGCCC TGCGTGGGAG CGGTTGCACG ACCGCGAACG CCAGAGCATC TTCCAGGACT ACACCCTCAA CTGTCTTGCC GCGACCCATT TCGCGGAGCG CGAATCGCCT TACATCATGA TGGCGGAGAG CGAATCGTCG GTGGCGATCG TACCGGCGGC GGTGCGTAAG CACGATGGGT CAATCACGTT GCTCGGCGAG ACGTTGTTCG ACTATCGCGA TGTGCTTTGC GGCGGTACCA ATGAGGCGCT GGAAGCGGCG TGGGCGGAGA TCGCCAAGCT GCAGCGTCGG CTCTCGTTCT TCGCGGTGCA TCCTGATGCG CAAGCGCGCT GGCAGATGAT TCCGATGCAT GACTTCGCCA ATGCTCCGCG GGTGCGCGCG GCGGATTGCG ACTCCGATGA GTTCCGTGCC TCGCATAACA AGCTCGGCAT GTTTTACCGG CGGATGATCA AGCGCGGCGC GCACCTGTTC ACGCACGCGG GAGACAATTG TGCGCTCATC CGAACGATTT ATGAGCGCAA GGCTTCACAG TTCCCGCACG AGACGAACAA CATTTTTCTC GATCCGCATC GGCGCGAATT CATGGAGGTT GCGTGCGCTG CACTTGGGTC GCGTTGCGAG ATTTTTTCTC TCGAGATCGG GACCGAGTTG ATCGCAGCGC TGGTGACGCT ACGCGATCAC ACGGTGCGGC GCTTCTACAC CGTCTACTTC CACCCGGCCT GGTCGAAATT TTCGCCGGGC GTGGTGCTCA TCTACGAGGT CACCGCGCGT TCCCTTACCG AGGGACTCGA CTGCGATTAC CTGACCGGCG AGTATGGCTA CAAGAACCGG CTGGCCACGG CGATGGTGCC GCTGCGGCGA GTAGAAGCGT CGGCCGAAGA ACTGGCGGCA ATCGCGGCGC GGAAGAAGGC GGCTTAA
|
Protein sequence | MRITIATSPA EIERLRPAWE RLHDRERQSI FQDYTLNCLA ATHFAERESP YIMMAESESS VAIVPAAVRK HDGSITLLGE TLFDYRDVLC GGTNEALEAA WAEIAKLQRR LSFFAVHPDA QARWQMIPMH DFANAPRVRA ADCDSDEFRA SHNKLGMFYR RMIKRGAHLF THAGDNCALI RTIYERKASQ FPHETNNIFL DPHRREFMEV ACAALGSRCE IFSLEIGTEL IAALVTLRDH TVRRFYTVYF HPAWSKFSPG VVLIYEVTAR SLTEGLDCDY LTGEYGYKNR LATAMVPLRR VEASAEELAA IAARKKAA
|
| |