Gene Information |
Locus tag | Acid345_4612 |
Symbol | |
ID | 4070769 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 5464902 |
End bp | 5466215 |
Gene Length | 1314 bp |
Protein Length | 437 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637986652 |
Product | hypothetical protein |
Protein accession | YP_593686 |
Protein GI | 94971638 |
COG category | [R] General function prediction only |
COG ID | [COG2244] Membrane protein involved in the export of O-antigen and teichoic acid |
TIGRFAM ID | |
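The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end coordinates are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 5464902
end_bp = 5466215
protein_length_aa = 437

# Assumption: coordinates are inclusive at both ends, so add 1.
gene_length_bp = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons;
# the last codon is assumed to be the stop codon.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print(gene_length_bp, remainder, expected_protein_length)  # 1314 0 437
```

Under these assumptions the computed values agree with the Gene Length (1314 bp) and Protein Length (437 aa) fields.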
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.439374 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAGCACAA GCCCTGAGAC CTTGCCCGGC CTCCCAGCCC GCCAATCCCG CGGGCTGGCG CAGTCTCTGC GCATTGCGCC GGCGCACTCG CGGCTTGTGG GCGGCAGTCT CATCATGCTC GGCGGCATGG TGCTCGTCAG CCTTCTGAAC TTCGGTTACA ACATCGCCGT TGCCCGCATG CTCGGCGCCG CCGAATTCAG CCAGGCAGCA GCGGCGGTCA CCCTGCTGAT GATTGTTTCC TGTCTCACAC TGGCTTTCCA GATGGTCTGC GCCAAGTTCG TGGCCAGGAA CGCAACCAAC TCGGAGAAAT CGCACGTCTA TCGCGCGCTG TTGCGCCGTG CCTGGACTGC CGGCCTCAGC ATTGGCATTG TCCTTACGAT CTTCAACCGC CAGGTCGCTG CGTGGCTCAA CATGCCCTCC GCGACGCTCG TTATCGTCCT CGCGCTCGGC ATGGCTTTCT ACGTTCCTCT CGGCGTGCGA CGCGGCGGCA TGCAGGGTGT TTATCAATTC CGCCGGCTGA GCCTCAATTT CATCATCGAG ACCAGCGTCA AGCTCGTCTC CGCAATCGTC TTAGTCCACT TGGGTTACGG AATTCTCGGC GCCGTCGCCG CCATCTCCAT CTCAGTGGTG GCTGCCTACT TCCTTCCTCC CACTCCAATT GCCTTACGTG AGCAGCCGAA AGCAGGGCTG CCGGCATCTT TTGGGGAAGG CATACAGGCG ATCATCTTCT TCATTGGACA GGTGATCATC AACAACATCG ACATCCTGAT GGTGAAGCAT TTCTTCCGAC CCGATGTCGC CGGTCTGTAC GCCGCAGTTG CTTTGGTCGG ACGCGTTCTT TACATCGCGT CCTGGCAAGT GATCAGCGCT ATGTTTCCGA TTGCCGCCGC AGGCCGCTCC GAATCCGAAG GCCGTGAAAG CCGAATGGTC GTGCTCATTC CATTCGGCTT CGTCACCGCG ATGACCGTGG TCTTCATGGC GATTCTCGGT CTCTTCCCGC AAACGATCCT GCACTTGCTC TTCGGCGCGA AGTTCAACAC TGACTCCAGC AACCTGCTTC TTCTCTACGC CGCCGCTACC GGCGGTTACG CACTCAGCGT GGTTCTGATG GCCTACGAGA TGTCGCGCCG CATCGCCAAC ACCGGCTGGT TCCAGCTCGT CATCAGCGGA CTCGTCGTCC TCGGCATCAC CATGTTCCAT AACACGCTCC GCGACGTCAT CGTGGTGCAG CAGGTCCTGA TGGTCGTCCT ATTTACCGCC GTAGCCGTGC CGTTTGTTCT CGCGCGGCGC TTCCGAACCC GGGGGGCAGC ATGA
Protein sequence | MSTSPETLPG LPARQSRGLA QSLRIAPAHS RLVGGSLIML GGMVLVSLLN FGYNIAVARM LGAAEFSQAA AAVTLLMIVS CLTLAFQMVC AKFVARNATN SEKSHVYRAL LRRAWTAGLS IGIVLTIFNR QVAAWLNMPS ATLVIVLALG MAFYVPLGVR RGGMQGVYQF RRLSLNFIIE TSVKLVSAIV LVHLGYGILG AVAAISISVV AAYFLPPTPI ALREQPKAGL PASFGEGIQA IIFFIGQVII NNIDILMVKH FFRPDVAGLY AAVALVGRVL YIASWQVISA MFPIAAAGRS ESEGRESRMV VLIPFGFVTA MTVVFMAILG LFPQTILHLL FGAKFNTDSS NLLLLYAAAT GGYALSVVLM AYEMSRRIAN TGWFQLVISG LVVLGITMFH NTLRDVIVVQ QVLMVVLFTA VAVPFVLARR FRTRGAA
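The sequences above are displayed in space-separated blocks of 10 characters, so they need cleaning before any analysis. The following is a minimal sketch of how one might do that and then compute GC content or check that a gene sequence ends in a stop codon. The function names are hypothetical helpers rather than part of any database tool, and the stop codon set (TAA, TAG, TGA) is the one used by translation table 11.

```python
def clean_sequence(raw: str) -> str:
    """Join the whitespace-separated blocks into one uppercase string."""
    return "".join(raw.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


def ends_with_stop_codon(dna: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop.

    Stop codons are those of translation table 11: TAA, TAG, TGA.
    """
    return len(dna) % 3 == 0 and dna[-3:] in {"TAA", "TAG", "TGA"}


# Demonstration on the first two blocks of the Gene sequence field only;
# paste the full field to analyse the whole gene.
head = clean_sequence("ATGAGCACAA GCCCTGAGAC")
print(len(head), gc_fraction(head))
```

Applied to the full Gene sequence field, `gc_fraction` would be expected to land near the 60% listed under GC content, although that figure is taken from the record and was not recomputed here.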