Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_3826 |
Symbol | |
ID | 4071110 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 4522327 |
End bp | 4523466 |
Gene Length | 1140 bp |
Protein Length | 379 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 637985849 |
Product | glycosyl transferase, group 1 |
Protein accession | YP_592900 |
Protein GI | 94970852 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.107341 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.560245 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGACTT TCAGAGTAGG TCTGAATCTC ATTTATCTGC AACCGGGCCG GCTTGGTGGG ACGGAAGTAT ATGCTCGCGA ATTGCTGCAA GAGATCGAGG AGCAGAACCA AGAATTTGAT TTCGTTCTGT TTCTGAGTCC TGAATCGTAT GAGACCGTAA ATTATGTTTC TTCGCGATTT CGCAAGGTAC GGGTGCCGAT TTCGTCACAA TCTCCGTCGA AGCGGCACTT GCTCGAGCAG ACGATCCTGC CTCGGTTAAT TGCCCGAGAG GGAATCGACC TGCTGCACTC GATGGGGTAT GTGTGTCCGC TGTTGGCTGA GTGCAAACAG GTTGTGACCG TGCACGACAT GCTGTATGAG GTCCATCCAG AGTACTTGTC GAAACTGAAG TTGTTATTTT GGCGTTTTTT TGTTCCGCGA TCGGTTCGCC GTTCGGTACG CACGATCGCG GTCTCGCAAA ATGCAAAAGA GGACATCGTC AAGTACTGCG GAGTGGATTC TGCCAGGGTA GTCGCCATCC ATTCAGGAAT ACGGTTTCAA CCGCCTGCGG ACGAGGCGCA AGTGACGGCA ACGTTGGACA AGTTTGGGAT TAAGCGGCCA TTCGTGTTGG CTGTAGGCTG CGGCCGTCAC AAGAGGGTGG ATTTGATTGA GGAGGCTTGC AGACAACTGG ACGTCCAGTT GGTAGTCACT GGCCTGCCTG AGAGCAGAGT AGTTCCACAT AGGACGGAGC GAACGTTCTA CGCAGGATTC GTTTCGGCTG AGGATCTGCG CGCGCTTTAT GCCGCAGCAG AGGTCTATGC AACCGCATCG AGCATGGAAG GGTTCGGCCT GACACTGCTG GAGTCAATGA TGCAAAACAC TCCGGTGATC AGCTCTGCCG CGGGCTCGCT TCGAGAGGTT GGAGGCGACG CTATTCTTGC TATTGAGACC CCGACTTCCG CTGCACTGGC GAAAGCGATT TCCGAAGTTA TGTTGGATCG GCAACTGCGC GATCGACTCG TGAATGCGGG GAAGCAGCGA CTGGGACAGT TCACTTGGAA AGAGTCGGCC CGTCGTCATC TCGATGTTTA TCGAGAGGTT CTGTCGGGTA GCGCCGAACG CCCAATGGCG CCGCCCAGCA GGGTCGGAGT TGGAGGCTGA
|
Protein sequence | MATFRVGLNL IYLQPGRLGG TEVYARELLQ EIEEQNQEFD FVLFLSPESY ETVNYVSSRF RKVRVPISSQ SPSKRHLLEQ TILPRLIARE GIDLLHSMGY VCPLLAECKQ VVTVHDMLYE VHPEYLSKLK LLFWRFFVPR SVRRSVRTIA VSQNAKEDIV KYCGVDSARV VAIHSGIRFQ PPADEAQVTA TLDKFGIKRP FVLAVGCGRH KRVDLIEEAC RQLDVQLVVT GLPESRVVPH RTERTFYAGF VSAEDLRALY AAAEVYATAS SMEGFGLTLL ESMMQNTPVI SSAAGSLREV GGDAILAIET PTSAALAKAI SEVMLDRQLR DRLVNAGKQR LGQFTWKESA RRHLDVYREV LSGSAERPMA PPSRVGVGG
|
| |