Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_2085 |
Symbol | |
ID | 4069684 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 2499246 |
End bp | 2500286 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637984100 |
Product | glycosyl transferase, group 1 |
Protein accession | YP_591160 |
Protein GI | 94969112 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.057085 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCATTC TGTACGTCGC TTATCCGCTT CATCACGTGT CCGACGCCAG CGCCGGCGGC GCCGAGCAAA TGCTCTGGAC ACTCGAACGA GAAATGCACC TCCGCGGACA TGAAACGACA GTCGCTGCTT GTGCCGGTTC GCGCGTCAAC GGGCGGCTTT TCTCGACCGG TGATATCCCA ACTCAGTCCG ACACTTTTGA AGAGCGTAAT CGCGAGCACC ACGCCGCCAT TCGCAGCCTC CTTGCATCGG AATCTTTCGA TCTTATCCAC GACAAGAGCG GCTCCTTCTT CGCCAGCGCG GCGGATGTCG CCGTCCCGAT CCTCGCCACC GCACATCTTC CCCGCAGCTT TTACCCGGGC GTAAACTGGC ACGTGCTCGG CCACAACATC AACGTCAATT GCGTCTCCGC GACTCAGGCC CACACCTTCG CCGACGTGCC GAACCTCGTG GGATGGGTGC AGAACGGCAT CGCCATCGAC CGATTCAAGT TCCGCGAACA GAAGGACGAC TATCTCCTCT GGCTCGGGCG CATCTGCGAA GAGAAGGCCC CGCACCTCGC CATCGAAGCC GCCAAACGCA GCGGCAACCG ACTCATCCTT GCCGGCCAGG TTTATCCCTT CACCTATCAC GAAGCGTATT TTGCGCGCGA AATTCAGCCG CGCCTCGACG ATCAAATCAC ATTCATCGAC AGCCCCACTT TCGACGAAAA GCTCGACCTC CTCTCCCGCG CCTCCGCTCT CCTCATCCCG AGCCAGGTCG ACGAAACCAG CTCCCTCGTG GCGATGGAAG CTATGGCCTG TGGTACGCCG GTCATCACCT GGCGCCGCGG AGCCCTTCCG GAGATCGTTG CCGACGGCGT CACCGGCTAC ATCGTCGATT CCCTCGAAGC CATGGTCAGC GCTATTTCCG ACGTCAGCCG CATCCGCCCC GAGGCCTGCC GTGCCCGCGT GGAACAGCAC TTCTCGGCCA GCCGCATGGC CGCAGACTAC GCCGCGGTTT ACCAGCGGGT TCTCGGACGA AGCATTGGAG AAGCAGCCTG A
|
Protein sequence | MRILYVAYPL HHVSDASAGG AEQMLWTLER EMHLRGHETT VAACAGSRVN GRLFSTGDIP TQSDTFEERN REHHAAIRSL LASESFDLIH DKSGSFFASA ADVAVPILAT AHLPRSFYPG VNWHVLGHNI NVNCVSATQA HTFADVPNLV GWVQNGIAID RFKFREQKDD YLLWLGRICE EKAPHLAIEA AKRSGNRLIL AGQVYPFTYH EAYFAREIQP RLDDQITFID SPTFDEKLDL LSRASALLIP SQVDETSSLV AMEAMACGTP VITWRRGALP EIVADGVTGY IVDSLEAMVS AISDVSRIRP EACRARVEQH FSASRMAADY AAVYQRVLGR SIGEAA
|
| |