Gene Information |
Locus tag | Acid345_0893 |
Symbol | |
ID | 4069143 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 1112965 |
End bp | 1114134 |
Gene Length | 1170 bp |
Protein Length | 389 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 637982900 |
Product | glycosyl transferase, group 1 |
Protein accession | YP_589970 |
Protein GI | 94967922 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
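The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that consistency check, assuming the start and end positions are 1-based and inclusive (an assumption that matches the listed gene length) and that the CDS is unspliced and ends in a single stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Acid345_0893
length = gene_length(1112965, 1114134)
print(length, protein_length(length))  # prints "1170 389"
```

Both results agree with the Gene Length and Protein Length fields in the record.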
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.021774 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCGTCG TAATATCCGC ACTTTGGGGC GCGACTCAGC CCTCTGGCAT ATGCCGGACG GTCGATGGGC TTGTCCGTGG AATAAAGGAA ATTGCTTCGG ATGTCGAACT GGCGATCGTC GTCGGCAAGT GGCAGCGCGG TTATTTCGAG GATTGCTTCG AAATCAATTC GTCAAATGTG CGGCTCGTTG ACGTAGATAT TCGAAACAAC TCTGTCAGTC GCAACTTGTG GTACGTGTTC GGACTGCCGC GGCTCGCGAG GAGAGTCTCC GCCGACATCG TACACATGGC GTTCCCGGCT CCAGTGATTC GCTCGGCTTT TCATTGTCCA ATCGTTACCA CCCTGCACGA TCTTTACCCG TACGACAGTC CGAGCAATTT TGGTTACCCG CACGTGTTTG CGAATCGAAT GGCGTTGAGA CGAGCGATCA GCGCCGCCGA TCGCGTCATT TGCGTTAGCG ACTTTACACT GTCGAGATTT CGTGAGCGAT TTCCGGTGCC AGCTGCACAC AAGGGCGTCC ATATTGCCAA CGCTATTGTG GCGAGCACTT CAGCGGAAGC GGGACAGTCG CAGATCGATG GTCCTCTGTT CCTTGCTGTG GCACAGCATC GAGCTAACAA GAATCTAAAT TTATTGCTTC GCGCGTTCAA CTTTTATCGA TCGCTCACCG ATGCCAAGGC GGGTCTGAAA CTGGTAATCG TGGGGATGGA TGGCCCTGAA ACCACGCGCT TACATCACCT TGTTGATCGA CTCAGTTTAC ATGAGACCGT TGTGTTCCTC GCAGGTCTCA CGGAGGGCGA ACTGGCGGCC CTCTATAGGG ATTGCGAGTT GTTTGTCACA CTCTCCAGCG TTGAAGGATT TGGATTACCA CTAAGAGAAG CGATCGAGTC TGGATCGCGA GTTGTTGCTT CCGACATCCC GGCGCACGGG GATGTTGAGC GTGGCCGCTG TGAATTCGTT GCTCTCGGTG GGGACGACGA GGTTTCGCGA GTTGTTGCGG CGTTTCAACA GGCCTTGAGC TCTCCGAGAA GGTTTTCGAC TAGTTCCCGT CAACAATCGC CTTCCCGAAC CGCAAGCAAG TATCTCGATC TCTATAGCGC AGCGGTACGC GGGCAGGCAA CGAAGGGAAA GCATCCGTCC AACGAGTTAT CGACCTTGGA GCATCGATGA
|
Protein sequence | MRVVISALWG ATQPSGICRT VDGLVRGIKE IASDVELAIV VGKWQRGYFE DCFEINSSNV RLVDVDIRNN SVSRNLWYVF GLPRLARRVS ADIVHMAFPA PVIRSAFHCP IVTTLHDLYP YDSPSNFGYP HVFANRMALR RAISAADRVI CVSDFTLSRF RERFPVPAAH KGVHIANAIV ASTSAEAGQS QIDGPLFLAV AQHRANKNLN LLLRAFNFYR SLTDAKAGLK LVIVGMDGPE TTRLHHLVDR LSLHETVVFL AGLTEGELAA LYRDCELFVT LSSVEGFGLP LREAIESGSR VVASDIPAHG DVERGRCEFV ALGGDDEVSR VVAAFQQALS SPRRFSTSSR QQSPSRTASK YLDLYSAAVR GQATKGKHPS NELSTLEHR
|
| |
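The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The sketch below shows one way to do that and to run basic sanity checks on a coding sequence. The helper names are hypothetical, the start-codon set lists only the common bacterial starts (translation table 11 permits others), and the short input is an illustrative toy sequence rather than the full gene.

```python
# Stop codons under translation table 11
STOP_CODONS = {"TAA", "TAG", "TGA"}
# Common bacterial start codons; an assumption, table 11 allows a few more
COMMON_START_CODONS = {"ATG", "GTG", "TTG"}


def clean_sequence(blocked: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """True if the length is a multiple of 3 with a start and a stop codon."""
    return (
        len(seq) % 3 == 0
        and seq[:3] in COMMON_START_CODONS
        and seq[-3:] in STOP_CODONS
    )


# Toy input in the same blocked layout as the record (not the real gene)
toy = clean_sequence("ATGGCGCATC GATGA")
print(toy, looks_like_complete_cds(toy), round(gc_fraction(toy), 3))
```

Applying `clean_sequence` to the Gene sequence field and then `gc_fraction` would give a value to compare against the GC content field, which the record reports rounded to a whole percent.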