Gene Information |
Locus tag | Acid345_4212 |
Symbol | |
ID | 4072171 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 4988627 |
End bp | 4990300 |
Gene Length | 1674 bp |
Protein Length | 557 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637986243 |
Product | glycosyl transferase family protein |
Protein accession | YP_593286 |
Protein GI | 94971238 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family |
TIGRFAM ID | |
|
|
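The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that Gene Length counts the stop codon. Both are assumptions about the database's conventions, although they agree with the values listed. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 4988627
end_bp = 4990300

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS includes one terminal stop codon, which encodes no residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values match the listed Gene Length (1674 bp) and Protein Length (557 aa).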
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.872167 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.0154714 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGAAC GAGCACCTCA ACTGCGGCTG CTGCTTGAAG TCTTTCTGCT TGCCGCGCTC TGCTATTTCT TTTTCTTTTT TGGACTCTCT GCTTTCGGCT TAACCGGCGC CGACGAGCCT CGCTATGCGC AGGTCGCACG CGAGATGCTT CAGCACCACG ACTGGGTCAC GCCGACTCTC TACGGGAACG TCTGGCTGGA AAAGCCGATC CTCTACTACT GGGGCGCAAT CGTCAGCTAC AGGATCTTCG GCGTGAGCGA CTGGGCCGCA CGGATTCCTG GCGGAGTCTT CGCGAGCGCG ATGCTCCTAT TCCTCTACGC GTGGACGCGC CGCTTCCGCA ACGGCTCGCA ACTCGATGCG ATTGTGATGA CCGCGTCGTC CGTGTTTGTA TTCGCTTTTG CCCGCGCTGC TTCGATTGAC ATTCATCTGG TCGCTCCGCT GACGATCGGC ATGCTCGCGT GGTGGGCATT CTACGAAACG GGCCATCGCG GATGGCTGGC GTTGTTTTAC GCCATGATCG CGATCGGAGT GCTCGCGAAA GGACCAGTCT CAGCGGCGCT GGCGGCAATG GTCATTCTCG TCTTCGTGGC GATCCGCCGT GATTGGTCGG CCATCGTCCG CACCCTCTGG ATTCCCGGCA TCCTGATATT CTTTGCAATC GCGCTGCCCT GGTACGTCGC GGTGCAGCAT GCGAATCCGG GCTTCGTCCG CGAGTTTTTC ATCACCCATA ACCTGAGTCG TTTCACGACC AACCGTTTTC AGCACCGCCA GCATTTCTGG TACTACATCC CGGTCCTGAT CGGCGGCACC ATGCCGTGGA CGGTCTTCGT CATCGCTGCG CTTGCAGGCG GAATCAAGTC GCTGCGCGAT AAGAACGAAG ATCCGTTGCT CACCTATCTC GCGGTGTGGG TACTCATCCC GCTGATCTTC TTTTCGTTTT CGCAGTCGAA GCTGCCGGGC TACATCCTGC CTTCGATCGT TCCGTGCGGT CTGCTCGTCG CCATCTGGCT GCGCCGCGAG AACACCAAGC CGTCGCCGGT GCTTATTTCG GTGCATGCCT TGCTGTCCGG AGCTGTGCTC GCAGTTGCTC TGCTCGCGCC CTACAAGCTC TACAAAATGC CGCTGCCCGG CCAGGTGCTG CGCATCGCAA TTCCAGTAGG GTTGATCGTC GCGCTTGTGG TTGCGATTGT TGTTTTCCTG CGCGGATACG CGGCACTCCG CTTCGCTACG ATATTTCCCG TCGCGCTCGC ACTGGCGTTT CTGCTCAAAG CCGCCGGCCC TGCTATTGAT TCCACGCAAT CCATCCGGCC AGTTGCAGCG CGCATCACGA ACTCCTTTGC CACAAATGAG CCGGTAATGT TCTACAACGT CCCGCGCGGC GTTGAGTATG GACTGGCCTT TTACCTCGAT CGCCCCCTGC CCGAGCCGCC ACCGGACGAA ATCGTCAGGT TCGGCACGGC TGCTGCCGGC AACCAGCAGA GAGAAAAGAC TTTGAAAGAT ATCAGCAACT CCCTCCCGCC GAACCACGGG AATTATGTGC TGGTTACGCG CGCGGGAGCG ATCAACCGCT TTGCCGATAC CGTGCCTCCG AACTACCAGA TCGAGCCGTT CTTCCGCTTC CAACCGCAGC GCCTCGACGT GTATTTCTTG CGCGATATCG GCCCTGGACG CTGA
|
Protein sequence | MNERAPQLRL LLEVFLLAAL CYFFFFFGLS AFGLTGADEP RYAQVAREML QHHDWVTPTL YGNVWLEKPI LYYWGAIVSY RIFGVSDWAA RIPGGVFASA MLLFLYAWTR RFRNGSQLDA IVMTASSVFV FAFARAASID IHLVAPLTIG MLAWWAFYET GHRGWLALFY AMIAIGVLAK GPVSAALAAM VILVFVAIRR DWSAIVRTLW IPGILIFFAI ALPWYVAVQH ANPGFVREFF ITHNLSRFTT NRFQHRQHFW YYIPVLIGGT MPWTVFVIAA LAGGIKSLRD KNEDPLLTYL AVWVLIPLIF FSFSQSKLPG YILPSIVPCG LLVAIWLRRE NTKPSPVLIS VHALLSGAVL AVALLAPYKL YKMPLPGQVL RIAIPVGLIV ALVVAIVVFL RGYAALRFAT IFPVALALAF LLKAAGPAID STQSIRPVAA RITNSFATNE PVMFYNVPRG VEYGLAFYLD RPLPEPPPDE IVRFGTAAAG NQQREKTLKD ISNSLPPNHG NYVLVTRAGA INRFADTVPP NYQIEPFFRF QPQRLDVYFL RDIGPGR
|
| |
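The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation routine. This is a sketch, not the database's own pipeline: it assumes the gene sequence is given 5' to 3' in the coding orientation (the record lists the strand as "-", so the sequence shown is presumably already reverse-complemented), and it relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard table, differing only in permitted start codons. The names `CODON_TABLE` and `translate` are hypothetical helpers introduced for this example.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
# Table 11 uses the same codon-to-amino-acid assignments as the standard table.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(grouped_dna):
    """Translate a DNA string (optionally split into space-separated blocks)
    until the first stop codon. Raises KeyError on non-ACGT characters."""
    dna = "".join(grouped_dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGAACGAAC GAGCACCTCA ACTGCGGCTG"))  # MNERAPQLRL
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `translate` should, under the same assumptions, reproduce the whole 557-residue protein.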