Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_0962 |
Symbol | |
ID | 4072950 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 1220176 |
End bp | 1221285 |
Gene Length | 1110 bp |
Protein Length | 369 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637982969 |
Product | glycosy hydrolase family protein |
Protein accession | YP_590039 |
Protein GI | 94967991 |
COG category | [R] General function prediction only |
COG ID | [COG4225] Predicted unsaturated glucuronyl hydrolase involved in regulation of bacterial surface properties, and related proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCGAG CCATCATCCT AGCCGCTGTC TTGCTCTGCA TGCCGGTCTT TGCCGCGCAA GCCCAGCCCG CACCGTCCGC CACCGAAGTG CTGGCGACGA TGGAGCGCGT CGCCGACTGG CAACTCGCGC ATCCCTCGTC TCACGCCACA ACCGAGTGGA CGCAAGCCGC CGGATACGCC GGCATGATGG CGCTCGCGAA TCTTTCGAAG GGCCCCAGGT ACCGTGAAGC CGTGCGCTCG ATGGGGGAAG CCAACGCCTG GAAGGCCGGC CCGCGCACGT ATCACGCCGA TGACTTAGCG GTTGGCCAGA CCTACGAAGC ACTCTACGTC TTCTACAAAG ATCCCAAGAT CATCGCGCCC CTTCGCGAGC GCCTCGACTT CATCCTCGCG ACCCCGTCCA AAGTCCAGTC GCTCGACTTT CAGCAGAAAT ATGATCAGGT GAGCGAACTA TGGTCGTGGT GCGATTCGCT TTTCATGGCG CCGCCGGTGT GGGTCGAAAT GTCGGCGATC ACTGGCGATC CGCGCTACCG CGAGTTCGCG ATCAAAGACT GGTGGCGCAC CACGGATTTC CTTTACGATC CCGCCGAGCA TCTTTATTAC CGCGACAGCA CGTATTTCAA TCGCCGCGAA GCCAACGGGC AGAAGGTGTT CTGGAGTCGC GGCAACGGCT GGGTGATGGC CGGACTTGTT CGCGTGCTTG ATGCTTTGCC CCCGCAGGAT CCATCGCGGC CGCGCTTCGA GAAGCTCTAT CGCGACATGG CGCAAGCCGC GCTTAAAGCA CAGCAGCCCG ACGGTCTTTG GCACGCCAGT CTTCTCGATC CGCAGAGCTA TCCACTCAAA GAGACCAGCG GCTCTGGTTT CTTTACCTAT GCTCTGGCGT GGGGAGTGAA CCACAAGCTG CTCGACCGCG CTGAATACGA GCCCGCTGTC GTAAGAGCAT GGAGCGCGCT GGTGGCTTGC GTGGATAAAG ACGGCAAGCT CACGCACGTC CAGCCCATTG GCTCCGATCC CAAATCGTTC GCCGAGGATT CGACTGAGGT ATATGGCGTC GGGGCATTCC TGCTCGCAGG CAGCGAGGTC TATCGCATAG CTGAGAAGGG AAAGCACTGA
|
Protein sequence | MKRAIILAAV LLCMPVFAAQ AQPAPSATEV LATMERVADW QLAHPSSHAT TEWTQAAGYA GMMALANLSK GPRYREAVRS MGEANAWKAG PRTYHADDLA VGQTYEALYV FYKDPKIIAP LRERLDFILA TPSKVQSLDF QQKYDQVSEL WSWCDSLFMA PPVWVEMSAI TGDPRYREFA IKDWWRTTDF LYDPAEHLYY RDSTYFNRRE ANGQKVFWSR GNGWVMAGLV RVLDALPPQD PSRPRFEKLY RDMAQAALKA QQPDGLWHAS LLDPQSYPLK ETSGSGFFTY ALAWGVNHKL LDRAEYEPAV VRAWSALVAC VDKDGKLTHV QPIGSDPKSF AEDSTEVYGV GAFLLAGSEV YRIAEKGKH
|
| |