Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_0916 |
Symbol | |
ID | 4069127 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 1154426 |
End bp | 1155412 |
Gene Length | 987 bp |
Protein Length | 328 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637982923 |
Product | glycosidase, PH1107-related |
Protein accession | YP_589993 |
Protein GI | 94967945 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2152] Predicted glycosylase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.00526558 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCGATTGC GTTTGACAGC AGCTGCTGTC ACCCTTCTCA GTTTGATCGT GGCCGCGTCC AGCACAGGGA GCACGCCGGT TGCGGGATTG CCTTTCGGGC CGTGGACACG CGCATCGGAC AAGCCGCTGC TTTCGCCGCG CGGCGATGGT TGGGAGTCAG CTGGAACCTT CAATCCTGCC GTGATCGAGC GTGATGGACA GGTCGTCATG CTGTACCGAG CGCAAGACAA GTCAGGGACT TCTCGTCTGG GATATGCGAC GAGCAGCGAC GGGATTCACT TCGAGCATCG TGACAAGCCG GTGTTTTCAC CAGAAGCGGA GTACGAGCGC GATGGCGGAG TGGAGGACCC ACGGTTGGTT GAGATCGACG GAATGTATTA CTTGACCTAC ACCGGGTACA ACCAGAAAGA CGCGCAGCTC TGCCTGGCTG CCTCCAAAGA TTTGCGGCAC TGGGATCGCA AGGGCGTGAT CCTGCCAGCC TACAAGGGAA ACTGGAACGT GGGCTGGACG AAATCCGGCG CGATCTTGAA GCAGAGGATC AATGGGAAGT ATTGGATGTA CTTTCTCGGC ACCACCCCTG AAAAGACGGA CGAAATGGGG CTGGCCTCGT CGACGGATTT AATCCACTGG ACGGAGGAAA CGAAGACTCC GGTTTTGCCA CGACGGCCGG GAAAGTTCGA TTCGCGCGTT GTGGAACCGG GACCTCCGCC GAGTTTGACG GACCGAGGAA TTGTGCTGGT CTACAACGGT GCCGACGATC ATTTGGTATA CAAGACCGCA ATCGCTGTGT TCGATCGGGA TGATCCCCGT AAATTGATCT ACCGCAGTGA GGAACCGATT TTCGCCCCTG AGAAAGACTG GGAAAAGGTT GGACAGGTGC CGAATGTCGT CTTTGTGGAG GGCATGGTGA AGAGGGGGGA TCGCTACTTC TTCTATTATG GTGGGGCCGA TACGCACGTT GGTGTAGCCA CGGCCGAAGC GAAATAA
|
Protein sequence | MRLRLTAAAV TLLSLIVAAS STGSTPVAGL PFGPWTRASD KPLLSPRGDG WESAGTFNPA VIERDGQVVM LYRAQDKSGT SRLGYATSSD GIHFEHRDKP VFSPEAEYER DGGVEDPRLV EIDGMYYLTY TGYNQKDAQL CLAASKDLRH WDRKGVILPA YKGNWNVGWT KSGAILKQRI NGKYWMYFLG TTPEKTDEMG LASSTDLIHW TEETKTPVLP RRPGKFDSRV VEPGPPPSLT DRGIVLVYNG ADDHLVYKTA IAVFDRDDPR KLIYRSEEPI FAPEKDWEKV GQVPNVVFVE GMVKRGDRYF FYYGGADTHV GVATAEAK
|
| |