Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_1565 |
Symbol | |
ID | 4068674 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 1912450 |
End bp | 1913418 |
Gene Length | 969 bp |
Protein Length | 322 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637983574 |
Product | polysaccharide deacetylase |
Protein accession | YP_590641 |
Protein GI | 94968593 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0726] Predicted xylanase/chitin deacetylase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.252193 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.00761587 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCCCTAA TGCAAATCGG AAAAGCGCTC CTTCGTCTCC TCGCATCTTC CCTATTACTT TCCTGCTCTT TGCTCGCCCA ACAGCGCGAA GTCGCCATCA CCATCGACGA TCTTCCCGCC GCCAATCCGC GCCTCAGCGG CAAGCAGATG AACGAACTCA CGTCGAAGCT ACTCGCCACA CTCAAGCAGG AGCAGGTTCC CGCGATTGGC TTCGTGAACG AGCAAAAACT CTACGTCAAA GGTGAGGTCG ACGATCGCAT CGGCGCTCTG CGCCAGTGGG TCGACAACGG TTTCGAGCTC GGCAATCACA CCTTCAGCCA CATGTCGCTC GACACCAACA CCCTGCAAGC CTGGGAAGAG AACGTCGTGC GCGGCGAGAC CGTCACCTCC ATGCTTCTTG CTGAAAAGAA GATGAAGATC CGCTACCTGC GTCATCCATA CTTGATCGTC GGCCGCGATC TCGATACCCG CCGCCAAGCC GAGAATTTCC TCGCGCAGCG TGGATACAAA ATCGCTCCCG TCACCATGGA CGCCTGGGAC TGGATGTACT CCGGCGCCTA CGATGCCGCC CGCGAACAAG GCGATACCGC CATGCAGCGC AAGCTAGTGG ATTCCTATCT TGAGTACACC AACCAGGTCT TCGATTACTA CGAGAAGTTC TCCAAACAAT TCCTCGGCTA TGAACCCAAG CAGGTCCTGC TTCTCCACTG CAACTGGCTC GAAGCCGAAC ACATCAACGA ACTCATCGCC ACCCTCCGCA AGCGCGGCTA CAAATTCGTA ACCCTCGACG AAGCCCTCAC CGACTCCGCC TACAGTCTCC CCAACACGTG GGTTGGCGAC GACGGCCAAA CTTGGATCGA CCAGTGGGCC ATCACCCAGG GCAAAATCCC CACTGGCCAA CCAGAATTCC CAAGGTGGGT AGAGGACATC TCCGAGAAGT ACCGCAAGTC CGGCGCGCAG CCCTACTAG
|
Protein sequence | MALMQIGKAL LRLLASSLLL SCSLLAQQRE VAITIDDLPA ANPRLSGKQM NELTSKLLAT LKQEQVPAIG FVNEQKLYVK GEVDDRIGAL RQWVDNGFEL GNHTFSHMSL DTNTLQAWEE NVVRGETVTS MLLAEKKMKI RYLRHPYLIV GRDLDTRRQA ENFLAQRGYK IAPVTMDAWD WMYSGAYDAA REQGDTAMQR KLVDSYLEYT NQVFDYYEKF SKQFLGYEPK QVLLLHCNWL EAEHINELIA TLRKRGYKFV TLDEALTDSA YSLPNTWVGD DGQTWIDQWA ITQGKIPTGQ PEFPRWVEDI SEKYRKSGAQ PY
|
| |