Gene Information |
Locus tag | Acid345_3282 |
Symbol | |
ID | 4072694 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 3886455 |
End bp | 3888023 |
Gene Length | 1569 bp |
Protein Length | 522 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637985303 |
Product | hypothetical protein |
Protein accession | YP_592357 |
Protein GI | 94970309 |
COG category | [G] Carbohydrate transport and metabolism; [S] Function unknown |
COG ID | [COG0062] Uncharacterized conserved protein; [COG0063] Predicted sugar kinase |
TIGRFAM ID | [TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related; [TIGR00197] yjeF N-terminal region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.172995 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGATCG TCACCGCACA GGAAATGCGC GACATCGATC GCATTACGAC CGAGCGCTAC GGCGTGCCGT CGCTGACGCT GATGGAGAAC GCCGGGCGCG CGGCAGCAGA GATGGTTGTG GAGAATTACC CCGAGGCCAG GTCCATTGCC GTGGTGTGCG GGAAGGGAAA TAACGGCGGT GATGGCTTCG TAGCGGCGCG GCATCTGCAT AAGATGGACC GCGGGGTCGA GGTGCTGCTG CTTGCTGATC CAGAGGGCTT GCGTGGCGAT GCAGCGGAGA TGTATCGGCA GCTTGGGTTC GCTGCGACGA TCGTGAAATC GGAAGAGACG ATCTCATCCA ATTTGCAGCG CGCGTTTGCA GAAGCCGACG TGATTCTCGA TGCGGTGCTC GGCACGGGAT TCAAGCCACC AGTCTCGCCG TTGTACGCGA AGGCAATCGC CGCGATGAAC GCGAGCAAAT TGCCGATCGT CGCTGTGGAT GTGCCATCTG GAGCGGACTC CGACGGTATG CAGCCGCAGT CGGGCGAGGC GATTGCGCGC GCCGATGCAG CGGTAACTTT CACCGCGCCC AAACCGGTTC ACGTGTTCGG CGATCTGGTT CGCGGAAAGA CTGTGGTTGC ACCGATTGGT TCTCCCGACG AAGCCATTGT CAGCAATCTG GGTCTGAACG TAATCACGCC GGCTGACTAT GCGGCCGTGC TGGCCGCTCG GCCGCTCAAC AGCAACAAGG GAATGTACGG CCACGCGCTG ATCGTGGCGG GATCGTTTGG AAAATCCGGC GCAGCGGCGA TGGCGGGTAT GGCGTGTCTG CGCGCTGGTG CCGGGCTCGC GACCGTGGCA ACGCCGAAGT CGGTGCTCAC CAGCGTGGCG TCCTACGCGC CGGAGTTGAT GACTGAGTCG CTCGCCGAGA CCGCGGACGG CACGATCTGC GAAGCCGCGA TTTGGGCCAT TCAGGAACTC GCGAAGAAGA TGACAGTACT TGCGATTGGA CCCGGGCTGA CGCAGAACGC TGAGACCATC CAGGTCGTAC GAGAGCTCGT GCGAGCCAGC GAAAAGCCCA TGGTGATTGA TGCCGACGGG CTGAATGCGC TCGTCGATCA AACCGAGGTT CTGAAAGATG CGAAGGCAGC CACGATCATC ACCCCGCACC CCGGTGAGAT GTCGCGGTTG TGTGGGATAA GTACGAAAGA GGTCCAAGCC GACCGCGTAG GAATCGCAAA GAACTTCGCT GCATCTCGTT ATACGATCGT TGTGCTCAAG GGAGATAAGA CCGTCATCGC CGCGCCTTCG GGAGAAACGT GGATCAACTG CACCGGCAAT CCCGGCATGG CAACCGGAGG CACTGGCGAC GTGCTTACCG GTATCCTCAC CGGCCTGCTG GCGCAACATC CGCAGGATCC GCTGCTGTGC GCGATTGCGG CGGTACATCT CCACGGGATG GCCGGCGACC TTGGCCGCGA TAGGGTTGGT GAGATTTCCC TGATTGCCAC CGACTTGATC CATGCCCTGT CTGGGGCATT CGAGCGCGCG AAAAAGAGTT TGCAGAAGCC CTGGGTTCCC CTAAATTAG
|
Protein sequence | MKIVTAQEMR DIDRITTERY GVPSLTLMEN AGRAAAEMVV ENYPEARSIA VVCGKGNNGG DGFVAARHLH KMDRGVEVLL LADPEGLRGD AAEMYRQLGF AATIVKSEET ISSNLQRAFA EADVILDAVL GTGFKPPVSP LYAKAIAAMN ASKLPIVAVD VPSGADSDGM QPQSGEAIAR ADAAVTFTAP KPVHVFGDLV RGKTVVAPIG SPDEAIVSNL GLNVITPADY AAVLAARPLN SNKGMYGHAL IVAGSFGKSG AAAMAGMACL RAGAGLATVA TPKSVLTSVA SYAPELMTES LAETADGTIC EAAIWAIQEL AKKMTVLAIG PGLTQNAETI QVVRELVRAS EKPMVIDADG LNALVDQTEV LKDAKAATII TPHPGEMSRL CGISTKEVQA DRVGIAKNFA ASRYTIVVLK GDKTVIAAPS GETWINCTGN PGMATGGTGD VLTGILTGLL AQHPQDPLLC AIAAVHLHGM AGDLGRDRVG EISLIATDLI HALSGAFERA KKSLQKPWVP LN
|
| |
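The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based inclusive coordinates, and that the CDS includes its terminal stop codon (the gene sequence above ends in TAG). The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Protein length in aa, assuming the CDS ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


# Values copied from the record above.
start_bp, end_bp = 3886455, 3888023

length_bp = gene_length(start_bp, end_bp)
print(length_bp)                  # 1569, matches "Gene Length"
print(protein_length(length_bp))  # 522, matches "Protein Length"
```

Under these assumptions the computed values agree with the Gene Length (1569 bp) and Protein Length (522 aa) fields.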