Gene Information |
Locus tag | Acid345_0444 |
Symbol | |
ID | 4071691 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 521765 |
End bp | 522871 |
Gene Length | 1107 bp |
Protein Length | 368 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637982448 |
Product | LacI family transcription regulator |
Protein accession | YP_589523 |
Protein GI | 94967475 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1879] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
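The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption that reproduces the stated gene length) and that the protein length excludes the stop codon. The function names are hypothetical and not part of the source database.

```python
# Values copied from the Gene Information table above.
START_BP = 521765
END_BP = 522871


def gene_length_bp(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive start and end coordinates."""
    return end - start + 1


def protein_length_aa(gene_len: int) -> int:
    """Residue count for an unspliced CDS: total codons minus the stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length_bp(START_BP, END_BP)
print(length, protein_length_aa(length))  # prints: 1107 368
```

Both values agree with the Gene Length (1107 bp) and Protein Length (368 aa) fields in the record.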
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.104683 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | TTGACCAGGA AGAGACGCGG GATTCATCTC ATCGCGGAGA TGGCGCAGGT GTCCATCGGC ACGGTGGACC GTGCGCTGCA CGGGCGAAAC GGAATCAGCC ACGCGACGCG CGAACGGATC CTCCAGATCG CGCGGGAAAT TGGATACACG CCGAACCTTG CTGCTCGCGC TCTCTCGGCA GGGAAAGCCG GAGTGCGCAT CGGAGTTTGC ATTCCGCGCG AAATCCATTT CTTCTACGAC CAACTCTGGG GCGGAGTGCT CGAAGAAGCC CGCCGCCTGG AGCATATGGG CGTTGCATTC GAGTTCCGGC CGGTACGAAA TCTCGGCGAG GGCGATACCG AGGCGCTGCG TGAATTGATC GAGGACGGCG TGGATGGCGT CATTCTCACC GCAGGAAATC CAGATGGATT GACGCCCCTG GTGAATGAGG CTGAGGGCCG GAACATTCCT GTCGTCTGTG TATCTACCGA CGCTCCGGAG AGCCTGCGTT CCAGCATCGT TTGCGTTGAG CCGAGACTCA ATGGCCAGCT TGCCGGCGAG TTGATGGGAA AGTTCGTGCC CGCAGGATCG AAGGTTGCCG TGGTTGCCGG CATGCTCACT GCCATGGACC ATCTCAGCAA GACGGAGGGC TTCTCGGTAA CGTTCCCGAA ACACTGCCAT GGCGGCCAAA TCGTGGGCGT TATCGAGGGC CACGAGGACG AGGACGAAAG CTTCCAGAAG ACCTTCGATC TACTGGGTAG AGTTCCGGAC TTGGCTGGTC TTTACGTCAA CACCGTGAAC TGTCTTCCCG TGTGTCGAGC ACTTGGGGCG CGCCAACTCG CAGGGAGAGT CAAACTGATT ACGACCGACT TGTTTGCGGA GATGGCGACC TATTTCGCCA AGGGCACAAT CACCGCATCG ATCTACCAGC AACCCCACCG ACAAGGCCAA CTGGCGGTCA GATTACTCGC CGACAACCTC ACGGCAAACC AGCCATTTCC GCCTACTGTG CACTTAAGTC CTGGGGTTGT CATGTCTTCG AATTTGCACC TTTTCCGCGA GATGCGTCGC AGTGAAACGA AGCTTCCGGA CGTGGTGCGC GTGGCCTCTC TCGCCACGAA GGTGTAG
Protein sequence | MTRKRRGIHL IAEMAQVSIG TVDRALHGRN GISHATRERI LQIAREIGYT PNLAARALSA GKAGVRIGVC IPREIHFFYD QLWGGVLEEA RRLEHMGVAF EFRPVRNLGE GDTEALRELI EDGVDGVILT AGNPDGLTPL VNEAEGRNIP VVCVSTDAPE SLRSSIVCVE PRLNGQLAGE LMGKFVPAGS KVAVVAGMLT AMDHLSKTEG FSVTFPKHCH GGQIVGVIEG HEDEDESFQK TFDLLGRVPD LAGLYVNTVN CLPVCRALGA RQLAGRVKLI TTDLFAEMAT YFAKGTITAS IYQQPHRQGQ LAVRLLADNL TANQPFPPTV HLSPGVVMSS NLHLFREMRR SETKLPDVVR VASLATKV
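The gene sequence begins with TTG while the protein begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), TTG is an accepted alternative start codon and is read as methionine at the initiation position. The sketch below illustrates this with a hand-built codon table. The function name `translate_cds` is hypothetical, and the code is an illustration rather than the pipeline the database used to derive the protein sequence.

```python
from itertools import product

# Codon table in the conventional TCAG ordering. Table 11 uses the same
# amino acid assignments as the standard code; it differs in start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): AMINO_ACIDS[i]
    for i, codon in enumerate(product(BASES, repeat=3))
}

# Start codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate an unspliced CDS, reading the start codon as methionine."""
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing
    if not seq or len(seq) % 3 != 0:
        raise ValueError("CDS length must be a non-zero multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError(f"{codons[0]} is not a table 11 start codon")
    residues = ["M"]  # the initiator is methionine whatever the start codon
    for codon in codons[1:]:
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":  # stop codon ends translation
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate_cds("TTGACCAGGA AGAGACGCGG GATTCATCTC"))  # prints: MTRKRRGIHL
```

The output matches the first ten residues of the protein sequence in the record. Passing the full gene sequence to the same function should allow the whole translation to be compared against the listed protein.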