Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_3536 |
Symbol | |
ID | 4069267 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 4182955 |
End bp | 4184223 |
Gene Length | 1269 bp |
Protein Length | 422 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637985559 |
Product | CBS domain-containing protein |
Protein accession | YP_592611 |
Protein GI | 94970563 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCACCT TCTGGCATGT GGTGATCCTG CTGCTGCTCA TGGCGCTGCT GACGCTGGTG TCGTACGTTG ACCGCGTCTT CAAGGAAGCG GGCAAGTTCC TCTCGCGTGA GTTCCAGGAG AACATCGACT ACTTCGAGTC CAACATCGAG CCGAAGTTGG GACGCAATCC GCAGCGCGCG GCGCTGGCCA TGGCCGTCTT GCCGCAAATG TTGCTGGGAA CGATCGCGTT CCTCATGGCG TACACCGTGT TCTCGCATCC GTGGTCTGGA CTTGAGCTCT TGCAGGCCGC GTTGGTGCTT GTGTTCGTCA TCGTGATCTG CAGCCGCCTG GTGCCATACC TGCTCTTTGC GCGGACGCGC GGCGAGTGGG CGAAGAACTT TGTTTGGCTC TTCCGCCTGA TGATTTATCT CGCGATTCCG GTGACGATCA TGTTGGGCTT CACCATCTCC GTCGCGGCGC TGGCGAAGGA AAACGCGGAA GAAGAGCCGG AGCATCCCTC GGAAGCCGTG GATGCGCTGA TTGAAGCCGG AACGGAAGAG GGCATTCTCG AAGAGAGCGA TCGCCATTTG ATCCAGTCGG TGGTTGAGTT TGGCGATAAG ACGGTGCGCG AGGTGATGAC ACCGCGGCCG CGGATTTTCG CGGTGCCGAC AGACTGGACC CTCGAACAAT TAACCGATGC GCTGCGCGAC CAAGGCTATT CGCGGATTCC AGTGTTTCGC GGTTCGATCG ACAATCTTGT GGGCATCGTC TTCTCGCGCG ACTTGCTGCA GATTGCCGAT ACCGATGCTC GTACACGAAA AGTTGGCGAC CTCGTCCGCG AGGAACTGAT GTTCGTTCCA GAGACGAAGC GCGGCAGTGA GCTACTGCGC GAGATGCAGC GGGACAACGT CCGCATGGCC GTGGTGATCG ACGAATACGG CAGCGTCGCC GGCTTAGTCA CGATTGAAGA TTTGATTGAG GAGATCGTCG GCGAACTGCG CGACGAGGAC GAAACCGACA TTGTGAAAGA AGGCGAGCAC ACTTATGTTG TGCCGGGAAG CATGGATGTG GACCGTTTGA ACGAACTATT TGGTGTACGT GTGGATGAAG ATCATGAATC CTCGACCGTT GCGGGCCTCG TCAGCGAGAT AGCGGGACGC ATCCCGCAGC CTGGAGAAGT GGTGGAGAAC CTCGGATTGC GATTTGAAGT GTTGGCTTCT ACCGATCGCC GCATCGAACG GCTGCGCATC AGCGAGGCGA CCCAAACCCC CGACCAGGTG CAAGCCTGA
|
Protein sequence | MITFWHVVIL LLLMALLTLV SYVDRVFKEA GKFLSREFQE NIDYFESNIE PKLGRNPQRA ALAMAVLPQM LLGTIAFLMA YTVFSHPWSG LELLQAALVL VFVIVICSRL VPYLLFARTR GEWAKNFVWL FRLMIYLAIP VTIMLGFTIS VAALAKENAE EEPEHPSEAV DALIEAGTEE GILEESDRHL IQSVVEFGDK TVREVMTPRP RIFAVPTDWT LEQLTDALRD QGYSRIPVFR GSIDNLVGIV FSRDLLQIAD TDARTRKVGD LVREELMFVP ETKRGSELLR EMQRDNVRMA VVIDEYGSVA GLVTIEDLIE EIVGELRDED ETDIVKEGEH TYVVPGSMDV DRLNELFGVR VDEDHESSTV AGLVSEIAGR IPQPGEVVEN LGLRFEVLAS TDRRIERLRI SEATQTPDQV QA
|
| |