Gene Information |
Locus tag | Acid345_0636 |
Symbol | |
ID | 4069573 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 780278 |
End bp | 781474 |
Gene Length | 1197 bp |
Protein Length | 398 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637982641 |
Product | cysteine desulphurase-like protein |
Protein accession | YP_589715 |
Protein GI | 94967667 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0520] Selenocysteine lyase |
TIGRFAM ID | [TIGR01977] cysteine desulfurase family protein |
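The coordinates and lengths above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the reported gene length) and that the protein length excludes the stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 780278
end_bp = 781474

# Assumption: 1-based, inclusive coordinates, so both end points count.
gene_length = end_bp - start_bp + 1

# Each codon is 3 bp; the final codon is a stop and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1197
print(protein_length)  # 398
```

Both results match the Gene Length (1197 bp) and Protein Length (398 aa) rows.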
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.024881 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.355065 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAAAGC TGATCTATTT CGATAATGCC GCAACGGGCT GGCCGAAGCC GGAGAACGTG TATCGCTTCA TGGATGAGTT TTACCGCACG CACGGGGTGA ATCCGGGGCG CAGCGGCTAC GACTTGGCAA TGGAGACCGG TTCGATGGTG GACCGGTCAC GCAAGCGCCT GACGAAATTC TTCGGCGGAG ATGAGGATGC GCCAGACCGG CTGGTGTTCA CCTCCAACGT GACCGATGCA CTGAACCTGG TGATTCCTGG TTTGGTGGGG CATGGCGATC ACGTGGTGAC TACGAATCTC GAACACAATT CGGTGATTCG TCCGGTGAAC CATATGGTGC GCGATTGCGG GGCGGAGGCG ACGTATGTGC CCTTCAAAGC GCACGGATTC ATTGAACCGG AAGCGATTGC CGCGGCAATA CGCCCGAATA CAAAAGCCGT GGTTGTGAAC CACGGGTCGA ACGTTATCGG CACGGTACAA CCGGTGGCGG ATATCGGCAA GATCTGCCGT GAGCGCGGTG TGACCTTCGT GATCGATACG GCACAGACAG CCGGAGTAGT GCCGATCAAC ATGAGAGCGA TGAACGTGGA TGTGGTTGCG TTCACCGGTC ACAAGGCGTT GATGGGCAGC GTGGGGATAG GCGGGTTGTG CATTCGCAAG CATGTTGAAG TGAAGCGGGT GCGCAGCGGC GGCACGGGCG TTCGGTCGGT GGATCCGTAT CATCTCGAGG AGTATCCGTG GCGGCTGGAG TATGGGACAC CGAACCTGGT AGGGATCGCT TCACTGTGGG CGGGACAGGA TTGGCTCGAT GAGCATGGAG TGGAAATGGT CCATGCCCGC GAGATGAAAC TGGCGAAGAA GCTGGTGGAC GGCTTCCGAC AAGTCGAAGG CGTGACGCTC TATTGCTGCG AGAATCTGGC GAACCATCTG CCGACGATAT TGATGAACAT CGATACGATG GACCCCGGAG ATGTGGGCGT GATGCTGGAT GTGGACTACA ACATCGCAGT GCGCACTGGG CTGCATTGCG CTCCACTGGT ACATACGCAG CTCGGTACCG TGAAACGCGA TGGGGGAGTG CGCTTCTCGA TCGGAGCGTT CAACACCGAA GAAGAGGTAG ACGCTGCGAT TCACGCGGTT TCTGAAATCG CGAAGTGGGC GATGAGCCGG AGCGCGAAGT CGAGGGTTAC GGCATAG
|
Protein sequence | MEKLIYFDNA ATGWPKPENV YRFMDEFYRT HGVNPGRSGY DLAMETGSMV DRSRKRLTKF FGGDEDAPDR LVFTSNVTDA LNLVIPGLVG HGDHVVTTNL EHNSVIRPVN HMVRDCGAEA TYVPFKAHGF IEPEAIAAAI RPNTKAVVVN HGSNVIGTVQ PVADIGKICR ERGVTFVIDT AQTAGVVPIN MRAMNVDVVA FTGHKALMGS VGIGGLCIRK HVEVKRVRSG GTGVRSVDPY HLEEYPWRLE YGTPNLVGIA SLWAGQDWLD EHGVEMVHAR EMKLAKKLVD GFRQVEGVTL YCCENLANHL PTILMNIDTM DPGDVGVMLD VDYNIAVRTG LHCAPLVHTQ LGTVKRDGGV RFSIGAFNTE EEVDAAIHAV SEIAKWAMSR SAKSRVTA
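As a rough consistency check between the two sequences, the sketch below translates the first 30 and the last 12 bases of the gene sequence and compares them with the ends of the protein sequence. It is a minimal illustration, not the database's own pipeline: the codon table is built by hand from the standard codon assignments, which translation table 11 shares (table 11 differs in which start codons are allowed), and the helper name `translate` is hypothetical.

```python
# Standard codon-to-amino-acid assignments, listed in TCAG order.
# Translation table 11 uses the same assignments for elongation.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    # The record prints the sequence in space-separated blocks of 10,
    # so strip all whitespace before splitting into codons.
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


first_30 = "ATGGAAAAGC TGATCTATTT CGATAATGCC"  # start of the gene sequence
last_12 = "GTTAC GGCATAG"                      # end of the gene sequence

print(translate(first_30))  # MEKLIYFDNA
print(translate(last_12))   # VTA*
```

The first block reproduces the opening residues of the protein sequence (MEKLIYFDNA), and the second ends in the final residues VTA followed by the TAG stop codon.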