Gene Information |
Locus tag | Acid345_4166 |
Symbol | |
ID | 4072125 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 4931914 |
End bp | 4933170 |
Gene Length | 1257 bp |
Protein Length | 418 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637986197 |
Product | cysteine desulphurase-like protein |
Protein accession | YP_593240 |
Protein GI | 94971192 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0520] Selenocysteine lyase |
TIGRFAM ID | [TIGR01976] cysteine desulfurase family protein, VC1184 subfamily |
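The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes 1-based inclusive coordinates and a coding sequence that ends in a single stop codon, which is consistent with the values in this record but is not stated by it.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of amino acids for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length_bp(4931914, 4933170)  # Start bp and End bp from the record
    print(length)                     # 1257, matching "Gene Length"
    print(protein_length_aa(length))  # 418, matching "Protein Length"
```

Under these assumptions the record is internally consistent: 4933170 - 4931914 + 1 = 1257 bp, and 1257 / 3 = 419 codons, of which one is the stop codon.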
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.744365 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.275915 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAACGA GTGCACATGC CAGTGTGAAC CTGGAAGCGA TTCGAGCGCA ATTTCCTGCC CTCAGCCAAA CGTACAACGG GCATCCGCGA GTGTATTTCG ATGCGCCCGG TGGAACTCAG GTACCGCAGC AGGTGATTGA CGCGATCTCG GGCTACCTGG TGCATTCGAA CTCGAACACG CATGGGCAAT TCCATACCAG TCACCTGACC GACGAAGTGC TGGAGCACGC CCACGCAGCA ATGGCCGACA TGCTTGGGTG CGATGCGGAT GAGATTGTGT TCGGACAGAA CATGACCACG CTGACCTTTG CGTTGAGCCG CGCGCTGGGC CGCGATTTAC GCGCAGGGGA TGAGATCGTG ACGACATTGC TCGATCACGA TGCGAATGTA GCGCCGTGGC GCGCGCTGGA AGAGACCGGG GCGAGGGTGC ACGCGGTGAA GTTCCATCCC GAGGACTGCA CGCTGGATCT GGAGGATTTG CAGTCGAAGC TGAACGGGCG GACGAAGATT GTGGCGGTGG GATTTGCGTC GAACGCGGTG GGCACGATCA ATCCCATTAA AAAGATTGTG GAGATGGCGC ACGCGGTGGG AGCGTTGGTT TTCGTGGACG CCGTGCACTT TGCGCCGCAT GGGTTCATCG ATGTGCGCGA TCTGGATTGC GATTTTCTCG CGTGCTCGAC GTATAAGTTT TTTGGTCCGC ACATGGGGGT CCTATTTGGG AAGCATGAGC ATCTGTTGCG GTTGAAGCCG TATAAGGTGC GTCCGGCGGC AGATACTTTG CCGGACCGGT GGGAGACGGG CACCCTGAAC CATGAGTGCA TTGCGGGAAT CACGGCATGC GTGGAGTACC TGGCCGATGT CGGGCTGAAG ACGGTGAAGC ATCCGGAGTC GCGGCGGGAT GCGATTGCGG CGGCGTATGC GTGGATGAAA GAGCATGAGC ATGAATTGGC GAGGCAGCTT ATCGGCGGAC TGCTGGAGAT TCCGGGGCTG ACGTTTTATG GGATCCGGGA TTTGAGCCGG CTGGATGAGC GAACGCCAAC GGTGTCAATA CGAATGGCGA AACTTTCGCC AGCAGAGTTG TCGAAGAAGC TTGGCGATCT CGGGATTTAT ACGTGGGATG GGAACTTCTA CGCGATCAAT GTGACGGAAC AGTTGGGCGT GGAAGAAGAT GGCGGGATCC TACGGATCGG GTTGGCGCAT TATGCAACTT CAGCGGAAGT GGAAAGGTTG TTGAAGGCGC TGCGAGAGTG GGCATAG
|
Protein sequence | MATSAHASVN LEAIRAQFPA LSQTYNGHPR VYFDAPGGTQ VPQQVIDAIS GYLVHSNSNT HGQFHTSHLT DEVLEHAHAA MADMLGCDAD EIVFGQNMTT LTFALSRALG RDLRAGDEIV TTLLDHDANV APWRALEETG ARVHAVKFHP EDCTLDLEDL QSKLNGRTKI VAVGFASNAV GTINPIKKIV EMAHAVGALV FVDAVHFAPH GFIDVRDLDC DFLACSTYKF FGPHMGVLFG KHEHLLRLKP YKVRPAADTL PDRWETGTLN HECIAGITAC VEYLADVGLK TVKHPESRRD AIAAAYAWMK EHEHELARQL IGGLLEIPGL TFYGIRDLSR LDERTPTVSI RMAKLSPAEL SKKLGDLGIY TWDGNFYAIN VTEQLGVEED GGILRIGLAH YATSAEVERL LKALREWA
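The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a sequence might be cleaned, checked for GC content, and translated is given below. The helper names are hypothetical, and the codon table is the standard genetic code written out by hand; translation table 11 uses the same codon-to-amino-acid assignments and differs only in its permitted start codons, which does not matter here because this gene begins with ATG.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(raw: str) -> str:
    """Remove the whitespace used for display blocks and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon.

    A trailing partial codon, if any, is ignored.
    """
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First two display blocks of the gene sequence in this record.
    fragment = clean_sequence("ATGGCAACGA GTGCACATGC")
    print(translate(fragment))  # MATSAH, the first six residues of the protein
```

Passing the full gene sequence through `clean_sequence` and then `gc_fraction` or `translate` gives values that can be compared with the GC content and protein sequence fields of the record.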