Gene Information |
Locus tag | Apar_0249 |
Symbol | |
ID | 8413097 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Atopobium parvulum DSM 20469 |
Kingdom | Bacteria |
Replicon accession | NC_013203 |
Strand | + |
Start bp | 290763 |
End bp | 291941 |
Gene Length | 1179 bp |
Protein Length | 392 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 645021817 |
Product | Cysteine desulfurase |
Protein accession | YP_003179272 |
Protein GI | 257784055 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0520] Selenocysteine lyase |
TIGRFAM ID | [TIGR01977] cysteine desulfurase family protein |
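The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a single stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 290763
end_bp = 291941

# Assumption: coordinates are 1-based and inclusive, so both end positions count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the result is 1179 bp and 392 aa, which agrees with the Gene Length and Protein Length fields.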
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.246818 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATATATC TCAACTGTGC TGCAACGTCA TATAAGCGTC CTCAATGTGT TGTAGATGCT GTTTCACGCG CCCTTTGCTC ATTTGGAAGT ACGGGAAGGG GTGCAGCGTC TGCCGAGCTT GATGCTGCTC GCGCTGTTAT GGTTGCACGA GAACGTATCG CTTTGTTGTT AGGTTTTTCA CATCCAGAGC GTGTCGTGTT TACCGCAAAC GCAACAGATG CCCTAAACAA GGCAATTCTC GGCATTGTTA AGCCCGGAGA TCATGTAGTT GCAACAGATT GGGATCACAA TTCTGTTTTG CGCCCTCTGA ATCGTCTTCA AAAGAAACGC AATGTAAAGG TTGATTATGT TCCTGCTAAT TTGCAGGGCT GCCTTGATTG GGATGTTCTT GAACGATTAG TTTCTCCTGG TACAAAGTTA GTGGTAGTAA CACATGCATC TAATCTTACA GGTAATGTAT GTGATCTTGA ACGTGTGGTT ACTGTTGCTC ACGCTGTAGG TGCACTTGTG CTTGTTGATG CTTCACAAAC CGCAGGGTCA ATACCCATTA ATTTTGATGA TTTAGGTGTT GATGTACTTG CGTTTACTGG TCATAAGGCA CTTATGGGTC CTCAGGGGAC AGGTGGCCTT TTGGTTGCTC CACATGTTGC GATAGAAGCA GTTATTCAAG GAGGAACGGG CGTACTTTCT TTTGAAGAGG AAATGCCTCA AGTATATCCA GAACATCTTG AGGCAGGAAC GCTTAATAGT CACGGTATTG CCGGTCTTTC TGCTGCAGTA GATTTTGTAT CAACACAAGG AGTATGTGCT GTTCACCGTC ATGATTTAGC ATTGGTTAGA CAGTTTGTAG CTGAGGCTCG TCAGGTGCCT GGGATAGAGT TATATGGATG TTTTCCTGAT AACATTTCTG AACTGAATGG GGAGAGTTCG CAAAAAGATC ACGCACCCAT TGTGACACTT AATTTAGATG GGTGGACTTC TTCAGAACTT GCAAGAGTTT TAAGTGTTGA GTATGACATT TCAGTTCGTG CAGGAGCTCA TTGTGCTCCT CGTATGCACA GAGCATTAGG AACGCAAGAT ACGGGGTCGG TCCGTTTCTC ATTTGGATTT TATACGACAG AGGATGAGAT ACATAAGGCA ACCACGGCCC TTAATGAGCT AGCAAAATCG GTGATGTAG
|
Protein sequence | MIYLNCAATS YKRPQCVVDA VSRALCSFGS TGRGAASAEL DAARAVMVAR ERIALLLGFS HPERVVFTAN ATDALNKAIL GIVKPGDHVV ATDWDHNSVL RPLNRLQKKR NVKVDYVPAN LQGCLDWDVL ERLVSPGTKL VVVTHASNLT GNVCDLERVV TVAHAVGALV LVDASQTAGS IPINFDDLGV DVLAFTGHKA LMGPQGTGGL LVAPHVAIEA VIQGGTGVLS FEEEMPQVYP EHLEAGTLNS HGIAGLSAAV DFVSTQGVCA VHRHDLALVR QFVAEARQVP GIELYGCFPD NISELNGESS QKDHAPIVTL NLDGWTSSEL ARVLSVEYDI SVRAGAHCAP RMHRALGTQD TGSVRFSFGF YTTEDEIHKA TTALNELAKS VM
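The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. As a minimal sketch, the snippet below translates only the first 30 bases of the gene sequence. The `translate` helper and the hand-built codon table are illustrative and not part of the record; the table relies on the fact that table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in permitted start codons, which this sketch does not model).

```python
from itertools import product

# Codons enumerated in the conventional T, C, A, G order, paired with the
# amino-acid string shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    # Drop the spaces used to group the sequence into blocks of ten bases.
    seq = "".join(dna.split()).upper()
    # Ignore any trailing bases that do not form a complete codon.
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First three blocks of the gene sequence in this record.
first_30 = "ATGATATATC TCAACTGTGC TGCAACGTCA"
print(translate(first_30))  # MIYLNCAATS
```

The output matches the first block of the protein sequence. Translating the final nine bases, `GTGATGTAG`, gives `VM*`, which is consistent with the protein ending in `VM` followed by a stop codon.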