Gene Apar_0249 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_0249 
Symbol 
ID8413097 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp290763 
End bp291941 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content45% 
IMG OID645021817 
ProductCysteine desulfurase 
Protein accessionYP_003179272 
Protein GI257784055 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0520] Selenocysteine lyase 
TIGRFAM ID[TIGR01977] cysteine desulfurase family protein 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.246818 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATATATC TCAACTGTGC TGCAACGTCA TATAAGCGTC CTCAATGTGT TGTAGATGCT 
GTTTCACGCG CCCTTTGCTC ATTTGGAAGT ACGGGAAGGG GTGCAGCGTC TGCCGAGCTT
GATGCTGCTC GCGCTGTTAT GGTTGCACGA GAACGTATCG CTTTGTTGTT AGGTTTTTCA
CATCCAGAGC GTGTCGTGTT TACCGCAAAC GCAACAGATG CCCTAAACAA GGCAATTCTC
GGCATTGTTA AGCCCGGAGA TCATGTAGTT GCAACAGATT GGGATCACAA TTCTGTTTTG
CGCCCTCTGA ATCGTCTTCA AAAGAAACGC AATGTAAAGG TTGATTATGT TCCTGCTAAT
TTGCAGGGCT GCCTTGATTG GGATGTTCTT GAACGATTAG TTTCTCCTGG TACAAAGTTA
GTGGTAGTAA CACATGCATC TAATCTTACA GGTAATGTAT GTGATCTTGA ACGTGTGGTT
ACTGTTGCTC ACGCTGTAGG TGCACTTGTG CTTGTTGATG CTTCACAAAC CGCAGGGTCA
ATACCCATTA ATTTTGATGA TTTAGGTGTT GATGTACTTG CGTTTACTGG TCATAAGGCA
CTTATGGGTC CTCAGGGGAC AGGTGGCCTT TTGGTTGCTC CACATGTTGC GATAGAAGCA
GTTATTCAAG GAGGAACGGG CGTACTTTCT TTTGAAGAGG AAATGCCTCA AGTATATCCA
GAACATCTTG AGGCAGGAAC GCTTAATAGT CACGGTATTG CCGGTCTTTC TGCTGCAGTA
GATTTTGTAT CAACACAAGG AGTATGTGCT GTTCACCGTC ATGATTTAGC ATTGGTTAGA
CAGTTTGTAG CTGAGGCTCG TCAGGTGCCT GGGATAGAGT TATATGGATG TTTTCCTGAT
AACATTTCTG AACTGAATGG GGAGAGTTCG CAAAAAGATC ACGCACCCAT TGTGACACTT
AATTTAGATG GGTGGACTTC TTCAGAACTT GCAAGAGTTT TAAGTGTTGA GTATGACATT
TCAGTTCGTG CAGGAGCTCA TTGTGCTCCT CGTATGCACA GAGCATTAGG AACGCAAGAT
ACGGGGTCGG TCCGTTTCTC ATTTGGATTT TATACGACAG AGGATGAGAT ACATAAGGCA
ACCACGGCCC TTAATGAGCT AGCAAAATCG GTGATGTAG
 
Protein sequence
MIYLNCAATS YKRPQCVVDA VSRALCSFGS TGRGAASAEL DAARAVMVAR ERIALLLGFS 
HPERVVFTAN ATDALNKAIL GIVKPGDHVV ATDWDHNSVL RPLNRLQKKR NVKVDYVPAN
LQGCLDWDVL ERLVSPGTKL VVVTHASNLT GNVCDLERVV TVAHAVGALV LVDASQTAGS
IPINFDDLGV DVLAFTGHKA LMGPQGTGGL LVAPHVAIEA VIQGGTGVLS FEEEMPQVYP
EHLEAGTLNS HGIAGLSAAV DFVSTQGVCA VHRHDLALVR QFVAEARQVP GIELYGCFPD
NISELNGESS QKDHAPIVTL NLDGWTSSEL ARVLSVEYDI SVRAGAHCAP RMHRALGTQD
TGSVRFSFGF YTTEDEIHKA TTALNELAKS VM