Gene Apar_0399 details


Gene Information

Locus tag: Apar_0399
Symbol:
ID: 8413248
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Atopobium parvulum DSM 20469
Kingdom: Bacteria
Replicon accession: NC_013203
Strand:
Start bp: 460274
End bp: 461563
Gene Length: 1290 bp
Protein Length: 429 aa
Translation table: 11
GC content: 49%
IMG OID: 645021967
Product: cysteine desulfurase, SufS subfamily
Protein accession: YP_003179421
Protein GI: 257784204
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0520] Selenocysteine lyase
TIGRFAM ID: [TIGR01979] cysteine desulfurases, SufS subfamily
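
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 460274
end_bp = 461563

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last triplet is assumed to be the stop codon,
# which does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # 1290 bp
print(protein_length, "aa")  # 429 aa
```

Under these assumptions the result agrees with the listed Gene Length (1290 bp) and Protein Length (429 aa).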


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGACCGACA CGGAAGTTGC TCAGATTACC GATAACCCTT ACAAAGCAGA TTTTCCAATT 
CTGGCAGCAA ATCCCAAGGT AGCTTACTTG GACAGTGCCG CCACTTCTCA GCGTCCACGC
GTAGTCATTG AGGCACAGTC ACGCTTCTAT GAGACCATGA ACGCAAATGC TTTGCGCGGA
TTGTACCGTT GGAGCGTGGA TGCAACCGCA GCTATCGAGG ACGCTCGCGC TTCGATTGCA
AAGTTTATTG GTGCTGTCGA TAAAAACGAC AAGCCAGAAA GTCAGCAGAT TATCTTTACG
CGCAACACTT CTGAGGCACT TAATCTAGTT GCTTCAAGCC TTGGTCGCTA TGCGCTCAAA
CCAGGCGATG ACGTGGTCAT TTCCATCATG GAACACCACT CAAACATCAT TCCTTGGCAG
CAAATTTGCA AGGCTACCGG TGCTAACCTG GTGTATCTAC GCATGAACTC GCAGTATCAA
ATTACGCCCG AAGAAATTGC ATCAAAGATT ACCGAGCGCG CAAAAATTGT ATCTGTCACA
CACGTCTCCA ACGTGCTGGG AACTCGCAAT GACATCAAAG CCATTGCTAA GCGTACTCAT
GCTATGGGTG CCTACATGGT TGTTGATGCT GCTCAATCTG CACCTCACAT CCTTGTTAAC
GTGCATGATC TTGACTGTGA TTTGTTGGCT TTCTCAGCGC ATAAGATGTG CGGACCTATG
GGCATTGGTG TGCTTTGGGG TCGTGCCGAG CTGCTCAACG CTATGCCACC ATTTCTTACC
GGTGGCGAGA TGATTGATTC CGTCACCGAG ACCGGCGCTG TTTGGGCCCC TGTACCCGAG
AAGTTTGAGG CTGGTACACA GGATGCTGCA GGTATCTACG CTACCGGCGC CGCTATCAAA
TACCTCAACG GCTTGGATAT GGCAAAGATT GAGAAGCGAG AAGAACTTCT CGCCAGGTAC
TTGGTACAGC AACTATGCAC GCTGGACTTT GTTGACATTG TTGGCTCTAA GCTGGGACAA
AATCACGTGG GCGCTGTGGC ATTTAACGTA CGCGGCGTTC ATCCACACGA CGTCTCAAGC
ATTCTTGACA TGAATAATGT TTGCATTCGC GCTGGTCATC ACTGTGCTGA GCCGCTTCTG
ATTGAACTGC ATGAGTCCAG CACATGCCGC GCATCCGTTG CATTCTACAA CGATAAACAC
GACATTGATC AGCTTATCGA GGGTCTTAAC CAGGTTTGGA AGATCTTTGG CAGTGCGGTC
AACACTAAGC AAAACAAGGA GCTGAAATAA
 
Protein sequence
MTDTEVAQIT DNPYKADFPI LAANPKVAYL DSAATSQRPR VVIEAQSRFY ETMNANALRG 
LYRWSVDATA AIEDARASIA KFIGAVDKND KPESQQIIFT RNTSEALNLV ASSLGRYALK
PGDDVVISIM EHHSNIIPWQ QICKATGANL VYLRMNSQYQ ITPEEIASKI TERAKIVSVT
HVSNVLGTRN DIKAIAKRTH AMGAYMVVDA AQSAPHILVN VHDLDCDLLA FSAHKMCGPM
GIGVLWGRAE LLNAMPPFLT GGEMIDSVTE TGAVWAPVPE KFEAGTQDAA GIYATGAAIK
YLNGLDMAKI EKREELLARY LVQQLCTLDF VDIVGSKLGQ NHVGAVAFNV RGVHPHDVSS
ILDMNNVCIR AGHHCAEPLL IELHESSTCR ASVAFYNDKH DIDQLIEGLN QVWKIFGSAV
NTKQNKELK
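
The relationship between the gene sequence and the protein sequence can be reproduced with a short script. This is a minimal sketch, not the database's own pipeline: it builds the codon-to-amino-acid assignments used by translation table 11 (which match the standard code for elongation codons), strips the spaces and line breaks used in the blocks above, and translates up to the first stop codon. The helper names `clean`, `translate`, and `gc_percent` are hypothetical, and `gc_percent` is only one plausible way to compute a GC content figure.

```python
# Codon table in the conventional TCAG ordering (first, second, third base).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in a non-empty sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence, copied with their display spacing.
first_block = "ATGACCGACA CGGAAGTTGC TCAGATTACC"
print(translate(first_block))  # MTDTEVAQIT
```

Pasting the full gene sequence into `translate` should yield the protein sequence listed above with its spaces removed, and `gc_percent` on the same input can be compared against the GC content field.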