Gene Apre_1637 details

Gene Information

Locus tag: Apre_1637
Symbol:
ID: 8398449
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerococcus prevotii DSM 20548
Kingdom: Bacteria
Replicon accession: NC_013171
Strand:
Start bp: 1779152
End bp: 1780375
Gene Length: 1224 bp
Protein Length: 407 aa
Translation table: 11
GC content: 40%
IMG OID: 644996001
Product: cysteine desulfurase, SufS subfamily
Protein accession: YP_003153379
Protein GI: 257067123
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0520] Selenocysteine lyase
TIGRFAM ID: [TIGR01979] cysteine desulfurases, SufS subfamily
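
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `span_length` and `expected_protein_length` are hypothetical helpers, not part of any database tool.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1779152
end_bp = 1780375
gene_length_bp = 1224
protein_length_aa = 407


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = expected_protein_length(computed_length)

# Both computed values agree with the fields listed in the record.
print(computed_length == gene_length_bp)      # True
print(computed_protein == protein_length_aa)  # True
```

Under these assumptions the listed gene length (1224 bp) and protein length (407 aa) are consistent with the start and end positions.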


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATGGATG TAGAAAAAAT TAGAGCTGAC TTTCCCTACC TTGATAGCGA AAAGGTGGGA 
AAGGAAGTTA TTTATTTAGA TACAGGAGCG ACAAGCCAGA AACCAGCCTA TGTGATTGAT
GCAGTTGACG AATACTACAG ATATTCTAAC GCCAACCCTC ACAGGGGAGC CCACTTTCTA
AGCTGGAAGG CGACAGAAGC TTACGAAGAA ACAAGACAAG TTGTCAAAGA CTTCATAGGA
GCTAGAAAAT CTTCTGAGAT TGTATTTACA AGATCAACTA CAGAGGCCCT AAACCTCTTG
GCCTACTCGT ATGGGCTAAA CAATCTCAAA AAAGATGACG AGATCCTAAT TACAATCCTA
GACCATCATG CAAATCTAGT TCCATGGCAA ATGGTAGCAA AAAAGACTGG GGCAAAGCTA
GTCTATGCCT ACCTAAATGA TGACTACGGC TTAGATTATG ATGATTTGAA AAGTAAAATC
AACGAGAAAA CTAAGATAGT TTCTGTAACT GGAGCAAGCA ATGTTACAGG GGAGCTTATC
GATTCAAAGC TTATTACTAA ATGGGCCCAT GAAGTAGGAG CCATATCAAT AGTAGACGGA
GCCCAACTTA TACCTCATGT AAAGACAGAC GTCAAAGATA TAGATTGTGA CTTCCTAGCC
TTTTCAGGAC ACAAGATGTT CTCTCCTATG GGAATCGGAG TCCTTTATGG AAAATACGAG
CTTTTAGATA AGCTTGAGCC TTTCAACTAC GGCGGAGATA TGATAGAATA TGTCTATGAA
CAAGAATCTA CTTTCCAAGA GCCACCTATA AAATTTGAAG CTGGAACTCC AAATGTAGGA
GGAGTCCTTG GATTAAAAGC TGCGATTGAG TATGTAGAAA AAATTGGCAT GGACGAGATA
TTTGCCTATG AGCATGAATT AACTTCCTAT GCCTATGATT TGATAAAGGA CATCCCAAAT
ATCAAAATCT TCTATCCGAC AAATGGCAAG GCAGGATCTG TAATATCATT TACCTTTACA
GACATCCACC CACACGATAT AGCTACAATC CTTGATAGCA AGGGGATAGC TGTAAGAAGC
GGCCACCATT GTGCTATGCC ACTTCACGGA TATCTAGGCA TATCTGCAAC AGCCAGAGCA
TCATTTTCTA TATACAATAC CAAGGAAGAA GCAGAGATTT TTGCTCGTGA GTTAAAGAAT
GTAAGAAAGG TGATGGGCCT ATAA
 
Protein sequence
MMDVEKIRAD FPYLDSEKVG KEVIYLDTGA TSQKPAYVID AVDEYYRYSN ANPHRGAHFL 
SWKATEAYEE TRQVVKDFIG ARKSSEIVFT RSTTEALNLL AYSYGLNNLK KDDEILITIL
DHHANLVPWQ MVAKKTGAKL VYAYLNDDYG LDYDDLKSKI NEKTKIVSVT GASNVTGELI
DSKLITKWAH EVGAISIVDG AQLIPHVKTD VKDIDCDFLA FSGHKMFSPM GIGVLYGKYE
LLDKLEPFNY GGDMIEYVYE QESTFQEPPI KFEAGTPNVG GVLGLKAAIE YVEKIGMDEI
FAYEHELTSY AYDLIKDIPN IKIFYPTNGK AGSVISFTFT DIHPHDIATI LDSKGIAVRS
GHHCAMPLHG YLGISATARA SFSIYNTKEE AEIFARELKN VRKVMGL
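
The gene and protein sequences above can be compared by translating the nucleotide rows codon by codon. The following is a minimal sketch, not the method used to produce this record: it builds the codon table from the amino acid string that NCBI lists for translation table 11 (identical to the standard code for non-start codons), and it only translates the first and last rows of the gene sequence, which both begin in frame because every full row holds 60 bases. The names `clean` and `translate` are hypothetical helpers.

```python
from itertools import product

# Codon table in NCBI order (first base varies slowest, bases ordered TCAG).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spacing used in the record and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last rows of the gene sequence, copied from the record.
first_row = "ATGATGGATG TAGAAAAAAT TAGAGCTGAC TTTCCCTACC TTGATAGCGA AAAGGTGGGA"
last_row = "GTAAGAAAGG TGATGGGCCT ATAA"

print(translate(first_row))  # MMDVEKIRADFPYLDSEKVG
print(translate(last_row))   # VRKVMGL
```

The two outputs match the first 20 and last 7 residues of the protein sequence listed above. Passing the full gene sequence to `translate` would extend the same check to the whole protein.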