Gene Apre_0365 details


Gene Information

Locus tag: Apre_0365
Symbol:
ID: 8397139
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerococcus prevotii DSM 20548
Kingdom: Bacteria
Replicon accession: NC_013171
Strand:
Start bp: 415153
End bp: 416289
Gene Length: 1137 bp
Protein Length: 378 aa
Translation table: 11
GC content: 43%
IMG OID: 644994723
Product: Cysteine desulfurase
Protein accession: YP_003152135
Protein GI: 257065879
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0520] Selenocysteine lyase
TIGRFAM ID: [TIGR01977] cysteine desulfurase family protein
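
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (an assumption, though it is consistent with the reported lengths) and that the final codon is a stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields; names are illustrative only.
start_bp = 415153
end_bp = 416289

# Assumption: coordinates are 1-based and inclusive, so both ends are counted.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the last codon is assumed to be
# a stop codon and therefore does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1137
print(protein_length)  # 378
```

Both results agree with the Gene Length (1137 bp) and Protein Length (378 aa) fields.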


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.60313
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTATTTTG ATAACGCTGC TACGACTATC CACAAGCCCA AGGCTATGAC AGATAAGCTA 
GTTGAAGTAA TTTCTTCTGG AAATTACGGC AATCCATCAA GGTCAGGCCA CATGCTTTCT
CAAAACTCCA TGATGGCAAT CTTTGATACG AAAAAGTCCC TGGCTAGGCT TTGCCATATA
GAAAATCCCT CAGATATCCT CCTTACAGCA AACGCCACCT TCGCCCTAAA CTTTGCCATC
AAGTCCTTGG TAAACAAGGA TGACCACATC ATCACCTCGA CTACAGAACA TAACTCTATC
CTCCGCCCCC TCTATCAAAC GGGAGCTGAT ATTTCCTTTG TAGACTTTGA TGAGAATTAC
GAGCTTAAAT ACCAGAGCCT TCCCAAGCTT TTAAGAAAAA ACACGAAATT TCTCGTCATA
AATTCCGCAT CAAATCTTTT AGGAGATGTA AAGGACCTGG ATAGGGTCTA TGATTTTGCC
AGAGAAAACG AGCTACTAAT GATAGTAGAT TGTGCCCAAA GCCTGGGCCT TATCGATATC
GATATGGGAA AATACGAAAA CTCACTCTTT GCCTTTACAG GACACAAATC CCTTTACGGG
CCAAGCGGGA CCGGCGGACT AATCAAAAAC GGAAACTTTC TTTTCGCTCA AGTATTTGCC
GGAGGTAGCG GGATAGATTC TTTCAGGGAG ACCATGCCCC CTTCCTTCCC CCAAATATTT
GAAGCAGGAA CGGCCAACTT CCTAGGTCAA ATCGCCCTAA AGGCAGGAGT CGACTTCATC
CTAGATGAGG GCATTGATAA GATAAATAAA AAAGTAAGAG AGCTTGCAAG TAGATTTTAC
CGAGGGATTG AAGAAATCCC TGACCTTAAG TTCTACTCCA AGGACCCAGA CTGTCTTAAA
ACAGGCCTAG TTTCTTTTAA TATTGGAGAT ATCTCCTCAG ATGAAATCTC CCTCATCCTC
GATGAGGACT ACAATATCCA AACAAGACCC GGTTCCCATT GTGCTCCCCT AATCCATAGG
CATTTTGCTA CTGAAAAACA GGGGATAGTG AGATTTTCCT TTTCTTACTT CAACACAAAT
GAGGAAGTCG ATAGGGCAAT CCTAGCCCTA AGAGATATAT CCCAAATATA CAAATAG
 
Protein sequence
MYFDNAATTI HKPKAMTDKL VEVISSGNYG NPSRSGHMLS QNSMMAIFDT KKSLARLCHI 
ENPSDILLTA NATFALNFAI KSLVNKDDHI ITSTTEHNSI LRPLYQTGAD ISFVDFDENY
ELKYQSLPKL LRKNTKFLVI NSASNLLGDV KDLDRVYDFA RENELLMIVD CAQSLGLIDI
DMGKYENSLF AFTGHKSLYG PSGTGGLIKN GNFLFAQVFA GGSGIDSFRE TMPPSFPQIF
EAGTANFLGQ IALKAGVDFI LDEGIDKINK KVRELASRFY RGIEEIPDLK FYSKDPDCLK
TGLVSFNIGD ISSDEISLIL DEDYNIQTRP GSHCAPLIHR HFATEKQGIV RFSFSYFNTN
EEVDRAILAL RDISQIYK
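
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a hedged illustration, not the method used to produce the record: it uses the codon assignments of the standard genetic code, which translation table 11 shares for elongation codons, and it does not handle the alternative start codons that table 11 permits. The function names `clean`, `translate`, and `gc_fraction` are hypothetical helpers introduced here.

```python
# Codon table built from the NCBI-style layout: bases ordered T, C, A, G,
# with one amino acid letter per codon ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three 10-base blocks of the gene sequence listed above.
first_block = "ATGTATTTTG ATAACGCTGC TACGACTATC"
print(translate(first_block))  # MYFDNAATTI
```

The printed peptide matches the first ten residues of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.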