Gene Afer_0567 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAfer_0567 
Symbol 
ID8322626 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidimicrobium ferrooxidans DSM 10331 
KingdomBacteria 
Replicon accessionNC_013124 
Strand
Start bp568568 
End bp569818 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content68% 
IMG OID644951705 
Productcysteine desulfurase, SufS subfamily 
Protein accessionYP_003109194 
Protein GI256371370 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0520] Selenocysteine lyase 
TIGRFAM ID[TIGR01979] cysteine desulfurases, SufS subfamily 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.93631 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAGCGA CACGTGGCCT TGGGCTCGAC GTCGAGGAGC TGCGTAAGGA CTTCCCGATC 
TTCGCCGAGC GCGGTGCGGG CTTCCACTAC CTCGACTCGG CGGCCTCGGC CCAGAGGCCG
AGTGCGGTGC TCGAGGCGAT GGACGCCTAC TACCGTTCCC ACCACGCCAA CGTCCACCGC
GGCGTCTACG GCCTCGCCGA GGACGCCACC GATCGCTACG AGCGCGCACG CCGAGCCATC
GGCCGCTTCG TCAACGCCCC CGACCCGGAG CGCGAGGTGG TGTTCACGAA GAACGCCACC
GAAGCGCTCA ATCTCGTCGC GCAGGGTCTC GGCCGGGTGC TCCTCGGACC CGGTCGCGCC
GTCGTCCTGA CGGAGATGGA GCACCATGCC AACTTGGTCC CGTGGATGAT CCTGCAGGAG
CAGCTCGGGT TCGAGCTGCG CTACCTGCCC TTCGATGGTG ACGGCCAGCT GGTGCTCGAC
GACGCGGAGC GGATCCTCGA CGGCGCCGCC ATCCTCAGCG TCACCGCGAT GTCGAACGTG
CTCGGGACGC TGAACCCGAT CCCCCATCTC GCCGAGCTCG CGCACGGAGC AGGTGCCGTC
GTGGTCGTGG ACGCAGCGCA GTATGCGCCG CACCATCCGA TCGACGTGCA GGCCTGGGGC
GTCGATCTCG TTGCGATGAC CGGACACAAG ATGCTCGGCC CTACGGGCAT CGGCGCGCTG
TGGGGGCGGC TCGAGCTGCT CGAGCAGATG ACGCCGTTCC TCGGTGGCGG CGACATGATC
CTCGACGTGA CGCTGGAGGG GTTCGTGCCG AACGAGGTGC CGTACAAGTT CGAGGCCGGT
ACGCCCCCGA TCGCCGAGGC CATCGGGTGG GAGGTCGCCA TCGACTACCT CCGCGACCGT
GTCGGGTTCG AGGCACTCGC CGCCCACGAG CGCTCGCTGA CCGCCTATGC CCTCGGTAGC
TTGGCTGATG GCCTCGGGGA GCGCATTCGG ATCTTCGGCC CCCGCGACCC GGAGCGTCGA
GGCGGCGTGA TCTCGTTCGA GCTCCAGGGC GTGCATCCCC ACGACGTCGC CCAAGTGCTC
GATCGTCACG GAGTGTGCGT GCGTGCGGGA CACCACTGCG CCAAGCCTCT CATGCGAGAG
ATCGGCCAGG CCGCGACCGC CCGTGCGTCG CTGTATCTCT ACAACGACCG TGCTGACATC
GACGCTCTCG TCATGGCACT CCAAGACGCC TGGCAGCGCT TCAACGACTA G
 
Protein sequence
MTATRGLGLD VEELRKDFPI FAERGAGFHY LDSAASAQRP SAVLEAMDAY YRSHHANVHR 
GVYGLAEDAT DRYERARRAI GRFVNAPDPE REVVFTKNAT EALNLVAQGL GRVLLGPGRA
VVLTEMEHHA NLVPWMILQE QLGFELRYLP FDGDGQLVLD DAERILDGAA ILSVTAMSNV
LGTLNPIPHL AELAHGAGAV VVVDAAQYAP HHPIDVQAWG VDLVAMTGHK MLGPTGIGAL
WGRLELLEQM TPFLGGGDMI LDVTLEGFVP NEVPYKFEAG TPPIAEAIGW EVAIDYLRDR
VGFEALAAHE RSLTAYALGS LADGLGERIR IFGPRDPERR GGVISFELQG VHPHDVAQVL
DRHGVCVRAG HHCAKPLMRE IGQAATARAS LYLYNDRADI DALVMALQDA WQRFND