Gene Amir_5150 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmir_5150 
Symbol 
ID8329352 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameActinosynnema mirum DSM 43827 
KingdomBacteria 
Replicon accessionNC_013093 
Strand
Start bp6138517 
End bp6139794 
Gene Length1278 bp 
Protein Length425 aa 
Translation table11 
GC content74% 
IMG OID644945589 
Productcysteine desulfurase, SufS subfamily 
Protein accessionYP_003102817 
Protein GI256379157 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0520] Selenocysteine lyase 
TIGRFAM ID[TIGR01979] cysteine desulfurases, SufS subfamily 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGACCACCA CCGCTGCTCC GCTGGACGTC GAAGTCGTGC GCGCGGACTT CCCGATCCTG 
GGCCGCACCG TGCGCGAGGG CAAGCGGCTG GTCTACCTCG ACTCCGGCGC CACCTCGCAG
CGCCCGAGCC AGGTCCTGGA CGCCGAGCGG GCGTTCCTGG AGACCGCCAA CGCCGCGGTC
CACCGCGGCG CGCACCAGCT GGCCGAGGAG GCCACCGACG CCTACGAGGA CGCCCGCCGC
AGGATCGCGG GCTTCGTCGG CGTCGGCGTC GACGAGGTCG TGTTCACCAA GAACGCGACC
GAGGGCGTCA ACCTGGTCGC GTACGCCATG GGCAACGCGG CCACGGCGGG CCCGGAGGCC
GAGCGCTTCC TGCTGGGCCC CGGCGACGAG ATCGTCGTGA CCGAGATGGA GCACCACGCC
AACCTGGTGC CGTGGCAGCA GCTCGCGCTG CGCACCGGGG CCACGCTGCG CTGGCTCGGC
GTCACCGACG AGGGCAGGCT CGACCTGTCG AACCTGGACG AGGTGGTGAA CGAGCGCACC
AAGGTGCTCG CGTTCACCCA CCAGTCCAAC GTGCTCGGCA CGGTCAACCC GGTCGCCGCC
CTCGTCGCCG CGGCGGCGCG GGTCGGCGCG CTGACCGTGC TCGACGCCTG CCAGTCCGTG
CCGCACGCGC CCGTCGACTT CCGCGCCCTC GGCGTGGACT TCGCCGTCTT CAGCGGCCAC
AAGATGCTCG GTCCCTCGGG CGTCGGCGTC CTCTACGGCC GCCGCGCGCT CCTGGAGGCG
CTGCCCCCGT TCCTCACCGG CGGCTCCATG ATCGAGATGG TCGAGATGGC CCGCTCCACG
TTCGCCCCGC CGCCGCAGCG GTTCGAGGCG GGCGTGCCGA TGACCTCGCA GGCCGTCGCG
CTCGGCGCCG CCGTCGACTA CCTGAACGCG GTCGGCATGG ACCGGGTCGC CGCGCACGAG
CACGAACTGG TCGCCGCCGC CCTCAGCGGC CTGGCGGCCA TTCCCGGCGT GCGCGTGGTC
GGCCCCACCG ACCTCGCCGA CCGGGGCGGC GCGGTCTCGT TCGTGGTCGA CGGGGTGCAC
GCGCACGACG TCGGCCAGGT CCTGGACAGC CTCGGCGTCG CGGTCCGCGT CGGCCACCAC
TGCGCGTGGC CGCTGCACCG CAGGATGAAC GCCGCGGCCA CCGTGCGGGC CAGCTTCTAC
CTCTACAACA CGCAGGGCGA GGTGGACGCG CTGCTGTCCG CCGTCCGCGA GGCGCAGAAG
TTCTTCGGGG TGGCGTGA
 
Protein sequence
MTTTAAPLDV EVVRADFPIL GRTVREGKRL VYLDSGATSQ RPSQVLDAER AFLETANAAV 
HRGAHQLAEE ATDAYEDARR RIAGFVGVGV DEVVFTKNAT EGVNLVAYAM GNAATAGPEA
ERFLLGPGDE IVVTEMEHHA NLVPWQQLAL RTGATLRWLG VTDEGRLDLS NLDEVVNERT
KVLAFTHQSN VLGTVNPVAA LVAAAARVGA LTVLDACQSV PHAPVDFRAL GVDFAVFSGH
KMLGPSGVGV LYGRRALLEA LPPFLTGGSM IEMVEMARST FAPPPQRFEA GVPMTSQAVA
LGAAVDYLNA VGMDRVAAHE HELVAAALSG LAAIPGVRVV GPTDLADRGG AVSFVVDGVH
AHDVGQVLDS LGVAVRVGHH CAWPLHRRMN AAATVRASFY LYNTQGEVDA LLSAVREAQK
FFGVA