Gene BBta_3969 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBBta_3969 
SymbolsufS 
ID5151531 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBradyrhizobium sp. BTAi1 
KingdomBacteria 
Replicon accessionNC_009485 
Strand
Start bp4176796 
End bp4178043 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content62% 
IMG OID640558804 
Productcysteine desulfurase 
Protein accessionYP_001239945 
Protein GI148255360 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0520] Selenocysteine lyase 
TIGRFAM ID[TIGR01979] cysteine desulfurases, SufS subfamily 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0413583 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCACGC ATCCGGCAGT CGCCAACGGG ACCTATGACG TCGCCCGCAT CCGCGAGGAT 
TTCCCGGCAC TTGCGCTGCA GGTCTATGGC AAGCCGCTCG TCTATCTCGA CAACGCCGCC
TCGGCGCAGA AGCCGCGCGC AGTGCTCGAC CGCATGACGC AGGCCTACAC CAGCGAATAC
GCCAACGTGC ATCGCGGCCT GCATTATCTC GCCAATGCCG CGACCGAAGC CTATGAAGGC
GGCCGCGAGC GGGTGACGCG CTTCCTCAAT GCAAGACGCA ACGAGGAGAT CATCTTCACC
CGCAACGCGA CCGAGGCGAT CAACCTCGTG GCGTCGTCGT GGGGCGGGCC GAATATCGGC
GAGGGCGATG AGATCGTGCT CTCGATCATG GAGCACCATT CCAACATCGT GCCGTGGCAT
TTCTTGCGCG AGCGCCAGGG TGCGGTGATC AAGTGGGCGC CGGTCGACGA TGAAGGCAAC
TTCCTGATCG ACGAGTTCGA AAAGCTGCTG ACCGCCAAGA CCAAGCTGGT CGCGATCACG
CAGATGTCGA ACATGCTCGG CACGCTCGTG CCGGTGAAGG AGGTCGTGCG CATTGCGCAT
GCGCGCGGCA TTCCTGTGCT GATCGATGGT AGCCAGGCAG CGGTGCATCT TGCCGTCGAT
GTCCAGGACA TCGACTGTGA TTTCTACGTC TTCACCGGTC ACAAGCTCTA TGGGCCGACC
GGGATCGGCG TGCTTTACGG GAAGTATGAT CGCCTCGCCG CGATGCGTCC CTTCAATGGC
GGCGGCGAGA TGATCCGCGA GGTCGCGCGC GACTGGGTCA CCTATGGCGA TCCGCCGCAT
AGGTTCGAGG CGGGCACGCC GATGATCGTC GAGGCGGTCG GACTGGGTGC TGCGATCGAC
TACGTGAATT CCGTCGGCAA GGATCGCATT GCCGCTCACG AGCACGACCT GCTGACCTAT
GCGCAAGAGC GGCTGCGCGA GATCAACGCG TTGCGGATCA TCGGTACGGC CAAAGGCAAG
GGGCCGGTGA TCTCGTTCGA GATGAAGGGC GCGCATGCCC ACGATATCGC GACCGTGATC
GACCGGCAGG GCGTCGCGGT GCGCGCGGGC ACGCATTGCG TCATGCCGCT TTTGGAGAGA
TTCAACGTCA CTGCGACCTG CCGAGCCTCG TTTGGCATGT ACAATACGAG AGAAGAGGTC
GATCAGCTGG CACAGGCTCT GATCAAGGCG CGGGATTTGT TCGCATGA
 
Protein sequence
MSTHPAVANG TYDVARIRED FPALALQVYG KPLVYLDNAA SAQKPRAVLD RMTQAYTSEY 
ANVHRGLHYL ANAATEAYEG GRERVTRFLN ARRNEEIIFT RNATEAINLV ASSWGGPNIG
EGDEIVLSIM EHHSNIVPWH FLRERQGAVI KWAPVDDEGN FLIDEFEKLL TAKTKLVAIT
QMSNMLGTLV PVKEVVRIAH ARGIPVLIDG SQAAVHLAVD VQDIDCDFYV FTGHKLYGPT
GIGVLYGKYD RLAAMRPFNG GGEMIREVAR DWVTYGDPPH RFEAGTPMIV EAVGLGAAID
YVNSVGKDRI AAHEHDLLTY AQERLREINA LRIIGTAKGK GPVISFEMKG AHAHDIATVI
DRQGVAVRAG THCVMPLLER FNVTATCRAS FGMYNTREEV DQLAQALIKA RDLFA