Gene Avin_39100 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_39100 
Symbol 
ID7762799 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp3961166 
End bp3962515 
Gene Length1350 bp 
Protein Length449 aa 
Translation table11 
GC content72% 
IMG OID643806773 
Productcysteine desulfurase, sufS 
Protein accessionYP_002801025 
Protein GI226945952 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0520] Selenocysteine lyase 
TIGRFAM ID[TIGR01979] cysteine desulfurases, SufS subfamily 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0233253 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGGTGGAAA ACGCTGCGCG GTTTTCCACC CTACGGATCG CGGACGAATC CGTTTCTTAC 
AAAAACGGTG TTTACAACGA ATCGCATGAC GACAAACGTC GCCGCCCCGA AGGCCTCCCC
ATCATTCACG ACGACGCACC GACCATGTCC CTTCCCTCCC CCTGGCGGGC CGACTTCCCG
GCCTTCGCCG CCTTCGCCCG GGACGGCCAG ACCTACCTGG ACAGTGCCGC CACCGCGCAG
AAGCCACAGG CGATGCTGGA CGCCCTGCTC GGCCACTACG CCGGCGGCGC GGCCAACGTG
CACCGCGCCC AGCACTGCCC CGGCGAGCGC GCCACGCGGG CCTTCGAGGC GGCGCGGGCG
AAGGTCGCCG CCTGGCTGAA CGCCGGAAGC GCCGAGCGGA TCGTCTTCAC CCGCGGCGCC
ACCGAGTCCC TCAACCTGCT CGCCTACGGA CTCGAACACC TGTTCGCGCC GGGCGACACG
ATCGCCGTCG GCGCCCTGGA GCACCACGCC AACCTGCTGC CCTGGCAACG CCTGGCGCAG
CGCCGCGGTC TCGACCTGGT GGTGCTGCCG CTGGATGGCG CCGGCGACAT CGACCTGGAG
CAGGCCGAGC GGCTGATCGG CCCGCGCACC CGGCTGCTCG CGGTCAGTCA GTTGTCCAAC
GTGCTCGGCC GCTGGCAGCC GCTCGACGCA CTGCTCGCCC TGGCCAGTTC CCGGGGCGCG
CTGACGGTGG TCGACGGCGC CCAGGGCGCG GTCCACGGTC GCCACGACCT GCGGTCGCTG
GCCTGCGACT TCTACGTGTT CTCCGCGCAC AAGCTCTACG GCCCGGACGG CCTCGGCGTG
CTCCACGGCC GTCCGCAGGC CCTGGAACGC CTGCAGCACT GGCAGTTCGG CGGCGAGATG
GTGCAGCAGG CCGACTACCA CGAGGCGCGC TTTCGCCCGG CGCCGCTCGG CTTCGAGGCC
GGCACCCCGG CGATCGGCGC GGCCATCGCC TTCGGCGCCA CCCTGGACTA CCTGCAGAGC
CTGGACGGCA CGGCGGTCGC CGCCCACGAG GCGGCTCTGC ACCGGCGCCT GCTCGCCGGA
CTGCGCGCTG TCGCCGGTCT GCGCCTGCTC GCCGAGCCGC ACAACGCGCT GGCCAGTTTC
GTGATCGAAG GGGTGCACAA CGCCGATCTG GCCCATCTGC TCGCCGAACA GGGCATCGCC
GTGCGCGCCG GCCAGCACTG CGCCATGCCG CTGCTGCGCC GCCTCGGCCT GCCCGGCGCG
CTGCGCGTGT CGCTCGGCCT GTACAGCGAC GGGAACGATC TGGAGCGCTT CTTCGCCGCC
CTCGAACGCG CCCTGGAACT GCTGCGATGA
 
Protein sequence
MVENAARFST LRIADESVSY KNGVYNESHD DKRRRPEGLP IIHDDAPTMS LPSPWRADFP 
AFAAFARDGQ TYLDSAATAQ KPQAMLDALL GHYAGGAANV HRAQHCPGER ATRAFEAARA
KVAAWLNAGS AERIVFTRGA TESLNLLAYG LEHLFAPGDT IAVGALEHHA NLLPWQRLAQ
RRGLDLVVLP LDGAGDIDLE QAERLIGPRT RLLAVSQLSN VLGRWQPLDA LLALASSRGA
LTVVDGAQGA VHGRHDLRSL ACDFYVFSAH KLYGPDGLGV LHGRPQALER LQHWQFGGEM
VQQADYHEAR FRPAPLGFEA GTPAIGAAIA FGATLDYLQS LDGTAVAAHE AALHRRLLAG
LRAVAGLRLL AEPHNALASF VIEGVHNADL AHLLAEQGIA VRAGQHCAMP LLRRLGLPGA
LRVSLGLYSD GNDLERFFAA LERALELLR