Gene Avin_34140 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_34140 
SymbolmetZ 
ID7762309 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp3486396 
End bp3487607 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content68% 
IMG OID643806276 
ProductO-succinylhomoserine sulfhydrylase 
Protein accessionYP_002800538 
Protein GI226945465 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0626] Cystathionine beta-lyases/cystathionine gamma-synthases 
TIGRFAM ID[TIGR01325] O-succinylhomoserine sulfhydrylase 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGATCG AGTGGGATGC CGGGCGGCTG GACAGCGACC TGGAGGGTGC CGGCTTCGAC 
ACCCTGGCGG TGCGGGCCGG TCAGCGGCGC ACTCCCGAGG GCGAGCACGG CGAGGCCCTA
TTCATGACCT CCAGCTACGT GTTCCGCAGC GCTGCCGATG CCGCCGCCCG CTTTGCCGGC
GAGCAGCCCG GCAACGTCTA CTCCCGCTAC ACCAATCCCA CGGTGCGTAC CTTCGAGGAG
CGCATCGCCG CCCTCGAAGG GGCCGAGCAG GCGGTTGCCG CGGCCTCCGG CATGGGCGCC
ATCCTGGCGA TGGTGATGAG CCTGTGCAGC GCCGGCGACC ATGTGCTGGT GTCGCGCAGC
GTATTCGGTT CGACCATCAG CCTGTTCGAC AAGTATTTCA AGCGTTTCGG CATCGAGGTC
GACTATCCGC CCCTGGCGGA TCTGGAAGCC TGGGCCGCCG CCTGCAAGCC GAACACCCGG
CTGTTCGTCG TCGAGTCTCC ATCCAACCCC CTGGCCGAGT TGGTGGACAT CGCCGCGCTG
GCCGACATCG CCCATGCCCG TGGCGCCCTG CTGGCGGTGG ACAACTGCTT CTGCACGCCG
GCGTTGCAGA AGCCGTTGGC GCTCGGCGCC GATATCGTCA TCCATTCGGC GACCAAGTAC
ATCGACGGCC AGGGGCGTTG CCTCGGCGGC GTGGTGGCCG GCAGCGCCAA GCTGATGCAG
GAGGTGGTCG GCTTCCTGCG TACCGCCGGC CCGACGCTCA GCCCCTTCAA TGCCTGGCTG
TTCCTCAAGG GGCTGGAAAC CCTCCGGGTG CGCATGCAGG CGCACTGTGC CAGCGCCCAG
GCGCTGGCCG AGTGGCTGGA GCAGCAGCCC CAGGTGGTCA GGGTCCATTA CGCCGGTCTG
TCCAGCCATC CGCAGCACGA GTTGGCCAGG CGCCAGCAGA GCGGCTTCGG CGCGGTGGTC
AGTTTCGAGG TCCGGGGCGA CAAGGCGGCC GCCTGGCGGG TCATCGACAA TACCCGGATG
ATCTCCATCA CCACCAACCT GGGCGATACC AAAACCACCA TCGCCCATCC GGCCACCACT
TCCCACGGAC GCCTGACCCC GGAGGCGCGG GCGGCGGCCG GGATCAGTGA CAGCCTGATC
CGTGTGGCGG TCGGCCTGGA GGACATCGAA GACATCAAGG CCGATCTTGC CCGAGGGTTG
TCCGCACTGT GA
 
Protein sequence
MTIEWDAGRL DSDLEGAGFD TLAVRAGQRR TPEGEHGEAL FMTSSYVFRS AADAAARFAG 
EQPGNVYSRY TNPTVRTFEE RIAALEGAEQ AVAAASGMGA ILAMVMSLCS AGDHVLVSRS
VFGSTISLFD KYFKRFGIEV DYPPLADLEA WAAACKPNTR LFVVESPSNP LAELVDIAAL
ADIAHARGAL LAVDNCFCTP ALQKPLALGA DIVIHSATKY IDGQGRCLGG VVAGSAKLMQ
EVVGFLRTAG PTLSPFNAWL FLKGLETLRV RMQAHCASAQ ALAEWLEQQP QVVRVHYAGL
SSHPQHELAR RQQSGFGAVV SFEVRGDKAA AWRVIDNTRM ISITTNLGDT KTTIAHPATT
SHGRLTPEAR AAAGISDSLI RVAVGLEDIE DIKADLARGL SAL