Gene PP_2001 details


Gene Information

Locus tag: PP_2001
Symbol: metZ
ID: 1043012
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pseudomonas putida KT2440
Kingdom: Bacteria
Replicon accession: NC_002947
Strand:
Start bp: 2269195
End bp: 2270406
Gene Length: 1212 bp
Protein Length: 403 aa
Translation table: 11
GC content: 63%
IMG OID: 637145411
Product: O-succinylhomoserine sulfhydrylase
Protein accession: NP_744151
Protein GI: 26988726
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0626] Cystathionine beta-lyases/cystathionine gamma-synthases
TIGRFAM ID: [TIGR01325] O-succinylhomoserine sulfhydrylase
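
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the variable names are invented for this example, and it assumes the start and end positions are 1-based and inclusive, which the record does not state explicitly.

```python
# Values copied from the record above; variable names are illustrative.
start_bp = 2269195
end_bp = 2270406
gene_length_bp = 1212
protein_length_aa = 403

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A CDS of N bases holds N // 3 codons; the final codon is the stop codon
# and does not contribute an amino acid.
codons, remainder = divmod(gene_length_bp, 3)
coding_codons = codons - 1

print(span == gene_length_bp)              # True
print(remainder == 0)                      # True
print(coding_codons == protein_length_aa)  # True
```

Under that assumption the three fields agree: 1212 bp gives 404 codons, and 404 minus the stop codon gives the listed 403 aa.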


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGGATC AATGGGATGC CGGGCGACTG GACAGTGACC TCGAGGGTGT CGGTTTCGAC 
ACCCTGGCGG TACGCGCTGG CCAAAACCGT ACCCCGGAAG GCGAGCACAG CGAAGCGCTG
TTCCTGACCT CCAGCTATGT GTTCCGCACG GCAGCCGATG CTGCTGCGCG CTTTGCCGGC
GAAACGCCGG GCAACGTCTA CTCGCGCTAC ACCAACCCGT CGGTGCGTGC CTTCGAGGAG
CGCCTGGCGG CCATGGAAGG TGCCGAACAG GCCGTGGGTA CGTCCACCGG CATGGCGGCG
ATCCTGGCCG TGGTCATGTC GCTGTGCAGC GCCGGTGACC ATGTGCTGGT GTCGCAGAGC
GTATTCGGCT CCACCATCAG CCTGTTCGAG AAGTACTTCA AGCGCTTTGG TGTAGAAGTG
GACTACGTGC CACTGGTCGA CCTCACCGGT TGGGAAAAGG CCATCAAGGC CAACACCAAG
CTGCTGATCG TCGAATCGCC CTCCAACCCG CTGGCCGAGT TGGTCGATAT CACCGCGCTC
AGCGAAATCG CCCATGCTCA GGGTGCCATG CTGGTGGTGG ACAACTGTTT CAGTACCCCG
GCGTTGCAGC AGCCGCTGAA GCTGGGTGCC GACATTGTGT TCCACTCGGC CACCAAGTTC
ATCGACGGCC AGGGCCGCTG CATGGGCGGT GTGGTTGCCG GCCGTACTGA GCAAATGAAA
GAAGTGGTGG GTTTCCTGCG AACCGCAGGT CCAACCCTCA GCCCGTTCAA CGCCTGGATC
TTCACCAAGG GCCTGGAAAC GCTGCGCCTG CGTATGCGTG CGCACTGCGA AAGCGCTCAG
GCCCTGGCCG AATGGCTGGA GCAGCAGGAC GGCGTGGAGA AGGTGCATTA CGCCGGCCTG
CCCAGCCACC CGCAGCACGA ACTGGCCAAG CGCCAGATGA GCGGTTTTGG TGCAGTGGTC
AGCTTTGAAG TCAAGGGGGG CAAAGAGGGC GCCTGGCGTT TCATCGACGC TACCCGAGTG
ATTTCCATCA CGACCAACCT GGGTGACAGC AAAACCACCA TCGCTCATCC GGCGACCACC
TCACACGGTC GTCTGTCGCC GCAGGAGCGT GAAGCGGCTG GTATCCGCGA CAGCCTGATC
CGTGTTGCCG TGGGTCTGGA AGACGTGGCT GACCTGCAGG CTGACCTGGC GCGCGGGCTG
GCGGCATTGT GA
 
Protein sequence
MTDQWDAGRL DSDLEGVGFD TLAVRAGQNR TPEGEHSEAL FLTSSYVFRT AADAAARFAG 
ETPGNVYSRY TNPSVRAFEE RLAAMEGAEQ AVGTSTGMAA ILAVVMSLCS AGDHVLVSQS
VFGSTISLFE KYFKRFGVEV DYVPLVDLTG WEKAIKANTK LLIVESPSNP LAELVDITAL
SEIAHAQGAM LVVDNCFSTP ALQQPLKLGA DIVFHSATKF IDGQGRCMGG VVAGRTEQMK
EVVGFLRTAG PTLSPFNAWI FTKGLETLRL RMRAHCESAQ ALAEWLEQQD GVEKVHYAGL
PSHPQHELAK RQMSGFGAVV SFEVKGGKEG AWRFIDATRV ISITTNLGDS KTTIAHPATT
SHGRLSPQER EAAGIRDSLI RVAVGLEDVA DLQADLARGL AAL
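
The sequences above are printed in blocks of ten characters, so the spacing has to be stripped before they can be analysed. The following is a minimal sketch, not a tool associated with this record, of how the gene sequence could be cleaned, translated with translation table 11, and measured for GC content. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical. The amino-acid string is the standard codon assignment, which table 11 shares with the standard code; alternative start codons are not handled here.

```python
from itertools import product

# Codon assignments in TCAG order (shared by the standard code and table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First row of the gene sequence listed above.
first_row = "ATGACGGATC AATGGGATGC CGGGCGACTG GACAGTGACC TCGAGGGTGT CGGTTTCGAC"
print(translate(first_row))  # MTDQWDAGRLDSDLEGVGFD
```

The output matches the first twenty residues of the protein sequence listed above. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the Protein Length and GC content fields.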