Gene Pfl01_1903 details


Gene Information

Locus tag: Pfl01_1903
Symbol:
ID: 3716320
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pseudomonas fluorescens Pf0-1
Kingdom: Bacteria
Replicon accession: NC_007492
Strand:
Start bp: 2174331
End bp: 2175542
Gene Length: 1212 bp
Protein Length: 403 aa
Translation table: 11
GC content: 63%
IMG OID:
Product: O-succinylhomoserine sulfhydrylase
Protein accession: YP_347635
Protein GI: 77458130
COG category:
COG ID:
TIGRFAM ID:
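
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the record. It assumes the start and end coordinates are 1-based and inclusive, and that the coding sequence ends in a stop codon that is not counted in the protein length. The variable names and the helper `record_is_consistent` are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2174331
end_bp = 2175542
gene_length_bp = 1212
protein_length_aa = 403

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS includes a terminal stop codon that is not part of the protein.
codons = gene_length_bp // 3


def record_is_consistent() -> bool:
    """Return True if the coordinates, gene length and protein length agree."""
    return (
        span == gene_length_bp
        and gene_length_bp % 3 == 0
        and codons - 1 == protein_length_aa
    )


print(span, codons, record_is_consistent())
```

Under these assumptions the span is 1212 bp, which gives 404 codons: 403 amino acids plus one stop codon, matching the listed protein length.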


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTCAGG AATGGGATGC CGGTCGGCTG GACAGCGACC TCGAAGGCGT AGCGTTCGAT 
ACCCTGGCCG TACGTGCCGG TCAGCACCGT ACGCCGGAAG GCGAGCACGG TGATCCGATG
TTCTTCACTT CCAGCTATGT GTTTCGCACC GCCGCCGATG CGGCTGCGCG GTTTGCCGGG
GAAGTGCCGG GCAACGTCTA CTCGCGTTAC ACCAACCCGA CCGTTCGCGC CTTCGAAGAG
CGCATTGCCG CGCTGGAAAG CGCCGAGCAG GCGGTGGCCA CGGCCACCGG CATGGCCGCG
ATCATGGCTG TGGTGATGAG CCTGTGCAGC GCCGGCGACC ATGTGCTGGT GTCGCGCAGT
GTGTTCGGTT CGACCATCAG CCTGTTCGAG AAGTATTTCA AGCGTTTTGG CGTGGAAGTC
GATTACGTGC CGCTGGCCGA TCTGTCGGCC TGGGGCGCAG CGATCAAGTC CAACACCAAA
TTGCTGTTCG TCGAGTCGCC GTCCAATCCG TTGGCTGAAC TGGTGGATAT CACCGCTCTG
TCGGAAATCG CGCACGCCAA GGGTGCGATG CTGGTAGTCG ACAACTGCTT CTGCACGCCT
GCCTTGCAGC AGCCGCTGAA GCTGGGCGCA GACATCGTTG TGCATTCGGC CACCAAGTTC
ATCGATGGCC AGGGCCGTTG CATGGGCGGC GTGGTTGCCG GTCGCAGCGA ACAGATGAAA
GAAGTCGTCG GCTTCCTGCG TACCGCCGGG CCGACCCTCA GCCCGTTCAA CGCCTGGATC
TTCCTCAAGG GGCTGGAAAC CCTGAACCTG CGGATGAAGG CGCACTGCGC CAATGCCCAA
CAACTGGCTG AGTGGCTGGA GCAGCAGGAT GGCATCGAGA AGGTGCATTA CGCCGGTCTC
AAGAGCCATC CGCAGCACGA ACTGGCTCAG CGTCAGCAGA AGGGCTTCGG CGCGGTGGTG
AGCTTTGAGG TCAAGGGCGG CAAAGAGGGT GCCTGGCGTT TCATCGATGC GACCCGTTTG
ATCTCGATTA CTGCCAACCT CGGTGACAGC AAAACCACCA TCACCCACCC GAGCACCACG
TCCCATGGCC GTCTGGCGCC GCAGGAGCGT GAGGCAGCCG GCATTCGTGA CAGCCTGATC
CGCATCGCGG TCGGTCTGGA AGACGTGGCT GACCTGCAAG CCGACCTGTC GCGCGGTCTG
GCGGCGTTGT GA
 
Protein sequence
MSQEWDAGRL DSDLEGVAFD TLAVRAGQHR TPEGEHGDPM FFTSSYVFRT AADAAARFAG 
EVPGNVYSRY TNPTVRAFEE RIAALESAEQ AVATATGMAA IMAVVMSLCS AGDHVLVSRS
VFGSTISLFE KYFKRFGVEV DYVPLADLSA WGAAIKSNTK LLFVESPSNP LAELVDITAL
SEIAHAKGAM LVVDNCFCTP ALQQPLKLGA DIVVHSATKF IDGQGRCMGG VVAGRSEQMK
EVVGFLRTAG PTLSPFNAWI FLKGLETLNL RMKAHCANAQ QLAEWLEQQD GIEKVHYAGL
KSHPQHELAQ RQQKGFGAVV SFEVKGGKEG AWRFIDATRL ISITANLGDS KTTITHPSTT
SHGRLAPQER EAAGIRDSLI RIAVGLEDVA DLQADLSRGL AAL