Gene Sde_2038 details


Gene Information

Locus tag: Sde_2038
Symbol:
ID: 3967397
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Saccharophagus degradans 2-40
Kingdom: Bacteria
Replicon accession: NC_007912
Strand:
Start bp: 2565220
End bp: 2566413
Gene Length: 1194 bp
Protein Length: 397 aa
Translation table: 11
GC content: 47%
IMG OID: 637921126
Product: O-succinylhomoserine sulfhydrylase
Protein accession: YP_527510
Protein GI: 90021683
COG category:
COG ID:
TIGRFAM ID:
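
The coordinates and lengths in this record are consistent with one another. A minimal Python check, assuming that Start bp and End bp are 1-based inclusive positions on the replicon and that the CDS ends in a single stop codon (both are assumptions; the record does not state them):

```python
# Values copied from the gene record.
start_bp = 2565220
end_bp = 2566413

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under those assumptions the script prints 1194 and 397, matching the Gene Length and Protein Length fields.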


Plasmid Coverage information

Num covering plasmid clones: 35
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCTTCGC CCCTGCAAAC CTTATTTTTA CAGCTACACC AAGAGCTTGT TAGCAGTAAA 
TTGCTGTGGC AGCCGTCGCT TTTTGTAAAC CCATCCCCTT CTTGGCTAAA GCTCTACCCG
GCTTTAACAG AGCAATGCCT AAGCTTAAGC GAAAACCAGC TCACCGAGCT AGAGCAACAA
CCAGAGCTAA TCCCCCTGTG GCTTAGCGCA CACTTGCCAC ACATTACACA GCTTACTGCT
CTAACCGAAC TAGAGATGAA CGCAAATTCA GCCACAGCGC TGCCCAAGCA GTGGGATGCT
GGCATACCCG GCAGAAAAGC CAAGCAAATA AAAGCGTTTG CCGAAGCGTT CAAACCAGAG
GGCGACATAC TTGTAGATTG GTGCAGTGGC AAAGCGCACC TAGGCCGAAC CCTTTCTGCG
CTATATGCAG CCCCCTGCCT AGCCTTGGAA TATAACCCAA CCCTGTGCCA GCAAGGCAAT
GTGTTAGCGC GTAAACGCAA CCTTAACACG CACTTTGTGG CAACCGATGT ACTTAAGCTA
GGTGTAGCCT TACCTGCAAG CTCACATATT TGCGCCTTAC ACGCCTGCGG CGACCTGCAT
AGAAGCTTGG TAGCCCAAGC TACAAGCCAC CCTGTTGCCG CCCTAACGTT TGCACCGTGC
TGCTACCCAT TATGGCTAGA CGATACTTAC ACACCGCTTT CGAAAACTGC ACTTAAACAC
AATTTGCAGC TAGACCGCAC AGATTTGCAT TTGGCTGTAC AAGAGTGCGT TACAGCTACA
CCAAGAGAGC AAAGCCTTAG CCATAAACAA GCAACGTGGC GGTTAGGGTT TGATTGTTTA
CAGCGAGACA TAAGACAAAG TGATAGCTAT TTAAACACTC CATCTTTGCC CCTTTCCGCC
CTTAATAATG GCTTCGAGCA TTATTGCCGT ACATTAGCTG CATTAAAAAA TTTAACGCTA
CCGCAAAATA TTCAATGGCA GCATTATGAA AAAGCAGGCG AAGTGCGCTG GGCAAAATTA
CGCAGACTAC AACTAGTTCG CCATGCATAT AGGCGAGCGT TAGAGTTATG GCTGGTGTTA
GATTTAGCAC TGCGCCTCGA AGAAGCGAAT TACACTGTAG TCATTAATCA GTTTTGCGAC
CGGGCTTTAA CACCCAGAAA TATTATTATA AATGCTAAAT TAAACACGCA TTAA
 
Protein sequence
MPSPLQTLFL QLHQELVSSK LLWQPSLFVN PSPSWLKLYP ALTEQCLSLS ENQLTELEQQ 
PELIPLWLSA HLPHITQLTA LTELEMNANS ATALPKQWDA GIPGRKAKQI KAFAEAFKPE
GDILVDWCSG KAHLGRTLSA LYAAPCLALE YNPTLCQQGN VLARKRNLNT HFVATDVLKL
GVALPASSHI CALHACGDLH RSLVAQATSH PVAALTFAPC CYPLWLDDTY TPLSKTALKH
NLQLDRTDLH LAVQECVTAT PREQSLSHKQ ATWRLGFDCL QRDIRQSDSY LNTPSLPLSA
LNNGFEHYCR TLAALKNLTL PQNIQWQHYE KAGEVRWAKL RRLQLVRHAY RRALELWLVL
DLALRLEEAN YTVVINQFCD RALTPRNIII NAKLNTH
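
The gene and protein sequences can be cross-checked against each other. The following is a rough Python sketch, not the database's own method: `clean`, `translate`, and `gc_percent` are hypothetical helper names introduced here for illustration. It relies on the fact that NCBI translation table 11 assigns the same amino acid to each codon as the standard genetic code (the two tables differ only in permitted start codons), and it assumes the sequence contains only the letters A, C, G, and T.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order: first base varies slowest, third fastest.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean(seq):
    """Remove the spaces and line breaks used for layout and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna):
    """Return the percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence, copied from the record.
print(translate("ATGCCTTCGC CCCTGCAAAC CTTATTTTTA"))
```

Run on the first 30 bases, this prints MPSPLQTLFL, the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein sequence and the GC content field.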