Gene TM1040_1938 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_1938 
Symbol 
ID4076889 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp2041410 
End bp2042597 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content60% 
IMG OID638007254 
ProductO-succinylhomoserine sulfhydrylase 
Protein accessionYP_613933 
Protein GI99081779 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0626] Cystathionine beta-lyases/cystathionine gamma-synthases 
TIGRFAM ID[TIGR01325] O-succinylhomoserine sulfhydrylase 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.168999 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAGAGA GCTGGAACAA ACGCACCAAA CTGGTCCATG GCGGCACCCG ACGCAGCCAG 
TACAACGAGG TCAGCGAAGC GATCTTTCTG ACCCAGGGGT TTGTGTACGA CAGTGCAGAG
CAGGCGGAGG CCCGGTTCAT CGAGACCGGC GCGGATGAAT TCATCTACGC CCGCTATGGC
AATCCGACCG TGGCAATGTT TGAAGAGCGC ATCGCCCTTT TAGAAGGGGC CGAGGATGCC
TTTGCCACCG CATCGGGCAT GGCGGCGGTG AATGGCGCGC TCACGTCGAT CCTGAAAGCG
GGCGACCATG TGGTCTCTGC CAAGGCGCTC TTTGGGTCCT GTCTTTATAT TCTCGAAAAC
ATCCTGACCC GGTATGGGGT CGAGGTGACA TTTGTCGACG GCACGGATCT GGACCAGTGG
CGCGCTGCCG TGCGCCCGGA CACCAAGGCC GTGTTTTTTG AGAGCATGTC GAACCCAACG
CTGGAAGTCA TTGATATTGA AGGCGTTGCC GAGATCGCCC ATGCGGTCGG CGCGACCGTT
GTGGTCGATA ATGTGTTCTC TACTCCGGTG TTTTCCAATG CCATCGCGCA AGGCGCGGAT
GTGGTGGTGT ATTCGGCCAC CAAACATATC GACGGGCAGG GGCGTGCGCT TGGCGGCGTG
GTGCTTGGCA CCAAGGATTA CATTCGCGGC ACGCTCGAAC CCTACATGAA ACATACCGGT
GGCTCGCTGA GCCCGTTTCA TGCCTGGATG TTCGTCAAGG GGCTTGAGAC CATTGATCTG
CGCGTCAATG CTCAGGCTGC CAGCGCGCTC AAGATCGCCG AGGCCTTTGA GAGCCATCCA
GTCTTGGCTC GCACCATCTA TCCGGGCCTG AAAACCCACG CGCAGAATGC GCTGGTGCAG
CGCCAGCTTG GAGGCAAGGG TGGGACGGTG CTGTCGCTCG ACCTCAAGGG CGGCAAGGAG
GCGGCCTTCG CCTTTCTCAA TGCGCTCTCC ATTCCGGTGA TCTCCAATAA TCTGGGCGAT
GCCAAATCCA TTGCCACCCA TCCGGCCACC ACCACGCACC AACGTCTTTC CGAGGCGCAG
CGCGCAGAAC TCGGGATCAC CGATGGTCTG GTGCGGTTTT CGGTCGGGCT TGAGGATGCG
GATGATCTGA TCGCGGATCT GTCACAGGCG CTGCAAACGC TGGCGTGA
 
Protein sequence
MTESWNKRTK LVHGGTRRSQ YNEVSEAIFL TQGFVYDSAE QAEARFIETG ADEFIYARYG 
NPTVAMFEER IALLEGAEDA FATASGMAAV NGALTSILKA GDHVVSAKAL FGSCLYILEN
ILTRYGVEVT FVDGTDLDQW RAAVRPDTKA VFFESMSNPT LEVIDIEGVA EIAHAVGATV
VVDNVFSTPV FSNAIAQGAD VVVYSATKHI DGQGRALGGV VLGTKDYIRG TLEPYMKHTG
GSLSPFHAWM FVKGLETIDL RVNAQAASAL KIAEAFESHP VLARTIYPGL KTHAQNALVQ
RQLGGKGGTV LSLDLKGGKE AAFAFLNALS IPVISNNLGD AKSIATHPAT TTHQRLSEAQ
RAELGITDGL VRFSVGLEDA DDLIADLSQA LQTLA