Gene TM1040_1249 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_1249 
Symbol 
ID4076364 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp1342754 
End bp1343974 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content54% 
IMG OID638006557 
Productcysteine desulfurase 
Protein accessionYP_613244 
Protein GI99081090 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0520] Selenocysteine lyase 
TIGRFAM ID[TIGR01979] cysteine desulfurases, SufS subfamily 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.491768 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTATGATG TCGAAAAAAT CCGCTCTGAC TTTCCGATCC TGTCACGCGA AGTGAACGGA 
AAGCCGCTGG TCTATCTCGA TAACGGGGCC TCTGCGCAGA AACCTCAGGT TGTGATCGAT
GCCGTGACCA AGGCATACGC GGAAGAATAT TCAAATGTTC ACCGTGGCTT GCATTATCTT
TCTAATCTCG CGACTGAGAA ATACGAAAGC GTACGCGGCA TCATCGCAAA GTTTTTGAAC
GCGGGTCACG AAGATCACAT CGTGCTGAAT TCCGGAACCA CAGAGGGTAT CAACCTCGTG
GCGTATTCCT GGGCCATGCC GCGGATGGAA GCCGGCGATG AGATCATCCT CTCCGTCATG
GAGCACCATG CGAACATCGT GCCCTGGCAT TTCCTGCGCG AACGCCAAGG TGTCGTCATC
AAATGGATCG ACACGGACGC GGACGGAAGC CTGGACCCGC AAAAGGTTCT GGACGCCATT
ACACCAAAAA CCAAGCTGAT CGCAGTCACC CAGTGTTCCA ATGTCTTGGG AACCGTTGTT
GATGTAAAAT CCATCACAAA AGGCGCGCAT GACAAAGGCG TGCCAGTGCT TGTCGACGGC
TCTCAGGGCG CGGTTCACAT GCCCGTGGAT GTGCAGGATC TCGACTGTGA TTTCTATGCC
GTCACCGGGC ACAAGCTTTA TGGTCCGTCT GGGTCCGGCG CGATCTATAT CAAGCCGGAG
CGTATGGCCG AGATGCGTCC GTTTATTGGC GGCGGCGATA TGATCAAAGA AGTGTCCAAG
GATCAGGTGA TCTACAACGA TCCGCCGATG AAGTTCGAGG CTGGTACGCC AGGGATCGTG
CAGACGATCG GTTTCGGCGT CGCGCTCGAA TATATGATGG AGATCGGAAT GGCAGAAATT
GCCGCCCATG AGGCCGATCT GCGAGACTAT GCATCGGAGC GTTTCAAGGG GTTGAACTGG
TTGAATATTC AGGGCCATGC TCCTGGAAAA GCAGCGATAT TCAGTCTGAC CCTTGAGGGC
GCGGCACATG CGCATGACAT TTCAACCATT CTCGACAAGA AAGGTGTCGC GGTGCGTGCC
GGGCATCATT GTGCGGGGCC TTTGATGGAT CATCTTGGCG TCTCTGCAAC TTGCCGCGCG
AGCTTTGGCA TGTACAACAC GCGTCCAGAG GTAGATACGT TGATTGAGGC GCTAGAACTC
GCACATGAGC TTTTTGGCTA G
 
Protein sequence
MYDVEKIRSD FPILSREVNG KPLVYLDNGA SAQKPQVVID AVTKAYAEEY SNVHRGLHYL 
SNLATEKYES VRGIIAKFLN AGHEDHIVLN SGTTEGINLV AYSWAMPRME AGDEIILSVM
EHHANIVPWH FLRERQGVVI KWIDTDADGS LDPQKVLDAI TPKTKLIAVT QCSNVLGTVV
DVKSITKGAH DKGVPVLVDG SQGAVHMPVD VQDLDCDFYA VTGHKLYGPS GSGAIYIKPE
RMAEMRPFIG GGDMIKEVSK DQVIYNDPPM KFEAGTPGIV QTIGFGVALE YMMEIGMAEI
AAHEADLRDY ASERFKGLNW LNIQGHAPGK AAIFSLTLEG AAHAHDISTI LDKKGVAVRA
GHHCAGPLMD HLGVSATCRA SFGMYNTRPE VDTLIEALEL AHELFG