Gene Daro_1950 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_1950 
Symbol 
ID3567879 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp2102758 
End bp2103975 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content57% 
IMG OID637680421 
Productcysteine desulfurase IscS 
Protein accessionYP_285166 
Protein GI71907579 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1104] Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes 
TIGRFAM ID[TIGR02006] cysteine desulfurase IscS
[TIGR03402] cysteine desulfurase NifS 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.00000000843977 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.404205 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGATGA AACTCCCCAT CTACCTGGAT TACTCGGCAA CCACCCCGGT CGACCCCCGT 
GTCGCCGAAA AAATGATTCC CTACCTCTGC GAGCATTTCG GCAATCCGGC GTCGCGTTCG
CACAGCTTTG GCTGGGTCGC CGATGCGGCG GTCGAAGAGG CCCGTGAGCA GGTTGCTGCG
CTGGTCAATG CAGATCCCAA GGAAATCGTC TGGACCTCCG GTGCGACCGA ATCCAACAAC
CTTGCCATCA AGGGCGCGGC CAATTTCTAT GCCAGCACCA AGGGCAAGCA CATCATCACG
GTCAAGACCG AGCACAAGGC CATTCTCGAT ACCGTGCGTG AAATGGAGCG CCAGGGTTTC
GAGGCGACTT ATCTTGACGT CAAGGAAGAC GGTCTGCTTG ATCTGGAAGT CTTCAAGGCG
GCCATTCGTC CGGATACCGT GCTCGCCTCG GTGATGTTCG TCAATAACGA AGTCGGTGTC
ATTCAGCCGA TCGCCGAACT CGGTGAAATC TGCCGCGAGA AGGGCATCAT CTTTCACGTC
GATGCTGCAC AGGCCACCGG CAAGGTCGAT ATCGATCTGA GCAAGCTGAA GGTCGATCTG
ATGAGCTTCT GCGCCCACAA GACCTATGGT CCGAAGGGTA TCGGCGCGCT GTACGTCCGC
CGCAAGCCGC GTATCCGTCT CGAAGCCCAG ATGCACGGCG GCGGTCATGA GCGCGGTTTC
CGCTCCGGCA CCTTGCCGAC CCATCAGATC GTCGGCATGG GCGAGTGCTT CCGTTTGGCC
AAGGAAGAAA TGGCTGAAGA GAACAAGCGC GTTGGTGCTC TGCGCGACAA ATTGCTGAAG
GGCTTGCAGG ATATCGAGGC CACTTTCGTC AATGGTGACC TGACGCAACG CGTGGCGCAC
AATCTCAACA TCAGCTTTGC CTATGTTGAG GGTGAGTCGA TGATCATGGC GATCAAGGAT
CTGGCGGTTT CGTCCGGTTC GGCCTGCACC TCGGCCAGCC TGGAACCTTC CTACGTGCTA
CGTGCCCTGG GGCGTGATGA TGAACTGGCT CACAGTTCCA TCCGTTTCAG CATCGGTCGC
TTTACGACAG AAGAAGAAAT TGACTATGCA ATCAAATTGT TGCATCAGAA AGTTGGTAAG
TTGCGCGAAC TTTCACCGCT GTGGGAGATG TACAAGGATG GCATCGATCT GAGCACCGTT
CAGTGGGCAG CGCACTAA
 
Protein sequence
MTMKLPIYLD YSATTPVDPR VAEKMIPYLC EHFGNPASRS HSFGWVADAA VEEAREQVAA 
LVNADPKEIV WTSGATESNN LAIKGAANFY ASTKGKHIIT VKTEHKAILD TVREMERQGF
EATYLDVKED GLLDLEVFKA AIRPDTVLAS VMFVNNEVGV IQPIAELGEI CREKGIIFHV
DAAQATGKVD IDLSKLKVDL MSFCAHKTYG PKGIGALYVR RKPRIRLEAQ MHGGGHERGF
RSGTLPTHQI VGMGECFRLA KEEMAEENKR VGALRDKLLK GLQDIEATFV NGDLTQRVAH
NLNISFAYVE GESMIMAIKD LAVSSGSACT SASLEPSYVL RALGRDDELA HSSIRFSIGR
FTTEEEIDYA IKLLHQKVGK LRELSPLWEM YKDGIDLSTV QWAAH