Gene Daro_2138 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_2138 
Symbol 
ID3569780 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp2305885 
End bp2306859 
Gene Length975 bp 
Protein Length324 aa 
Translation table11 
GC content63% 
IMG OID637680609 
Productcysteine synthase 
Protein accessionYP_285349 
Protein GI71907762 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0031] Cysteine synthase 
TIGRFAM ID[TIGR01136] cysteine synthases
[TIGR01139] cysteine synthase A 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.00000000166923 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000932984 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGTCCAAAT GGTTTGCAGA TAATTCACAG TCGATCGGCC GCACGCCGCT GATCAAGCTC 
AACCGTGTCA TCGACGGCGC CAAGGCTACC GTGCTCGCCA AGATCGAAGG TCGCAACCCG
GCCTATTCGG TCAAGTGCCG GATCGGCGCC GCCCTGATCA ATGATGCCGA GAAGCGTGGC
CTGCTCGGTC CGGGCAAGGA GCTGGTCGAG CCGACTTCCG GCAACACCGG CATCGCGCTG
GCTTTCGTCG CCGCCGCCAA GGGTATTCCG CTGACGCTGA CCATGCCCGA AACGATGAGT
ATCGAACGGC GCAAGCTGCT GACGGCTTTT GGTGCCAAGC TGGTGCTGAC CGAAGGCGCC
AAGGGCATGT CCGGCGCCAT CGCCAAGGCC GAGGAAATTG CCGCTTCCGA TGCCAAGTAC
GTGTTGTTGC AGCAGTTCAA GAACCCGGCC AATCCGGCCA TCCACGAACT GACCACCGGT
CCGGAAATCT GGGACGACAC CGATGGCGCC ATCGACATTC TGGTGTCCGG GGTCGGCACT
GGCGGCACGA TCACCGGTGT TTCGCGTTAC ATCAAGAACA CCAAGGGCAA GGCGATCCAG
TCGGTCGCCG TCGAGCCGAC CGCCAGCCCG GTGCTGACCC AGGCTCGTGC CGGCGAGCCA
ATCAAGCCCG GTCCGCACAA GATTCAGGGG ATTGGCGCCG GTTTCGTGCC GGCCGTGCTC
GATCTGTCGC TGCTCGATGC CGTTGAGCAA GTGTCTAATG AGGATGCCGT GCTTTACGCC
CGCCGCCTGG CCAAGGAAGA GGGCATCATC TCCGGGATTT CCAGCGGTGC TGCGGTTGCG
GCGGCCGCTC GTCTGGCCCG GATACCGGAA AATGCCGGCA AGACCATTGT CGCCATCCTG
CCTGACTCCG GCGAGCGTTA CCTCAGCTCC ATCCTGTTCG AAGGCTTGTT CAACGAAGCC
GGGCTGGCCG CATGA
 
Protein sequence
MSKWFADNSQ SIGRTPLIKL NRVIDGAKAT VLAKIEGRNP AYSVKCRIGA ALINDAEKRG 
LLGPGKELVE PTSGNTGIAL AFVAAAKGIP LTLTMPETMS IERRKLLTAF GAKLVLTEGA
KGMSGAIAKA EEIAASDAKY VLLQQFKNPA NPAIHELTTG PEIWDDTDGA IDILVSGVGT
GGTITGVSRY IKNTKGKAIQ SVAVEPTASP VLTQARAGEP IKPGPHKIQG IGAGFVPAVL
DLSLLDAVEQ VSNEDAVLYA RRLAKEEGII SGISSGAAVA AAARLARIPE NAGKTIVAIL
PDSGERYLSS ILFEGLFNEA GLAA