Gene Csal_2287 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_2287 
Symbol 
ID4026440 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp2575837 
End bp2576868 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content67% 
IMG OID637967491 
Productdihydrouridine synthase TIM-barrel protein nifR3 
Protein accessionYP_574336 
Protein GI92114408 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0042] tRNA-dihydrouridine synthase 
TIGRFAM ID[TIGR00737] putative TIM-barrel protein, nifR3 family 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCGACCC ACGTCTCATC TCAGGCGATG CCGTTGCCGC GAATCGGCGC CTACGCATTG 
TCCAGCCGAG CGATTCTCGC GCCCATGGCC GGCGTGACGG ATCGCCCGTT TCGTCAGTTG
TGCCGGGAGC TGGGCGCGGG GCTGGTGGTA TCCGAAATGG TGACGTCCGA CACGCGCCTG
TGGCATACCC GCAAATCGCG CCAGCGCCTC GACCACACCG GCGAGCCCGG CCCGCGTGCC
GTGCAGATCG CAGGCGGCGA TGCCGCGATG CTGGCCGAGG CCGCGCGCCT CAACGTTGCC
CAGGGCGCCG AGATCGTCGA CATCAACATG GGCTGCCCGG CCAAGAAGGT ATGCAACAAG
GCCGCCGGCT CGGCATTGTT GCGCGACGAA CGCCTGGTCG CGGAGATCCT CGAGGCGGTC
GTCGCCGCCG TGGATGTCCC GGTGACCCTG AAGATTCGAA CCGGCTGGTG TCCGCAAACC
CGCAATGGCG TACGGGTCGC CAAGCTTGCC GAGTCGGCGG GCATCCAGGC CCTTGCCGTG
CATGGGCGCA CGCGTGAGCA GCGCTATCGC GGCGAGGCCG AATACGACAC CATCGCCGCC
ATCAAGCAGG CGGTCTCGCT GCCGGTCTTC GCCAACGGCG ACATCGACGG CGCCGAGAAA
GCTGCCCGCG TCCTCGACTA CACTAAGGCG GATGCAGTGA TGATCGGCCG CGGCGCCCAG
GGCAATCCCT GGATCTTCCG CGAGATCGAT CACTACCTGC GTACCGGCGA CTGCCTGCCG
CGCCCGACGC CCGACGATAT CGCCACCCTG ATGCACCGTC ATCTCGAGGC ATTGCATGCC
TTCTACGGTG AGCACATGGG CGTGCGCATC GCGCGCAAGC ATGTCGGCTG GTATCTGGCG
ACGCAACCGC AAGCCGCGGC ACTACGCGCA CGCTTCAACG TACTGGAACA GCCCTCGGCC
CAACACCGTT TCGTGGATGC CTTGGCCCAC GACACGCTGG AACTGGCCTC AACTGGAAGC
AATGCAGCAT GA
 
Protein sequence
MPTHVSSQAM PLPRIGAYAL SSRAILAPMA GVTDRPFRQL CRELGAGLVV SEMVTSDTRL 
WHTRKSRQRL DHTGEPGPRA VQIAGGDAAM LAEAARLNVA QGAEIVDINM GCPAKKVCNK
AAGSALLRDE RLVAEILEAV VAAVDVPVTL KIRTGWCPQT RNGVRVAKLA ESAGIQALAV
HGRTREQRYR GEAEYDTIAA IKQAVSLPVF ANGDIDGAEK AARVLDYTKA DAVMIGRGAQ
GNPWIFREID HYLRTGDCLP RPTPDDIATL MHRHLEALHA FYGEHMGVRI ARKHVGWYLA
TQPQAAALRA RFNVLEQPSA QHRFVDALAH DTLELASTGS NAA